wyattowalsh / nbadb

Data Extraction (from https://stats.nba.com) and Processing Scripts to Produce the NBA Database on Kaggle (https://kaggle.com/wyattowalsh/basketball)

Home Page: https://nba-db.readthedocs.io/

License: GNU General Public License v3.0

Python 20.76% Jupyter Notebook 74.22% Shell 0.59% Makefile 0.39% HTML 4.04%
database kaggle kaggle-dataset nba nba-data nba-statistics nba-stats-api python3 sqlite

nbadb's People

Contributors

wyattowalsh

nbadb's Issues

NBA dataset removed from Kaggle

The NBA dataset has been removed from Kaggle.

To Reproduce
kaggle datasets download --unzip -o -q -d wyattowalsh/basketball

Expected behavior
The dataset should download. Instead, the request fails with 403 - Forbidden.

name inconsistency

The GitHub project is named nba-db, but nearly everything within the project refers to nba_db. This leads to unintentionally broken references, such as the footer on Read the Docs:

https://github.com/wyattowalsh/nba-db/blob/6af8db949ebae8e1ee0e8245677ef24d079d1c1e/docs/_templates/footer-item-github.html#L4

The URL https://www.github.com/wyattowalsh/nba_db doesn't exist. The consistent solution would seem to be renaming the repository to nba_db, since that is the name referenced throughout, but I'm not sure whether that would break too many references outside of GitHub; it may be easier to just correct the footer. Awesome resource here, though!
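
A minimal sketch of how such broken links could be located, assuming the repository keeps the name nba-db. The function name find_mismatched_links and the sample footer string are hypothetical illustrations, not code from the project; the actual footer template may look different.

```python
import re

# Matches GitHub links under the wyattowalsh account and captures the repository name
REPO_LINK = re.compile(r"github\.com/wyattowalsh/([A-Za-z0-9_-]+)")


def find_mismatched_links(text, expected="nba-db"):
    """Return repository names in wyattowalsh GitHub links that differ from `expected`."""
    return [m.group(1) for m in REPO_LINK.finditer(text) if m.group(1) != expected]


# Hypothetical footer content, written only to illustrate the check
footer = '<a href="https://www.github.com/wyattowalsh/nba_db">GitHub</a>'
print(find_mismatched_links(footer))  # prints ['nba_db']
```

Running the same check over every file under docs/ would list each place that still points at the nonexistent nba_db repository.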

season_id 22017 missing within game table

season_id 22017 corresponds to the 2017-18 NBA season, and the game table does not appear to contain any observations from that season.

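library(readr)
library(dplyr)

# Read game.csv directly from the Kaggle archive, then count games per season
game <- read_csv(unz("basketball.zip", "csv/game.csv"))
game |> filter(season_id > 22014) |> group_by(season_id) |> tally()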
# A tibble: 7 × 2
  season_id     n
      <dbl> <int>
1     22015  1230
2     22016  1230
3     22018  1230
4     22019  1059
5     22020  1080
6     22021  1230
7     22022  1018
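
The same check can be sketched in Python with only the standard library. This is an illustration under stated assumptions, not part of the project: the helper names season_counts and missing_season_ids are hypothetical, the archive layout (csv/game.csv inside basketball.zip) is taken from the R call above, and the gap detection assumes consecutive seasons have consecutive season_id values, as the regular-season ids in the output above do.

```python
import csv
import io
import zipfile
from collections import Counter


def season_counts(zip_path, member="csv/game.csv", min_season_id=22015):
    """Count rows per season_id in game.csv inside the downloaded archive."""
    counts = Counter()
    with zipfile.ZipFile(zip_path) as archive:
        with archive.open(member) as raw:
            reader = csv.DictReader(io.TextIOWrapper(raw, encoding="utf-8"))
            for row in reader:
                if not row.get("season_id"):
                    continue  # skip rows with no season_id value
                season_id = int(float(row["season_id"]))
                if season_id >= min_season_id:
                    counts[season_id] += 1
    return dict(sorted(counts.items()))


def missing_season_ids(counts):
    """Return season_ids absent between the smallest and largest observed id."""
    if not counts:
        return []
    present = set(counts)
    return [s for s in range(min(present), max(present) + 1) if s not in present]


# Counts copied from the tibble in this report
reported = {22015: 1230, 22016: 1230, 22018: 1230, 22019: 1059,
            22020: 1080, 22021: 1230, 22022: 1018}
print(missing_season_ids(reported))  # prints [22017]
```

With the archive on disk, missing_season_ids(season_counts("basketball.zip")) would run the check against the actual file.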

Issue with daily() update function

Describe the bug
I've been trying to run the daily() function to update the dataset for my data science project, and I keep running into errors from the get_proxies function.

To Reproduce
Steps to reproduce the behavior:

  1. Open Jupyter Notebooks via poetry

  2. Run from nba_db.update import daily

  3. Call daily()

  4. Encounter the errors below
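
Since the errors in this report are only available as screenshots, a small wrapper that captures the traceback as text can make such failures easier to share. The sketch below is an assumption-laden illustration: run_and_capture and failing_update are hypothetical names, and failing_update merely stands in for daily(), whose real failure mode is not reproduced here.

```python
import traceback


def run_and_capture(update_fn):
    """Call update_fn(); return (ok, report), where report is the traceback text on failure."""
    try:
        update_fn()
    except Exception:
        return False, traceback.format_exc()
    return True, ""


def failing_update():
    # Hypothetical stand-in for daily(); the message is invented for illustration
    raise ConnectionError("no working proxy found")


ok, report = run_and_capture(failing_update)
print(ok)                       # prints False
print(report.splitlines()[-1])  # prints ConnectionError: no working proxy found
```

In the notebook, the stand-in would be replaced by the real function: after from nba_db.update import daily, call run_and_capture(daily) and paste the returned report into the issue.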

Expected behavior
I expected the dataset to update with the games played since its last update 8 months ago.

Screenshots
Screenshot 2024-03-03 at 9 09 54 PM
Screenshot 2024-03-03 at 9 11 37 PM

Desktop:

  • OS: macOS Sonoma 14.2.1
  • Browser: Safari
  • Version: 17.2.1

Additional context
I'm trying to update the dataset so I can use it for my intro data science final project. Any help would be greatly appreciated!

Adjusting dependencies to JupySQL

I think it would be beneficial to use JupySQL in this repo, since we've added many interactive features on top of ipython-sql. JupySQL is backward compatible, so the switch only requires changing the dependency.
