Coder Social home page Coder Social logo

henrike-94 / predicting_bike_rental_demand Goto Github PK

View Code? Open in Web Editor NEW
3.0 2.0 0.0 107.1 MB

My first ai project as part of my take on the amazing online course "Introduction to Machine Learning for Coders" taught by Jeremy Howard. I will be contributing to the Kaggle competition "Bike Sharing Demand", aiming to predict bike rental demand depending on the weather.

Python 100.00%
anaconda3 fastai bike rental prediction-model random-forest kaggle-competition python3 first-project ai introduction-to-machine-learning

predicting_bike_rental_demand's People

Contributors

henrike-schwenn avatar henrikeschwenn avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

predicting_bike_rental_demand's Issues

README

  • Andere Readmes lesen
  • Bausteine sortieren
  • #67
  • Dateiformat wählen
  • 1. Entwurf
  • 1. Überarbeitung
  • Finale Überarbeitung
  • Übersetzen
  • Layout

Unit test TrainingSet

Tasks

  • Upgrade PyCharm to Python 3.10
  • Upgrade system to Python 3.10
  • Check installation error
  • Check requirements for unittest
  • Check if packages need to be reinstalled in Python 3.10
  • import unittest into TrainingSet.py
  • Run unittest on TrainingSet.py
  • Make a sketch from Getting Started With Testing in Python

Create dataframe "Training Set"

  • Download files from Kaggle
  • Create training script file and dataframe object "TrainingSet"
  • Decide: Methods in constructor?
  • - [x] Understand evaluation score: Root Mean Square Log Error
  • Set dependent variable "count"
  • Rename column "count" in train.csv -> Avoid Python to mistake it for function "count"
  • Replace values with their logs
  • Turn all values into numerical values
  • Display all columns with their datatypes
  • Clone package "fastai"
  • Install package "fastai"
  • Research encoding for python.exe
  • Try solutions suggested by PyCharm
  • Submit request to JetBrains Support
  • Try installing torch via powershell
  • Retry installing torch via powershell C:\Users\henri\OneDrive\Dokumente\Berufseinstieg\Sprachtechnologie\Predicting_Bike_Rental_Demand\venv\Scripts>
  • Change interpreter to User Current Version:- 3.9.5 (tags/v3.9.5:0a7dcbd, May 3 2021, 17:27:52) [MSC v.1928 64 bit (AMD64)] inside PyCharm
  • Research: Drop "rent_count" out of training set, yes or no?
  • Fix key error date
  • Split "date" using function add_datepart
  • Check for missing values
  • Split dataframe into training and validation set
  • Isolate dependent variable in validation set
  • Save dataframes and dependent variable into objects (feather)

Troubleshooting numpy import error

Error warning

import pandas

---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
<ipython-input-26-38d4b0363d82> in <module>
----> 1 import pandas

/home/henrike/anaconda3/lib/python3.8/site-packages/pandas/__init__.py in <module>
     15 if missing_dependencies:
     16     raise ImportError(
---> 17         "Unable to import required dependencies:\n" + "\n".join(missing_dependencies)
     18     )
     19 del hard_dependencies, dependency, missing_dependencies

ImportError: Unable to import required dependencies:
numpy: 

IMPORTANT: PLEASE READ THIS FOR ADVICE ON HOW TO SOLVE THIS ISSUE!

Importing the numpy C-extensions failed. This error can happen for
many reasons, often due to issues with your setup or how NumPy was
installed.

We have compiled some common reasons and troubleshooting tips at:

    https://numpy.org/devdocs/user/troubleshooting-importerror.html

Please note and check the following:

  * The Python version is: Python3.7 from "/snap/jupyter/6/bin/python"
  * The NumPy version is: "1.18.5"

and make sure that they are the versions you expect.
Please carefully study the documentation linked above for further help.

Original error was: No module named 'numpy.core._multiarray_umath'



Define class dataset csv

Planning

Creates a dataset object from a csv file

Needs:

  • os
  • pandas

Instance variables:

  • path to data file
  • name of data file

Methods:

  • get_dataset = os.chdir(path)
  • display_dataset = read_csv()

Update CsvObject

  • Decide: Methods in constructor?
  • Check pd.readcsv settings
  • Why does it return values as FALSE?
  • Does that mean the function checks whether the values are numerical (TRUE) or not (FALSE)?
  • Decide whether to keep this feature or not
  • Turn all values into numerical values
  • Save dataframe into object (feather)

Evaluate RF

To Do

  • Publish score on Kaggle and check ranking

Ersten Zyklus überarbeiten

  • Skript "TrainingSet" ausführen
  • Skript "Train" ausführen
  • Alle Dateien am richtigen Ort?
  • Max. Zeilenlänge 80
  • PEP8 Richtlinien gewährleisten
  • Auf ungenutzte Importe verzichten
  • Relative Pfade für Datensets
  • Richtige Nomenklatur: Nur noch "train" und "test
  • Einheitliche Varienschreibung
  • Einheitliche Kommentare: # für Programmfluss ,''' zur Dokumentation

Get IDE running on Android

To Do

  • Install Git
  • Try alternative to PyCharm
  • Install packages
  • numpy
  • pandas
  • scikit-learn
  • fastai
  • matplot
  • Pydroid Support?
  • Write to Google Support
  • Install PyCharm

Train Set

To Do

  • Create training file
  • Make a list of parameters to be measured
  • Also save validation set into file
  • Read files and define variables
  • Set up object RandomForestRegressor( ) and its methods fit( ), and print_score( )
  • Set up an alternative for print_score
  • Fix value error
  • Fix output problem: Execute print statements in console/move them to their respective definitions?
  • Check: After which line does the execution begin to slow down?
  • Return parameters
  • Select presentation format for GitHub
  • Move all visualisation statements to console?
  • Sketch out how the functions work: RandomForestRegressor( ), fit( ), and print_score( )

Feedback

  • Beta-Feedback / Präsentation?
  • #68
  • Birte
  • Sam

Texte Website

  • #69
  • Feedback Joao
  • 1. Entwurf
  • 1. Überarbeitung
  • DeepL Übersetzung
  • Mehrsprachigkeit einstellen

Re-install scikit-learn and fastai

Re-installing scikit-learn and fastai

PowerShell

  ERROR: Failed building wheel for scikit-learn
Failed to build scikit-learn
ERROR: Could not build wheels for scikit-learn, which is required to install pyproject.toml-based projects

Research error message

"scikit-learn python 3.10 windows"
Installing scikit-learn

Installing the latest release

Operating System Windows macOS Linux
Packager pip conda

Install the 64bit version of Python 3, for instance from https://www.python.org.
Then run:

pip install -U scikit-learn
In order to check your installation you can use

python -m pip show scikit-learn # to see which version and where scikit-learn is installed
python -m pip freeze # to see all packages installed in the active virtualenv
python -c "import sklearn; sklearn.show_versions()"

Note that in order to avoid potential conflicts with other packages it is strongly recommended to use a virtual environment (venv) or a conda environment.

Try latest version

pip install -U scikit-learn

distutils.errors.DistutilsPlatformError: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/

Originally posted by @henrike-94 in #19 (comment)

Get Code Reviews

To Do

  • Enlist code reviewers
  • Research code review - how to
  • Evaluate Eike's review
  • GitHub review
  • Add review to project presentation

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.