Coder Social home page Coder Social logo

predicting-dengue's Introduction

predicting-dengue

Alex Freeman - 9/19/17

DrivenData is currently hosting a competition to predict cases of Dengue Fever in San Juan, Puerto Rico and Iquitos, Peru. The competition can be found here: https://www.drivendata.org/competitions/44/dengai-predicting-disease-spread/

This repository includes: The dataset as provided by the competition website in the datasets folder, my personal submissions so far in the submissions folder. I have rewritten multiple of the models, so it is not possible to reproduce some of these submissions, a data dictionary - dengue data dictionary.txt, iPython Notebooks going through the data science pipeline. The notebooks follow logically along the numbered order. The unnumbered notebooks are extra modelling techniques:

  • 1 - Dengue cases data cleaning.ipynb - Cleaning, fill in missing values, etc.

  • 2 - pickle dengue data.ipynb - pickle cleaned data for easier loading in later notebooks.

  • 3 - Dengue cases EDA.ipynb - Explore and visualize the feature set of weather variables

  • 4 - Predict based on months only.ipynb - Find and plot the monthly trend

  • 5 - Predict residuals with rolling weather data.ipynb - Use the monthly trend to find residuals and see which weather features to use.

  • 6 - DENGUE - BEST MODEL.ipynb - The solution code for my best performing model.

Extra notebooks:

  • DENGUE - predict on log(cases).ipynb Take the log of the cases and repeat modelling process

  • Dengue ARIMA.ipynb - Try the ARIMA method

  • Predict outbreaks & low case classes.ipynb - Try classification models using models to predict if it is an 'outbreak' or a 'lull' in cases

  • Residuals from seasonality Model.ipynb - Remove the monthly trend from the cases and use weather features to predict residuals.

  • Dengue Time Series Models.ipynb - Try multiple different time-based models

  • Predict based on months and rolling or shifted data.ipynb - Create new features from rolling mean, rolling standard deviation and shifted weather data to predict the residuals.

  • XGBoost predictions.ipynb - Try the XGBoost model to predict the residuals.

predicting-dengue's People

Contributors

alexjf12 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.