Coder Social home page Coder Social logo

analytics's Introduction

Analytics

Repository of R code for the course "The Analytics Edge" offered by MIT via edX.

The course was completed in May, 2015.

Index:

  1. Introduction to Anaytics
  2. Linear Regression
  3. Logistic Regression
  4. Trees
  5. Text Analytics
  6. Clustering
  7. Visualization
  8. Linear Optimization (R script was written by the course's staff)
  9. Integer Optimization (no R code)

Some course materials can be found here

I have also added my scripts for the Kaggle In-Class competition. The competition consisted of predicting which NYT articles would be popular. The following files relate to the competition:

  1. NYTimesBlogTest.csv and NYTimesBlogTrain: the split datasets.
  2. Data Exploration.R: my initial analysis of the data.
  3. Models.R: script with a series of 20 models (not all of them were submitted).
  4. Submission_mod9_941.csv: the top model of the two I chose for evaluation, based on Random Forests. This model gave the 852nd position (out of 2923).
  5. Submission_mod17_123.csv: my best model, based on Generalized Boosting Regression, which would have given me the 123rd position (top 4%) if I had chosen it for evaluation. I ran the gbm models in a hurry, by just guessing the parameters instead of optimizing them, and yet I got a result superior to RF. I only worked 2 nigths on creating models for submission. The winners used optimized GBM and ensembles with RF, as well as additional feature engineering.

The instructions for the Kaggle competition can be found here

analytics's People

Contributors

geoninja avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.