DAT SF 12

Instructor: Alessandro Gagliardi
EiRs: Ramesh Sampath, Otto Stegmaier, Alex Chao
Classes: 6:30pm-9:30pm, Tuesdays and Thursdays, January 15 – March 31
Office Hours:
  • Alex Chao, 5:30 - 6:30 before class at GA
  • Otto Stegmaier, 9:30 - 10:00 after class at GA
  • Ramesh Sampath, 4:00 - 6:00 Saturdays, remote
  • Also available by appointment

Submit homework by posting it to your own GitHub repo, then post the URL and the folder where the homework lives here.


Tentative Course Outline

  1. Intro to Data Science, Relational Databases & SQL
  2. Getting started with IPython & Git
  3. APIs and semi-structured data
  4. IPython.parallel & StarCluster
  5. Hadoop Distributed File System and Spark
  6. Intro to ML: k-Nearest Neighbor Classification
  7. Clustering: Hierarchical and K-Means
  8. Probability, A/B Tests & Statistical Significance
  9. Multiple Linear Regression and ANOVA
  10. Logistic Regression and Generalized Linear Models
  11. Project Elevator Pitches
    • See Student Project Repos below
  12. Naïve Bayes, Cross Validation, ROC, AUC & Midterm Review - Part I
  13. Naïve Bayes, Cross Validation, ROC, AUC - Part II
  14. Principal Components Analysis
  15. Decision Trees and Forests
  16. Support Vector Machines
  17. Scaling Out
  18. Recommendation Systems
  19. Visualization
  20. Final Project Presentations (12 min. each)
  21. Final Project Presentations (12 min. each)
  22. Future Directions

Project Schedule

| Date | Due | Returned |
|---|---|---|
| 1/22 | Preliminary Project Proposals Due (3-4 sentences) | |
| 1/27 | Homework 1 | |
| 1/29 | | EiR Feedback on Project Proposals |
| 2/3 | | EiR Feedback on Homework 1 |
| 2/5 | Formal Proposals (including data and methods chosen) | |
| 2/10 | Homework 2 Assigned | |
| 2/12 | | EiR Feedback on Formal Proposals |
| 2/17 | Homework 2 Due | |
| 2/19 | Project Elevator Pitch in class (4 minutes each) | Project Live on GitHub |
| 2/24 | Homework 3 Assigned | |
| 2/26 | Peer Feedback of Projects | Peer Feedback on Project |
| 3/3 | Midterm Assessment Posted | |
| 3/12 | Midterm Assessment Due | |
| 3/17 | At least one working model | |
| 3/24-26 | Final Presentations (12 minutes each) | Midterm Graded |

Student Project Repos:

| Student | Repo |
|---|---|
| Ajay Anand | sryballin/GeneralAssembly-DS |
| Zachary Cousens | zfcousens/DAT_SF_12/tree/gh-pages/Project |
| Carmen Diaz Echauri | cde/? |
| Deepthi Duddempudi | DeepthiGA/Project |
| Vijay Duraipalam | coolcalguy/DAT_SF_12/tree/gh-pages/Project |
| Cheong-tseng Eng | ctteng/GA-Proj-GPSAnomalyDetection.git |
| David Feng | selwyth/neighborhood |
| Isabel Friedman | isabitz/whales |
| Dave Halvorson | git-halvorson/DAT_SF_12/tree/gh-pages/FinalProject |
| Alison Harmon | alharmon13/DAT_SF_12/tree/gh-pages/project |
| Markus Huber | mbhuber/USconsumers |
| Ryan Hughes | cryhughes/AVS-Kaggle |
| Tania Ibanez | positiveepsilon/GA_Project |
| Roxana Ordonez | rockyroxana/bike-share-forecast.git |
| Justin Peterson | justinrpeterson/? |
| April Song | khsong92/ga_ds |
| India Swearingen | iswearingen/DAT_SF_12/blob/gh-pages/Homework/Project-IS-load-data.ipynb |
| Bing Wang | bingbingboo/DAT_SF_12/blob/gh-pages/Homework/2014flightdatalab.ipynb |
| Jaime Williams | jawilliams3000/OaklandCrime |
| David Yerrington | dyerrington/Rapstats |
| Matt Jones | jonesmatt415/NCAA-Prediction-Project- |
