Coder Social home page Coder Social logo

ds-skills-final-london-ds-091018's Introduction

Introduction to Data Science: Final Project

For the final project, we will be building upon the midterm and conducting a full data science pipeline from start to finish. Your goal is to come up with a question that could be answered using your current toolset. You’ll combine your analytical and modelling skills with API calls to acquire your own dataset to answer your question. Pick a dataset that you can split into multiple groups to compare to one another, make observations and answer a larger question. See the rubric below for specific requirements. From there, you will practice a standard data science workflow, exploring the initial data, cleaning null values and transforming features as appropriate. (For example, you might have to create dummy variables from a categorical variable if you wish to use that information within a regression model.) Finally you will then apply a machine learning technique whether it be regression, classification or clustering in order to yield further insights and a useful model for analysis or predictive purposes.

Here's a rubric to guide you for the most important aspects regarding the project:

When choosing a project topic, thinking of the data available is mission critical; what data will you acquire? How will this help you answer a question of interest? What do you hope to find or do? We’ve investigated some of the foundations of API calls, but all vary from one to another and some OAuth patterns can become quite complicated. For this reason, we will support the following APIs for the final project:

  • Yelp
  • Open311
  • Open Movie Database

Using other APIs may be acceptable, but support may be limited. Talk to your instructor early and be prepared to choose an alternative option if you attempt to use another API.

There are also a number of flat data files available at many sources including:

  • U.S. Census
  • UK Data Service
  • Data.Gov.UK
  • Kaggle
  • Google’s [New] Dataset Search
  • Data.gov
  • NYC Open Data

Here’s another very comprehensive list for further ideas:
https://github.com/awesomedata/awesome-public-datasets

ds-skills-final-london-ds-091018's People

Contributors

mathymitchell avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.