Coder Social home page Coder Social logo

trinhquocnguyen / machine-learning-specialization Goto Github PK

View Code? Open in Web Editor NEW

This project forked from rashikaranpuria/machine-learning-specialization

0.0 2.0 0.0 503.98 MB

Project work and Assignments for Machine learning specialization course on Coursera by University of washington

License: MIT License

Jupyter Notebook 99.97% Python 0.03%

machine-learning-specialization's Introduction

Machine Learning Specialization University of Washington

Coursera Assignment and Project of Machine learning specialization on coursera from University of washington.

Installation:

  • Install anaconda
  • Install graphlab
  • download files
  • python file.py
  • You can also run the file as ipynb for graphical results
  • import your dataset in place of default dataset

1. Machine learning: A case study approach:

In this, I learned about the various models in machine learning and used the pre-implemented models in graphlab to get an over view.

2. Regression:

Case Study - Predicting Housing Prices

In our first case study, predicting house prices, I created models that predicts a continuous value (price) from input features (square footage, number of bedrooms and bathrooms,etc).

In this course, I explored regularized linear regression models for the task of prediction and feature selection. I handled very large sets of features and select between models of various complexity. I also analyzed the impact of aspects of data such as outliers on your selected models and predictions. To fit these models, I implemented optimization algorithms that scale to large datasets.

Work done

  • Compare and contrast bias and variance when modeling data.
  • Estimate model parameters using optimization algorithms.
  • Tune parameters with cross validation.
  • Analyze the performance of the model.
  • Describe the notion of sparsity and how LASSO leads to sparse solutions.
  • Deploy methods to select between models.
  • Exploit the model to form predictions.
  • Build a regression model to predict prices using a housing dataset.

3. Classification:

Case Studies: Analyzing Sentiment & Loan Default Prediction

Data set:

  • Amazon review database consisting 183531 entries
  • Lending club data sub-set consisting of 122607 entries

In this case study on analyzing sentiment, I created models that predicted a class (positive/negative sentiment) from input features (text of the reviews, user profile information, etc). In this second case study for the specialization, loan default prediction, I tackled financial data, and predicted when a loan is likely to be risky or safe for the bank.

Work done

  • Handle binary and multiclass classification problems.
  • Implement a logistic regression model for large-scale classification.
  • Create a non-linear model using decision trees.
  • Improve the performance of any model using boosting.
  • Scale methods with stochastic gradient ascent.
  • Describe underlying decision boundaries.
  • Build a classification model to predict sentiment in a product review dataset.
  • Analyze financial data to predict loan defaults.
  • Use techniques for handling missing data.
  • Evaluate models using precision-recall metrics.

4. Clustering:

Case study: Finding Similar Documents

Data set: Wikipedia articles subset

This is the third case study in the specialization, finding similar documents, I examined similarity-based algorithms for retrieval. In this course, I also examined structured representations for describing the documents in the corpus, including clustering and mixed membership models, such as latent Dirichlet allocation (LDA). Implemented expectation maximization (EM) to learn the document clusterings, and see how to scale the methods using MapReduce.

Work done

  • Create a document retrieval system using k-nearest neighbors.
  • Identify various similarity metrics for text data.
  • Reduce computations in k-nearest neighbor search by using KD-trees.
  • Produce approximate nearest neighbors using locality sensitive hashing.
  • Cluster documents by topic using k-means.
  • Examine probabilistic clustering approaches using mixtures models.
  • Fit a mixture of Gaussian model using expectation maximization (EM).
  • Perform mixed membership modeling using latent Dirichlet allocation (LDA).
  • Describe the steps of a Gibbs sampler and how to use its output to draw inferences.
  • Compare and contrast initialization techniques for non-convex optimization

machine-learning-specialization's People

Contributors

rashikaranpuria avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.