Coder Social home page Coder Social logo

Regression, Decision Tree and SVM

Applied Machine Learning Project utilized data from 512 different species of mushrooms to classify them as poisonous or edible based on their characteristics like cap length, cap shape, color , stem length etc. Data was cleaned, visualized and 12 different models were tested to select the best performing models. It was implemented by a group of four students. I executed Logistic Regression, Polynomial Logistic Regression at degree 3, Decision Trees and Support Vector Machine. It could be interpreted from the scatter plot of data points that SVM is best suited for the dataset, and it is confirmed by the F1 score. By adjusting the threshold of sigmoid function of logistic regression I was able to reduce the false negatives predicted by the model which is crucial for the result as false positives represented the poisonous mushrooms as edible.

Computer Vision

Multiclass Classification: In this project I have used a convolutional neural network model to classifies images with hand signals into 25 different classes. Model architecture consists of 2 convolutional and max pooling layers, 1 dropout layer to avoid overfitting, 1 dense layer with 514 neurons and a final dense layer with 25 neurons and softmax.

Transfer Learning: Here I have created a computer vision model which uses imported weights from a model trained on extensive dataset- InceptionV3. All the layers of InceptionV3 model are set to untrainable except for the last. Then a final DNN architecture is added with a dense layer of 1024 neuron, a dropout layer and a final dense layer with 1 neuron and sigmoid activation to classify colored images of humans and horses.

Time Series Analysis using Tensorflow

Moving Average Baseline model:In this notebook I have created an artificial time series by adding gradual slope, seasonal variations over a fixed period and random noise to the graph. Then I have used an average of previous few predictions to predict the next value, and this has created a moving average time series prediction baseline model.

LSTM Model to outperform baseline: This model outperforms the moving average baseline in predicting the time series. It uses 2 bidirectional LSTMs layers to learn the features of the timeseries sending context to and from and then predicts the next value of a time series.

Prediction of sun spot temperatures: Sunspot Temperatures is a huge dataset that records temperature of sun's surface as far back as 1980's. My deep learning architecture aims at predicting this time series using a convolutional layer, two LSTM layers and three Dense layers. It also uses a Stochastic Gradient Descent optimizer which selects the best learning rate by running the model on a range of learning rates.

Natural Language Processing

Sarcasm Detection: This project highlights the importance of GLove embeddings in Natural Language Processing. It trains an architecture of one Bidirectional GRU - Gated Recurrent Unit, one dense layer with 32 neurons and a final dense layer with 1 neuron and sigmoid activation. This model is trained twice once without the weights of GLove embeddings and again with it. The difference is substantial and the model without GLove embeddings tends to drastically overfit.

Text Generator: In this notebook I train a Bidirectional LSTM to predict the next word in a sequence. For the training set I have used Shakespeare's sonnets and I have fed each sentence multiple times, each time removing one word from the sequence and using that word as the predicted label. Finally, the model tries to create a poem by accepting a seed text as input.

Roshan Tiwari's Projects

hackathon-spacepenguins icon hackathon-spacepenguins

My team won Jury's Pick award at CBRE Hackathon 2022. We trained a hybrid CNN model to predict housing prices. The model is trained on images from Google Earth and other publicly available data like a property's distance from top rated restaurants.

machine_learning_projects icon machine_learning_projects

This repository contains my projects from AML class (Regression, Decision Tree and Clustering). It also contains different projects that I built to understand and master TensorFlow libraries. These projects use TensorFlow to train NLP, Computer Vision and Time Series models. These models mostly use Bidirectional LSTMs and GRUs.

qualtrics-competition icon qualtrics-competition

A data analyst competition was organized by Qualtrics and I grabbed second place. I used Power BI to analyze Qualtrics client data through various scatter plots. I performed clustering analysis to derive insights using R Studio.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.