Coder Social home page Coder Social logo

comp551's Introduction

COMP 551

Applied Machine Learning coursework at McGill University.

Project 1 - Introduction

As an introduction to practical machine learning, these are implementations (from scratch) of logistic regression with gradient descent and linear discriminant analysis. Other introductory steps include loading and cleaning the data set, extracting some basic additional features, creating a training-validation-test split, and using K-fold validation to evaluate the models.

Project 2 - Reddit Comment Classification

This project classifies uses the text of Reddit comments to classify them by the subreddit in which they were posted. The data is pre-processed and feature vectors are extracted, then fed into the models: Bernoulli Naive Bayes (from scratch), Multinomial Naive Bayes (from SciKit-Learn), and a LSTM neural network (using PyTorch). Results were submitted to this Kaggle leaderboard of McGill and University of Montreal graduate-level students.

Project 3 - MNIST Image Classification

This classifier uses a convolutional neural network to perform classification on a modified version of the MNIST dataset. Pre-processing techniques and CNN structure are varied for optimal performance. The datasets are too large to be included directly in the repository, but can be found on Kaggle, along with the competition leaderboard.

Project 4 - Paper Reproduction of CNN for Sentence Classification

This is a reproduction of the results in Yoon Kim's Convolutional Neural Networks for Sentence Classification. writeup.pdf contains the full report of this project.

The CNN repdroduction and hyperparameter tuning is performed in ccn-rand.ipynb. This file is configured to run locally, but executing it on a GPU will greatly reduce runtime.

Data pre-processing code from Kim's original paper can be found in OriginalCode.ipynb. The LSTM and Naive Bayes model implementations are located in BasicModels.ipynb. These files were written to be run on Google Colab, and the "baseFilepath" variable must match the location of the data in your Google Drive.

comp551's People

Contributors

alexanderhale avatar erickzhao avatar ericzimmermann avatar priscillaip avatar ezz2802 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.