rochitasundar Goto Github PK

followers: 24 following: 0 repos: 17 gists: 0

Name: Rochita Sundar

Type: User

Company: Data Scientist @ Voices

Bio: Consistency is key.

Location: Vancouver, Canada

Blog: https://ca.linkedin.com/in/rochita-sundar

Hi, this is Rochita! 👋

Based out of Vancouver, British Columbia 🏔️
Interested in Data Science, Machine Learning & Big Data 🌱
Currently learning more on Deep Learning (NLP & CV) 📚
Check out some of my projects 🔭

Rochita Sundar's Projects

classification-booking-cancelation-prediction-starhotels

The aim is to develop an ML-based predictive classification model (logistic regression & decision trees) to predict which hotel bookings are likely to be canceled. This is done by analysing different attributes of customers' booking details. Being able to predict accurately in advance whether a booking is likely to be canceled will help formulate profitable policies for cancelations & refunds.

collaborative-filtering-book-recommendation-system

This project aims to build & optimise a book recommendation system based on collaborative filtering, with one example each of a memory-based and a model-based approach (using KNNWithMeans & Singular Value Decomposition).

customer-profiling-using-ml-easyvisa

The aim is to find an optimal ML model (Decision Tree, Random Forest, Bagging or Boosting Classifiers with Hyper-parameter Tuning) to predict visa statuses for work visa applicants to the US. This will help decrease the time spent processing applications (the number of which is currently increasing at a rate of >9% annually) while formulating a suitable profile of candidates more likely to have the visa certified.

deeplearning.ai-practical-data-science-on-aws-cloud-specialization

This repository contains my code solutions to DeepLearning.AI's Practical Data Science On AWS Cloud Specialization.

deploying-machine-learning-models

Code for the online course "Deployment of Machine Learning Models"

generative-ai-with-large-language-models

This repository contains the lab work for the Coursera course "Generative AI with Large Language Models".

intro-to-deep-learning-with-pytorch

This repository contains my code solutions to Udacity's coursework 'Intro to Deep Learning with PyTorch'.

predictive-maintenance-cost-minimization-using-ml-renewind

The aim is to decrease the maintenance cost of generators used in wind energy production machinery. This is achieved by building various classification models, accounting for class imbalance, tuning on a user-defined cost metric (a function of the true positives, false positives and false negatives predicted) & productionising the model using pipelines.
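As a rough illustration of what a user-defined cost metric of this kind might look like, here is a minimal sketch in plain Python. The function names and the three unit costs are hypothetical placeholders, not values taken from the repository; the only thing assumed from the description is that the metric is a function of true positives, false positives and false negatives.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int]:
    """Count true positives, false positives and false negatives for binary labels (1 = failure)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, fp, fn


def maintenance_cost(
    y_true: Sequence[int],
    y_pred: Sequence[int],
    tp_cost: float = 15.0,  # hypothetical cost of acting on a correctly predicted failure
    fp_cost: float = 5.0,   # hypothetical cost of acting on a false alarm
    fn_cost: float = 40.0,  # hypothetical cost of a missed failure
) -> float:
    """Total cost as a weighted sum of TP, FP and FN counts (lower is better)."""
    tp, fp, fn = confusion_counts(y_true, y_pred)
    return tp * tp_cost + fp * fp_cost + fn * fn_cost


if __name__ == "__main__":
    y_true = [1, 1, 0, 0, 1]
    y_pred = [1, 0, 1, 0, 1]
    print(maintenance_cost(y_true, y_pred))
```

A function like this could be passed to a model-selection routine as the quantity to minimise; with real data, the unit costs would need to come from the business context.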

regression-dynamic-price-prediction-recell

The objective is to build an ML-based solution (linear regression model) to develop a dynamic pricing strategy for used and refurbished smartphones, identifying the factors that significantly influence their price.

rochitasundar

statistical-analysis-hypothesis-testing-e-newsexpress

The data relates to several user actions or interests recorded on two variants of landing pages for an online news portal. The objective is to analyse these interests by performing statistical analyses to determine if one variant is more effective based on chosen metrics (A/B testing).
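One common way to compare two landing-page variants is a two-proportion z-test on a conversion-style metric. The sketch below assumes that kind of metric purely for illustration; the description does not say which metrics or tests the project uses, and the function name and sample counts are hypothetical.

```python
import math
from typing import Tuple


def two_proportion_z_test(conv_a: int, n_a: int, conv_b: int, n_b: int) -> Tuple[float, float]:
    """Two-sided z-test for the difference between two proportions.

    conv_a / n_a: conversions and visitors for variant A (hypothetical inputs)
    conv_b / n_b: conversions and visitors for variant B (hypothetical inputs)
    Returns (z statistic, two-sided p-value).
    """
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled proportion under the null hypothesis that both variants convert equally.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


if __name__ == "__main__":
    z, p = two_proportion_z_test(conv_a=20, n_a=100, conv_b=40, n_b=100)
    print(f"z = {z:.3f}, p = {p:.4f}")
```

A small p-value suggests the difference in conversion between the variants is unlikely under the null hypothesis; the significance threshold is a choice left to the analyst.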

stock-clustering-using-ml

The project involves performing clustering analysis (K-Means, Hierarchical clustering, visualization after PCA) to group stocks with similar characteristics or with minimum correlation. A diversified portfolio tends to yield higher returns and carry lower risk by tempering potential losses when the market is down.

tableau-visualization-canadiansuperstore

Storyboard published on Tableau Public: https://public.tableau.com/app/profile/rsundar/viz/CanadianSuperstoreDatasetVisualization/CanadianSuperstoreDataset

tutorial-smile-detector

Streamlit Smile Detector App

twitter-sentiment-analysis

The data consists of tweets scraped using the Twitter API. The objective is sentiment labelling using a lexicon approach, performing text pre-processing (such as language detection, tokenisation, normalisation, vectorisation), building pipelines for text classification models for sentiment analysis, followed by explainability of the final classifier.
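To make the idea of lexicon-based labelling concrete, here is a toy sketch. The tiny lexicon, the tokenisation rule and the function name are all hypothetical and stand in for whatever lexicon and pre-processing the project actually uses.

```python
import re
from typing import Dict

# Hypothetical toy lexicon: word -> polarity score. A real project would use a published lexicon.
TOY_LEXICON: Dict[str, int] = {
    "good": 1, "great": 1, "love": 1,
    "bad": -1, "terrible": -1, "hate": -1,
}


def label_sentiment(text: str, lexicon: Dict[str, int] = TOY_LEXICON) -> str:
    """Label text as positive, negative or neutral by summing lexicon scores of its tokens."""
    # Normalise to lowercase and tokenise on runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    score = sum(lexicon.get(token, 0) for token in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for tweet in ["I love this great show", "This is terrible, I hate it", "Watching a show tonight"]:
        print(tweet, "->", label_sentiment(tweet))
```

Labels produced this way can then serve as targets for training a text classifier.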

twittersentimentanalysis-bigdataproject

Scraped tweets using the Twitter API (for the keyword ‘Netflix’) on an AWS EC2 instance and ingested the data into S3 via Kinesis Firehose. Used Spark ML on Databricks to build a pipeline for a sentiment classification model, and Athena & QuickSight to build a dashboard.

vpl-world-languages-webscrapping-project

This project aims to scrape the website of Vancouver Public Library using automation test software. The automated tool will scrape more than 70K records to gather information on the specific language collection, title, author, category, availability status and ratings of international language material to draw insights.
