A collection of projects I completed during my Data Science Immersive at General Assembly Boston.
- Project 1: Python Basics
- Focus: Learning dictionaries, lists, and functions
- Project 2: Simple SAT and drug datasets
- Focus: Pandas, manipulating DataFrames, visualizing distributions, and basic statistics.
- Project 3: Kaggle Competition, Ames Housing Dataset
- Focus: Linear regression, regularization, scaling, feature transformation, Principal Component Analysis (PCA), cross-validation, and model evaluation metrics. Visual storytelling. Model interpretation.
- Unassigned Project: West Nile Virus Kaggle Competition
- Focus: Classification algorithms (Logistic Regression, KNN, Random Forest), PCA, merging datasets, dummy variables (categorical -> numeric conversions)
- Project 4: Indeed Job Scraping/Salary Analysis
- Scraping Notebook, Project Presentation Notebooks
- Focus: Web scraping, building up a dataset, NLP-related skills (Bag of Words, TF-IDF, clustering (LDA, NMF)), regex, pipelines, grid searching, hyperparameter tuning, feature engineering
- Capstone: Redfin Time Series Analysis/Google Trends (pytrends-simple is in the side projects folder)
- Goal: Produce a one-size-fits-all model to predict the median sale price in any metro region. See the presentation for details.
- Included: Bokeh interactive MAE chart, D3.js animation, rough-draft PowerPoint presentation
- Focus: Time series analysis, ARIMA, SARIMA, decomposition, stationarization, Pickle, Pandas manipulation, REST API