SYS 6018 Final Project: Music Genre Prediction System using Ensemble models
- audio_analysis/Audio EDA v0.1ss.ipynb: The code contains:
- Exploratory audio data analysis
- audio_analysis/Audio Models+Ensemble v0.1ss.ipynb:
-
Audio Modeling - QDA, Logistic, SVM
-
Ensemble models
- Text Modeling (5000 words, 1000 word) - text_analysis/final_text_analysis_Saurav.ipynb The code contains:
-
Text Modeling - Random Forest, Gradient Boosting, Multilayer Perceptron
-
Ensemble Model
-
Text EDA - data_eda/word_cloud.ipynb
-
Tree_MLP_Oversampling Analysis.ipynb: The code contains:
- Boosting, random forest, knn(No PCA), MLP, Oversampling for Audio
- Ensemble Boosting.ipynb: The code contains:
- Boosting for ensemble
- LyricBagofWordsClassifier.ipynb This code contains:
- Text modeling (500 word) - Random Forest, SVM, KNN, Logistic Regression
Datasets Used:
- Genre Annotations:
- http://www.tagtraum.com/msd_genre_datasets.html (File name: msd_tagtraum_cd2c.cls.zip)
- Audio Dataset:
- Lyrics Dataset: