drum_sound_classifier's Introduction

Drum Sound Classification

A python module for making pandas datasets out of drum libraries, and training drum type classification models using a few different methods. Read my accompanying blog post.

Set up

Install requirements with pip. From this cloned repository it should be enough to do pip install ., but using a virtual environment is encouraged.
Get drum sounds. I recommend r/drumkits
Run something like python drum_sound_classifier/preprocess.py --drum_lib_path /path/to/drums. This will recursively search, so nested directories are fine. It will safely skip any non-audio files. Run python preprocess.py --help for options.
- If you would like to inspect the resulting dataset yourself, this will create a pickled pandas dataframe data/interim/dataset.pkl

Training

To train a random forrest classifier on drum descriptors, run:

python drum_sound_classifier/models/train_sklearn.py --inputs descriptors --model random_forest

You may want to add --no_extract_spectrograms if you don't plan on using a GPU to train a CNN model. Otherwise, spectrogram data will be pre-extracted which takes up disk space.

To train a CNN-based model (a GPU is essential), run:

python drum_sound_classifier/models/train_cnn.py

And finally, assuming you have a CNN model trained, to try a SVC over CNN embeddings run:

python drum_sound_classifier/models/train_sklearn.py --inputs cnn_embeddings --model svc

Run any of the above with --help for options.

Inference

I have yet to add support for inference using sklearn-derived models (pull requests welcome!), but to infer with CNN models see inference.py

drum_sound_classifier's People

Stargazers

Watchers

drum_sound_classifier's Issues

train_cnn does not work with GPU

ERROR:ignite.engine.engine.Engine:Current run is terminating due to exception: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same
ERROR:ignite.engine.engine.Engine:Engine run is terminating due to exception: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same
Traceback (most recent call last):
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine\engine.py", line 775, in _internal_run
self._handle_exception(e)
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine\engine.py", line 469, in _handle_exception
raise e
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine\engine.py", line 745, in _internal_run
time_taken = self._run_once_on_dataset()
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine\engine.py", line 850, in _run_once_on_dataset
self._handle_exception(e)
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine\engine.py", line 469, in _handle_exception
raise e
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine\engine.py", line 833, in _run_once_on_dataset
self.state.output = self.process_function(self, self.state.batch)
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\ignite\engine_init.py", line 103, in _update
y_pred = model(x)
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:/Work/drum_sound_classifier/drum_sound_classifier/models/train_cnn.py", line 44, in forward
self.conv1_batch(self.conv1(tensor)),
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\torch\nn\modules\module.py", line 1051, in _call_impl
return forward_call(*input, **kwargs)
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\torch\nn\modules\conv.py", line 443, in forward
return self._conv_forward(input, self.weight, self.bias)
File "C:\Work\drum_sound_classifier\venv\lib\site-packages\torch\nn\modules\conv.py", line 439, in _conv_forward
return F.conv2d(input, weight, bias, self.stride,
RuntimeError: Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same

Recommend Projects

radkoff / drum_sound_classifier Goto Github PK

drum_sound_classifier's Introduction

Drum Sound Classification

Set up

Training

Inference

drum_sound_classifier's People

Stargazers

Watchers

Forkers

drum_sound_classifier's Issues

train_cnn does not work with GPU

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent