Is there a tutorial/guide to apply this code for a different custom dataset

I've written a preprocessing for this: <a class="commit-link" data-hovercard-ty

I've written a preprocessing for this: <a href="https://github.com

Tutorial for application on custom dataset about s3prl HOT 7 CLOSED

s3prl commented on July 24, 2024

Tutorial for application on custom dataset

from s3prl.

Comments (7)

andi611 commented on July 24, 2024

I've written a preprocessing script for this: 27ff0d5

For any arbitrary dataset that looks like this:

- Custom_dataset/
    - Custom_train/
       - *.wav / flac / mp3 ...
    - Custom_dev/
       - *.wav / flac / mp3 ...
    - Custom_test/
       - *.wav / flac / mp3 ...

The script will process the "train", "dev", "test" set one by one,
and users only need to specify the path of the directory of each set.
So for the example above,
the path to the "train" set should be: Custom_dataset/Custom_train/
the path to the "dev" set should be: Custom_dataset/Custom_dev/
the path to the "test" set should be: Custom_dataset/Custom_test/
The generated files will be compatible to our dataloader.

Also, in your config file, these should be changed:

  data_path: 'data/NewData_fbank80' 
  train_set: ['train']
  dev_set: ['dev'] 
  test_set: ['test']

If it is convenient, can you please test this script on your own dataset to see if it works.
(I currently don't have any other dataset to process)
Let me know if there is any problem.

from s3prl.

shivam-chandhok commented on July 24, 2024

Thank You very much for your help.I will look into it and get back.
Kindly have a look @Dhumketu

from s3prl.

juanting commented on July 24, 2024

Hello, thank you very much for sharing such an excellent project. As a newcomer in this field, I would like to ask whether this project can be used to generate speaker embedding for my future work. If so, could you please introduce the general process? Looking forward to your reply.

from s3prl.

andi611 commented on July 24, 2024

I would like to ask whether this project can be used to generate speaker embedding for my future work. If so, could you please introduce the general process?

Yes, of course. The general process is as follow:

Pre-train an upstream model in a self-supervised manner: Mockingjay, TERA, Audio ALBERT, APC, CPC, etc.
Extract representations from the pre-trained upstream model, these representations are the speaker embedding you are looking for.
Apply the extracted representations to your downstream task.

However, whether the learned representations are good speaker embedding largely depends on your downstream task, we've only verified them with speaker classification tasks using the LibriSpeech corpus. Various speaker classification experiment results are presented in the Mockingjay, TERA paper.

from s3prl.

juanting commented on July 24, 2024

Thank you very much for your reply. I will study the process

from s3prl.

aviasd commented on July 24, 2024

I've written a preprocessing script for this: 27ff0d5

For any arbitrary dataset that looks like this:
- Custom_dataset/
    - Custom_train/
       - *.wav / flac / mp3 ...
    - Custom_dev/
       - *.wav / flac / mp3 ...
    - Custom_test/
       - *.wav / flac / mp3 ...
The script will process the "train", "dev", "test" set one by one,
and users only need to specify the path of the directory of each set.
So for the example above,
the path to the "train" set should be: Custom_dataset/Custom_train/
the path to the "dev" set should be: Custom_dataset/Custom_dev/
the path to the "test" set should be: Custom_dataset/Custom_test/
The generated files will be compatible to our dataloader.

Also, in your config file, these should be changed:
  data_path: 'data/NewData_fbank80' 
  train_set: ['train']
  dev_set: ['dev'] 
  test_set: ['test']
If it is convenient, can you please test this script on your own dataset to see if it works.
(I currently don't have any other dataset to process)
Let me know if there is any problem.

Is this working for pretrained TERA too?
Or is it just suitable for training our own original model from scratch without using the pretrained TERA?

from s3prl.

andi611 commented on July 24, 2024

Is this working for pretrained TERA too?
Or is it just suitable for training our own original model from scratch without using the pretrained TERA?

No, this will not work for the pre-trained TERA. As pre-trained TERA requires fmllr data, which can be download from the provided Google drive link. (Pre-trained TERA needs the original fmllr data, not new extracted ones.)

from s3prl.

Tutorial for application on custom dataset about s3prl HOT 7 CLOSED

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent