Comments (6)
Good point! We haven't verified delta_delta.
In our recent experiment, we find that the choice of acoustic features have an effect on model performance (These exps will be released soon with our new paper TERA).
We are still exploring which features are the best for these reconstruction-based models.
One thing certain for now is, acoustic features are a parameter choice that researchers have to explore.
from s3prl.
Good point! We haven't verified delta_delta.
In our recent experiment, we find that the choice of acoustic features have an effect on model performance (These exps will be released soon with our new paper TERA).
We are still exploring which features are the best for these reconstruction-based models.
One thing certain for now is, acoustic features are a parameter choice that researchers have to explore.
make sense. look forward to your new paper :)
from s3prl.
Thank you for having an interest in our work.
Yes, for Mockingjay:
First, features are extracted with Librosa here.
Next, delta is applied here.
And finally, cmvn is also applied here (the zero mean and unit variance you've mentioned).
from s3prl.
Thank you for having an interest in our work.
Yes, for Mockingjay:
First, features are extracted with Librosa here.
Next, delta is applied here.
And finally, cmvn is also applied here (the zero mean and unit variance you've mentioned).
Thank you for your answer! May I ask why you used delta instead of delta_delta? Doesn't delta_delta (240 dim) provide more information than delta (160 dim) for Mockingjay to learn?
from s3prl.
Thank you for having an interest in our work.
Yes, for Mockingjay:
First, features are extracted with Librosa here.
Next, delta is applied here.
And finally, cmvn is also applied here (the zero mean and unit variance you've mentioned).Thank you for your answer! May I ask why you used delta instead of delta_delta? Doesn't delta_delta (240 dim) provide more information than delta (160 dim) for Mockingjay to learn?
Hi, thanks for the impressive work on this repository!
I have a question: Have you tried to simply concatenate different acoustic features like Mel, mfcc, fmllr, etc together to form the new input feature? Because in intuition, I think features with higher dimensions may reach better results. I wonder whether this method make sense. Thanks!
from s3prl.
Hi, thanks for the impressive work on this repository!
I have a question: Have you tried to simply concatenate different acoustic features like Mel, mfcc, fmllr, etc together to form the new input feature? Because in intuition, I think features with higher dimensions may reach better results. I wonder whether this method make sense. Thanks!
Interesting thought!
It makes sense to me, in our recent study we find input features have an large effect on reconstruction-based pre-trained models.
We’ve tried mfcc, fbank, fmllr separately, but we haven’t used them in combination yet.
from s3prl.
Related Issues (20)
- ValueError: mutable default <class 's3prl.upstream.roberta.roberta_model.EncDecBaseConfig'> for field encoder is not allowed: use default_factory HOT 3
- Not able to submit the results. HOT 4
- The rules for conformity for emotion recognition. HOT 5
- Potential SpecAug Issue HOT 1
- What is the accept rate in the VC task evaluation output? HOT 1
- a question about two-stage downstream task HOT 1
- ASVspoof Dateset Support HOT 2
- Requesting to add CLSRIL-23 pretrained model as new upstream HOT 6
- Cannot submit my results in the leaderboard HOT 4
- Document link broken HOT 1
- Broken link HOT 4
- How to extract weighted sum SSL representations from an audio dataset?
- 使用自己的数据进行预训练
- run vq_apc pretrain failed HOT 3
- QbE downstream HOT 3
- Performance difference between converted models and official models HOT 1
- A way to save and load the output of upstream model for speedup HOT 1
- Adding MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations (https://arxiv.org/pdf/2406.05661) HOT 1
- Can't load local checkpoint
- [Deleted]
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from s3prl.