Coder Social home page Coder Social logo

Comments (6)

andi611 avatar andi611 commented on July 24, 2024 1

Good point! We haven't verified delta_delta.

In our recent experiment, we find that the choice of acoustic features have an effect on model performance (These exps will be released soon with our new paper TERA).
We are still exploring which features are the best for these reconstruction-based models.
One thing certain for now is, acoustic features are a parameter choice that researchers have to explore.

from s3prl.

rtst777 avatar rtst777 commented on July 24, 2024 1

Good point! We haven't verified delta_delta.

In our recent experiment, we find that the choice of acoustic features have an effect on model performance (These exps will be released soon with our new paper TERA).
We are still exploring which features are the best for these reconstruction-based models.
One thing certain for now is, acoustic features are a parameter choice that researchers have to explore.

make sense. look forward to your new paper :)

from s3prl.

andi611 avatar andi611 commented on July 24, 2024

Thank you for having an interest in our work.

Yes, for Mockingjay:
First, features are extracted with Librosa here.
Next, delta is applied here.
And finally, cmvn is also applied here (the zero mean and unit variance you've mentioned).

from s3prl.

rtst777 avatar rtst777 commented on July 24, 2024

Thank you for having an interest in our work.

Yes, for Mockingjay:
First, features are extracted with Librosa here.
Next, delta is applied here.
And finally, cmvn is also applied here (the zero mean and unit variance you've mentioned).

Thank you for your answer! May I ask why you used delta instead of delta_delta? Doesn't delta_delta (240 dim) provide more information than delta (160 dim) for Mockingjay to learn?

from s3prl.

yilunzhao avatar yilunzhao commented on July 24, 2024

Thank you for having an interest in our work.
Yes, for Mockingjay:
First, features are extracted with Librosa here.
Next, delta is applied here.
And finally, cmvn is also applied here (the zero mean and unit variance you've mentioned).

Thank you for your answer! May I ask why you used delta instead of delta_delta? Doesn't delta_delta (240 dim) provide more information than delta (160 dim) for Mockingjay to learn?

Hi, thanks for the impressive work on this repository!

I have a question: Have you tried to simply concatenate different acoustic features like Mel, mfcc, fmllr, etc together to form the new input feature? Because in intuition, I think features with higher dimensions may reach better results. I wonder whether this method make sense. Thanks!

from s3prl.

andi611 avatar andi611 commented on July 24, 2024

Hi, thanks for the impressive work on this repository!

I have a question: Have you tried to simply concatenate different acoustic features like Mel, mfcc, fmllr, etc together to form the new input feature? Because in intuition, I think features with higher dimensions may reach better results. I wonder whether this method make sense. Thanks!

Interesting thought!
It makes sense to me, in our recent study we find input features have an large effect on reconstruction-based pre-trained models.
We’ve tried mfcc, fbank, fmllr separately, but we haven’t used them in combination yet.

from s3prl.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.