Coder Social home page Coder Social logo

Comments (3)

apoorv2904 avatar apoorv2904 commented on July 24, 2024

A few more details:
-- I had extracted features using Kaldi and set them up using the instructions provided.
-- I changed gelu to torch.nn.functional.gelu
-- using torch.nn.LayerNorm as TransformerLayerNorm

Since I could get the mel-160 features to work with the above changes, I don't think this should be the reason why filterbank features are not working.

--Apoorv

from s3prl.

andi611 avatar andi611 commented on July 24, 2024

Hi,

The value of loss depends on the feature, hence loss computed from fbank is different then loss computed from mel-160.

The reason why you did bad on the Montreal-phone set is the following:
The fbank features are pre-processed with a stride of 10 (a.k.a a fbank vector for every 10 ms, a default of Kaldi), which does not match the phone alignment labels of the Montreal-phone set.

On the other hand, the mel-160 feature matches the duration of the Montreal-phone set, that's why you obtained reasonable results when pretrain using mel-160 features.

You should either evaluate with the cpc phone task (suggested approach), or change the parameters in preprocess_alignment.py to generate a phone label for every 10 ms (a more complicated approach).

I have verified that pre-training with fbank and evaluating with the cpc task can yield reasonable result (65.9% linear classification accuracy), pre-training with fbank feature is guaranteed to work. You only have to confirm your phone label.

I hope this helps!

from s3prl.

apoorv2904 avatar apoorv2904 commented on July 24, 2024

Hi,

Thank you for the clarification. After changing to cpc-phone set, I get the mentioned classification accuracy.

I will close the issue.

Thanks,
Apoorv

from s3prl.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.