Coder Social home page Coder Social logo

pragyak412 / improving-voice-separation-by-incorporating-end-to-end-speech-recognition Goto Github PK

View Code? Open in Web Editor NEW
17.0 5.0 2.0 268 KB

Implementing the paper -

Python 99.53% Shell 0.47%
pytorch kaldi-asr espnet voice-separation python speech-recognition

improving-voice-separation-by-incorporating-end-to-end-speech-recognition's People

Contributors

pragyak412 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

improving-voice-separation-by-incorporating-end-to-end-speech-recognition's Issues

problem with loading pre-trained models

Hi,

I have tried lo load the published pre-trained models but I got mismatches between the models definition (w/o asr) and the checkpoints files. Can you assist?

Thanks!

How to test pretrained model?

Hello @pragyak412
I tried to get script for voice separation using this repository.
But I think there's no such testing here.
Could you share simple test script for voice separation and also ASR?
Thanks.

Voice beginners on data sets and weight loading problems

Hello, I am a phonetic beginner. I would like to use wsj0-2mix data set s1 as the training set of ETESpeechRecognition to improve the separation performance of convtasnet by extracting features. I would like to ask whether this method is feasible. Your advice and help are urgently needed

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.