yunzhongfei / kaldi-senan

This project forked from sinica-slam/kaldi-senan

License: Other

Shell 45.80% C++ 35.77% Python 8.67% Perl 5.41% C 1.20% Java 0.07% MATLAB 0.07% Awk 0.01% TeX 1.18% Cuda 1.06% Makefile 0.22% HTML 0.36% CMake 0.18% Cython 0.01% Dockerfile 0.01%

kaldi-SENAN

kaldi-SENAN is an implementation of the speech-enhanced and noise-aware network (SENAN; see the paper below), built on the open-source Kaldi toolkit. Example scripts for the Aurora-4 task are provided in egs/aurora4/proposed.

Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, and Hsin-Min Wang, "Speech-enhanced and noise-aware networks for robust speech recognition," submitted to Interspeech 2022.


Prerequisites

Follow the standard Kaldi installation steps to build and install this project.

Aurora-4 example

  1. In stage 8 of run.sh, change the command to one of the following:

# TDNN-F as AM + proposed model
local/chain/tuning/run_tdnn-1a_mtae_mfcc-mfcc-cont_noise-stats.sh

# TDNN-F as AM + SpecAugment + proposed model
local/chain/tuning/run_tdnn-1a_mtae_mfcc-mfcc-cont_noise-stats_specaugment.sh

# CNN-TDNN-F as AM + proposed model
local/chain/tuning/run_cnn-tdnn-1c_mtae_mfcc-mfcc-cont_noise-stats.sh

# CNN-TDNN-F as AM + SpecAugment + proposed model
local/chain/tuning/run_cnn-tdnn-1c_mtae_mfcc-mfcc-cont_noise-stats_specaugment.sh

  2. The weights of the two output layers can be changed by modifying frame_weight_dae and frame_weight_dspae in run_{tdnn-1a,cnn-tdnn-1c}_mtae_*.sh
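
The weight change described above could be scripted roughly as follows. This is a minimal sketch, not part of the project: it assumes the tuning scripts set the weights as plain shell assignments at the start of a line (for example `frame_weight_dae=1.0`), which should be checked against the actual script first. The helper `set_weight`, the stand-in file, and the values 0.5 and 0.2 are hypothetical and chosen only for illustration.

```shell
#!/usr/bin/env bash
# Sketch only: assumes each weight appears as a plain assignment such as
# `frame_weight_dae=1.0` at the start of a line in the tuning script.
set -euo pipefail

# set_weight FILE NAME VALUE
# Rewrites the line `NAME=...` in FILE to `NAME=VALUE`, keeping a .bak backup.
set_weight() {
  local file=$1 name=$2 value=$3
  # Fail loudly if the assignment is missing, so nothing is silently skipped.
  grep -q "^${name}=" "$file" || { echo "no ${name}= line in $file" >&2; return 1; }
  sed -i.bak "s|^${name}=.*|${name}=${value}|" "$file"
}

# Demonstration on a stand-in file (hypothetical contents, not the real script).
demo=$(mktemp)
printf 'frame_weight_dae=1.0\nframe_weight_dspae=1.0\n' > "$demo"

set_weight "$demo" frame_weight_dae 0.5    # illustrative value
set_weight "$demo" frame_weight_dspae 0.2  # illustrative value

cat "$demo"
```

To apply it to a real script, pass that script's path instead of the stand-in file; the `.bak` copy left by `sed` makes the change easy to revert.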
