Light

iamjanvijay / rnnt_decoder_cuda Goto Github PK

View Code? Open in Web Editor NEW

65.0 3.0 9.0 191.38 MB

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

License: MIT License

C++ 41.23% Makefile 2.48% Cuda 42.57% Python 13.72%

cuda rnnt beam-search prefix-search transducer speech-recognition speech-to-text handwriting-recognition

rnnt_decoder_cuda's Introduction

RNN-Transducer Prefix Beam Search

This repository provides an optimised implementation of prefix beam search for RNN-Tranducer loss function (as described in "Sequence Transduction with Recurrent Neural Networks" paper). This implementation takes ~100 milliseconds for a speech segment of ~5 seconds and beam size of 10 (beam size of 10 is adequate for production level error rates).

Sample Run

To execute a sample run of prefix beam search on your machine, execute the following commands:

Clone this repository.

git clone https://github.com/iamjanvijay/rnnt_decoder_cuda.git;

Clean the output folder.

rm rnnt_decoder_cuda/data/outputs/*;

Make the deocder object file.

cd rnnt_decoder_cuda/decoder;
make clean;
make;

Execute the decoder - decoded beams will be saved to data/output folder.

CUDA_VISIBLE_DEVICES=0 ./decoder ../data/inputs/metadata.txt 0 9 10 5001;
CUDA_VISIBLE_DEVICES=$GPU_ID$ ./decoder ../data/inputs/metadata.txt $index_of_first_file_to_read_from_metadata$ $index_of_last_file_to read_from_metadata$ $beam_size$ $vocabulary_size_excluding_blank$;

Contributing

Contributions are welcomed and greatly appreciated.

rnnt_decoder_cuda's People

Contributors

Stargazers

Watchers

Forkers

learnedvector entn-at zhangaustin bizzu5252 jinggaizi songtaoshi dophist steedman-uoo-1 gavin90s

rnnt_decoder_cuda's Issues

Will this work on a subword level?

Hello! Thanks for the repo! Will this implementation work on a subword level?

why not use jit?

Hi, quick question,

I noticed that you saved the pytorch weights for joint network and prediction network, and then reloaded them manually using cpp.

Why did you choose this way instead of JIT the files and upload them using torchscript? Is your way faster or something?

Thanks,
Wancong

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.