corticph / prefix-beam-search Goto Github PK
View Code? Open in Web Editor NEWCode for prefix beam search tutorial by @labodk
Code for prefix beam search tutorial by @labodk
I have my corpus in plain text and language model in .arpa
format generated from KenLM.
How can I input those to the algorithm?
Thank you for the implementation.
Is there any way to find the path the beam search uses for each final hypothesis?
For example, if the hypothesis was
"a loud laugh followed at chunkys expense"
-------a-- l-l-l--oo--ud---- etc. ...
Hi, thanks for this awesome tutorial! I was wondering, how could i integrate the end symbol ('>') in the algorithm if my CTC tables do not contain it? Would it be possible with the Language Model by predicting EOS? Another thing is, how to train the model with CTC and include the end symbol? Would ju just append each sentence with the end mark? Thanks!
https://github.com/corticph/prefix-beam-search/blob/master/prefix_beam_search.py#L61 should be
Pnb[t][l] += ctc[t][c_ix] * Pb[t - 1][l]
in function greedy_decoder, alphabet = list(ascii_lowercase) + [' ', '>']. But in function prefix_beam_search, alphabet = list(ascii_lowercase) + [' ', '>', '%']. i feel confused.
A_prev is [' '] so it doesn't loop. Is this a bug?
Edit: I mean, nothing is added to my A_prev in the loop. And in the end A_prev is empty.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.