Comments (6)
from eesen.
@ramonsanabria Hi, but is it really reasonable to let the AM learn where the word boundaries are? Sometimes the segmentation is not obvious. We are already using CTC, so the AM should not have to care about this; the LM should handle it.
Thanks for your reply and I'm really glad to discuss this issue with you.
@ramonsanabria Hi, nice work in your papers! About the space symbol in the AM, do you mean this part of 1708.04469?
Word boundaries can be modeled with a space symbol or by capitalizing the first letter of each word [11]. While decoding CTC acoustic models without adding external linguistic information works well, a vast amount of training data should be used to get competitive results [12].
Actually, this means the AM has learned some linguistic information from the labeled text and embedded a weak LM in itself. That is not a good idea if we need to switch application domains by swapping in the corresponding LM. And it does need a vast amount of training data in the meantime.
Thanks for your information again.
I only use this repository to build TLG and decode CTC outputs. I still wonder whether I can drop these symbols from the AM while using this repository. Do you have any ideas?
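To make the two boundary conventions from the quoted passage concrete, here is a minimal Python sketch. It is not code from eesen; the function names and the `<space>` token are hypothetical and only illustrate how the label sequences would differ.

```python
def encode_with_space(sentence, space_symbol="<space>"):
    """Character labels with an explicit word-boundary symbol (hypothetical token)."""
    labels = []
    for i, word in enumerate(sentence.lower().split()):
        if i > 0:
            # Insert the boundary symbol between words, not before the first one.
            labels.append(space_symbol)
        labels.extend(word)
    return labels


def encode_with_capitals(sentence):
    """Character labels where the first letter of each word is capitalized."""
    labels = []
    for word in sentence.lower().split():
        labels.append(word[0].upper())  # the capital letter marks the boundary
        labels.extend(word[1:])
    return labels
```

In both cases the AM's label set carries the boundary information, which is exactly what the question above is about.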
@ramonsanabria I got it. I can train the AM in char mode and build the TLG in phone mode. This way, I can discard space, unk, silence and so on.
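A rough sketch of that idea, assuming a plain `word unit unit ...` lexicon line format (an assumption, not confirmed by this thread) and hypothetical helper names: each word is spelled out as its characters, so the unit inventory contains letters only and word boundaries come from the lexicon and LM rather than from the AM.

```python
def build_char_lexicon(words):
    """Map each word to its character sequence, with no space symbol."""
    lexicon = {}
    for word in sorted(set(w.lower() for w in words)):
        lexicon[word] = list(word)
    return lexicon


def format_lexicon(lexicon):
    """Render entries as 'word c1 c2 ...' lines (assumed line format)."""
    return [f"{word} {' '.join(units)}" for word, units in lexicon.items()]


def unit_inventory(lexicon):
    """Collect the units the AM would have to output: characters only."""
    return sorted({u for units in lexicon.values() for u in units})


lexicon = build_char_lexicon(["cat", "the", "cat"])
lines = format_lexicon(lexicon)
units = unit_inventory(lexicon)
```

Here the units play the role that phones play in a phone-mode lexicon, so no space, unk, or silence label is needed in the inventory.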