Comments (9)
LibriVox doesn't have properly aligned transcriptions. Is figuring out a solution for that within the scope of this issue?
from deepspeech.
Another alternative would be using existing corpuses (corpi?) extracted from LibriVox like LibriSpeech: http://www.openslr.org/12/
from deepspeech.
Have you looked at the TED code in issue 2?
from deepspeech.
Yep. I started writing a bunch of code for downloading and formatting the LibriVox data directly, from the Internet Archive, but after reading the LibriSpeech paper I learned that proper alignment and segmentation is a very large effort and we should probably just use that corpus directly, so I'm gonna do that.
from deepspeech.
Before you go off on a wild goose chase, please define what you mean by "proper alignment".
from deepspeech.
Also did you read and understand the Deep Speech paper?
The Deep Speech paper and our code under master uses the CTC algorithm which does not require "alignment" in the sense used for HMM STT engines.
from deepspeech.
Using LibriSpeech directly is fine, it's actually what I expected form the start, but do not spend time trying to "align" the corpus in the sense used for HMM STT engines. CTC does not require such "alignment".
from deepspeech.
Also did you read and understand the Deep Speech paper?
Not as well as I thought I had, evidently! Either that or I'm just abusing the jargon.
I was under the impression that the transcriptions need to have a minimal resemblance to the audio, which the raw LibriVox data, by default, doesn't have. That's as far as my definition of "alignment" went: skipping the initial audio disclaimers, skipping the license header on the Project Gutenberg files, etc.
In any case, we've ended up on the same page, albeit in my case that included a few bumps along the way :P
from deepspeech.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
from deepspeech.
Related Issues (20)
- please add support for Apple silicon M1 Mac os, missing nodejs binding for darwin_arm64 HOT 10
- Deep Speech with C# .Net application initial setup? HOT 1
- Help with training own language model using DeepSpeech HOT 2
- Error while installing deepspeech HOT 1
- ERROR: Could not find a version that satisfies the requirement deepspeech (from versions: none) ERROR: No matching distribution found for deepspeech HOT 1
- onnx inference HOT 1
- Unable to install deepspeech HOT 14
- deepspeech-0.9.3-models-zh-CN.scorer invalid???
- Request for precompiled DeepSpeech-TFLite for ARMv7l (armhf) HOT 1
- Is repo dead ? HOT 1
- How can I get syllable or phoneme with Deepspeech HOT 1
- how to convert LM's binary model to arpa file HOT 2
- Why I call DS_IntermediateDecode always crash a few seconds later at the iphone device? HOT 1
- Chaloemphon Praphuchakang
- install DeepSpeech Python error HOT 4
- Still Acrive? HOT 9
- Index.rst is either out of date or I cannot cut and paste any more HOT 1
- .Net version is at End of Life ? HOT 1
- LINK : fatal error LNK1561: entry point must be defined
- well-maintained derived project
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deepspeech.