Comments (4)
NeMo provides German models. Writing phonemizers/tokenizers for German should not be difficult.
from nemoonnxsharp.
This page lists available models https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/tts/checkpoints.html
If you have NeMo installed. You can run
from nemo.collections.tts.models.base import SpectrogramGenerator, Vocoder
from nemo.collections.asr.models import EncDecCTCModel
SpectrogramGenerator.list_available_models()
Vocoder.list_available_models()
EncDecCTCModel.list_available_models()
to get lists.
[PretrainedModelInfo(
pretrained_model_name=QuartzNet15x5Base-En,
description=QuartzNet15x5 model trained on six datasets: LibriSpeech, Mozilla Common Voice (validated clips from en_1488h_2019-12-10), WSJ, Fisher, Switchboard, and NSC Singapore English. It was trained with Apex/Amp optimization level O1 for 600 epochs. The model achieves a WER of 3.79% on LibriSpeech dev-clean, and a WER of 10.05% on dev-other. Please visit https://ngc.nvidia.com/catalog/models/nvidia:nemospeechmodels for further details.,
location=https://api.ngc.nvidia.com/v2/models/nvidia/nemospeechmodels/versions/1.0.0a5/files/QuartzNet15x5Base-En.nemo
),
PretrainedModelInfo(
pretrained_model_name=stt_en_quartznet15x5,
description=For details about this model, please visit https://ngc.nvidia.com/catalog/models/nvidia:nemo:stt_en_quartznet15x5,
location=https://api.ngc.nvidia.com/v2/models/nvidia/nemo/stt_en_quartznet15x5/versions/1.0.0rc1/files/stt_en_quartznet15x5.nemo
),
PretrainedModelInfo(
...
from nemoonnxsharp.
@kaiidams
I went through NeMo, I could not find how German is supported. Any link would really appreciate.
from nemoonnxsharp.
Any suggestion which pair to use? If I understand correctly?
I will read more and come back. Thank you
German
For German STT
https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_de_quartznet15x5
string modelPath = await DownloadModelAsync("stt_de_quartznet15x5");
Todo
- phonemizers for German
- tokenizers for German
Writing phonemizers/tokenizers for German should not be difficult.
For German TTS
Mel-Spectrogram Generators
de-DE | tts_de_fastpitch_multispeaker_5 |
---|
de-DE | tts_de_fastpitch_singleSpeaker_thorstenNeutral_2102: 21.02 |
---|
de-DE | tts_de_fastpitch_singleSpeaker_thorstenNeutral_2102: 22.10 |
---|
Vocoders
de-DE | tts_de_hui_hifigan_ft_fastpitch_multispeaker_5 |
---|
de-DE | tts_de_hifigan_singleSpeaker_thorstenNeutral_2102:21.02 |
---|
de-DE | tts_de_hifigan_singleSpeaker_thorstenNeutral_2210: 22.10 |
---|
English
For English STT
https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/stt_en_quartznet15x5
string modelPath = await DownloadModelAsync("stt_en_quartznet15x5");
For English TTS
string phonemeDict = await DownloadModelAsync("cmudict-0.7b_nv22.10");
string heteronyms = await DownloadModelAsync("heteronyms-052722");
string specGenModelPath = await DownloadModelAsync("tts_en_fastpitch");
string vocoderModelPath = await DownloadModelAsync("tts_en_hifigan");
from nemoonnxsharp.
Related Issues (6)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nemoonnxsharp.