keonlee9420 / comprehensive-tacotron2 Goto Github PK

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

License: MIT License

Python 100.00%

text-to-speech tts tacotron tacotron2 pytorch speech-synthesis autoregressive single-speaker multi-speaker robustness

comprehensive-tacotron2's People

Contributors

Stargazers

Watchers

Forkers

shaun95 tiamat-tech entn-at ishine mandar2bn2b mbarnig friendmine zdisket xanomanox raoprer ziyaad30 harshalplus1 ma5onic bozorgmehr

comprehensive-tacotron2's Issues

About hifigan package

Hi @keonlee9420

How to install hifigan package in model.py file

Best regards,
PeterPham

some questions about synthesize

@keonlee9420 Thanks for your great work of this repo!

However, I have some questions about this model:

I have used the pretrained model to get good performance on single speaker (LJSpeech), but can't synthesize speech on multi-speaker, becasue I dont have the speaker embedding on VCTK. Could you upload some speaker embedding .npy files such as the default speaker "p225" ? I am downloading VCTK but expect it to take a day.
Have you tested the performance on the LibriTTS dataset?

About package numba

Hi @keonlee9420
Do you know how to solve this issue?

preprocess.py error

raceback (most recent call last):
File "D:\C\Comprehensive-Tacotron2-main\preprocess.py", line 28, in
main(preprocess_config)
File "D:\C\Comprehensive-Tacotron2-main\preprocess.py", line 13, in main
preprocessor.build_from_path()
File "D:\C\Comprehensive-Tacotron2-main\preprocessor\vctk.py", line 168, in build_from_path
self.divide_speaker_by_gender(self.in_dir), filename="spker_embed_tsne.png"
File "D:\C\Comprehensive-Tacotron2-main\utils\tools.py", line 314, in plot_embedding
data_y = np.array([gender_dict[spk_id] == 'M' for spk_id in embedding_speaker_id], dtype=np.int)
File "D:\C\Comprehensive-Tacotron2-main\utils\tools.py", line 314, in
data_y = np.array([gender_dict[spk_id] == 'M' for spk_id in embedding_speaker_id], dtype=np.int)
KeyError: 'preprocessed_data\VCTK\spker_embed\p225'

Broken link to pretrained models

The link https://github.com/keonlee9420/Comprehensive-Tacotron2/blob/main indicated in the Readme shows a page not found 404.

Can't install or run

from numba import _dynfunc, _helperlib
ImportError: numpy.core.multiarray failed to import

keonlee9420 / comprehensive-tacotron2 Goto Github PK

comprehensive-tacotron2's People

Contributors

Stargazers

Watchers

Forkers

comprehensive-tacotron2's Issues

About hifigan package

some questions about synthesize

About package numba

preprocess.py error

Broken link to pretrained models

Can't install or run

how to train chinese

not find "speakers.json"

about train error

What GPU did you use to train the models?

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent