Hello, It is very great repo! thanks. Need make training for Urdu (arabic lett

Hi! 🙂 At first, please read through the issue <a class="issue-link

Thanks you very much! I run the code analyze.py, get error: <div class="snippe

I want add support for Urdu about multilingual_text_to_speech HOT 4 CLOSED

tomiinek commented on May 28, 2024

I want add support for Urdu

from multilingual_text_to_speech.

Comments (4)

Tomiinek commented on May 28, 2024

Hi! 🙂

At first, please read through the issue #7, it may be helpful for you.

To build your dataset: If you have any audio-transcript pairs in Urdu, just mix them with the dataset provided in this repository and make sure that it has similar transcripts length distribution and audio duration distribution. You can use this jupyter notebook notebooks/analyze.ipynb for the analysis.

You do not need to change vocabulary as the model is character-based. So you do have to extend it with the list of Arabic letters. Specifically, change the parameter characters to something like abcdefghijklmnopqrstuvwxyzçèéßäöōǎǐíǒàáǔüèéìūòóùúāēěīâêôûñőűабвгдежзийклмнопрстуфхцчшщъыьэюяёوهنملكقفغعظطضصشسزرذدخحجثتبيا (I do not undrestand the Arabic script, but it should be a simple text string with every possible character in the alphabet, also make sure that encoding and everything works)

Best wishes, Tomas

from multilingual_text_to_speech.

commented on May 28, 2024

Thanks you very much!
I run the code analyze.py, get error:

FileNotFoundError                         Traceback (most recent call last)
<ipython-input-5-449f082df24f> in <module>
      3 metafile = "all_reduced.txt"
      4 dataset_root = "../data/css10"
----> 5 data = TextToSpeechDataset(os.path.join(dataset_root, metafile), dataset_root)

~/Multilingual_Text_to_Speech/dataset/dataset.py in __init__(self, meta_file, dataset_root_dir, known_unique_speakers)
     82         unique_speakers_set = set(self.unique_speakers)
     83         self.items = []
---> 84         with open(meta_file, 'r', encoding='utf-8') as f:
     85             for line in f:
     86                 line_tokens = line[:-1].split('|')

FileNotFoundError: [Errno 2] No such file or directory: '../data/css10/all_reduced.txt'

How create file /all_reduced.txt? is not in folder.

from multilingual_text_to_speech.

Tomiinek commented on May 28, 2024

Use a dataset meta-file that exists, for example metafile = "all.txt". The notebook is intended just for quick experiments and plotting of some data. You do not have to run it completly or as is.

from multilingual_text_to_speech.

stale commented on May 28, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

from multilingual_text_to_speech.

I want add support for Urdu about multilingual_text_to_speech HOT 4 CLOSED

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent