I am working on Colab, and for now, I'm trying to train the model with LJSpeech datase

Average training time in Google Colab with GPU about transformertts HOT 3 OPEN

as-ideas commented on June 10, 2024

Average training time in Google Colab with GPU

from transformertts.

Comments (3)

cfrancesco commented on June 10, 2024 2

Hi, I trained the autoregressive models for about 600K steps (some less) and around the same for the forward models. This should take, if I remember correctly, about 2-3 days (on RTX 2080).

from transformertts.

tylerweitzman commented on June 10, 2024

I'm getting 1.7 s/it on tts training and 5.9 s/it on aligner training on a Tesla P100 16GB on Colab

I'm trying to figure out how batch size plays into this, if I had more GPU memory for example. The config file only has bucket_batch_sizes but no batch_size so I'm not sure what batch size this is running on— I think bucket_batch_sizes is only for the aligner?

Also, it looks like my default config is different than yours @giymen https://github.com/as-ideas/TransformerTTS/blob/main/config/training_config.yaml shows a max step of 260,000 for example, not 900,000, and not 600,000, so, there may also be other things changed (say dimensions) that would impact the number of parameters and therefore the training time. @cfrancesco could you explain the batch size and the discrepancy in default training configuration? Thanks!

I seem to have three options for default training configs:

The current one linked above in master branch
The one in the colab demo commit c3405c53e435a06c809533aa4453923469081147
The one in the from model.factory import tts_ljspeech import, which has 260K max steps linked from https://public-asai-dl-models.s3.eu-central-1.amazonaws.com/TransformerTTS/api_weights/ljspeech_tts_config_v1.yaml

from transformertts.

cfrancesco commented on June 10, 2024

Hi,
batch sizes are dynamic. Samples are bucketed by duration, so the batch size depends on how many samples there are in each bin. Max sizes are specified in the bucket_batch_sizes for each interval.
The max steps have been reduced over time because of more efficient training (such as the addition of diagonality loss).

from transformertts.

Related Issues (20)

Recommend Projects

Average training time in Google Colab with GPU about transformertts HOT 3 OPEN

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent