Coder Social home page Coder Social logo

Comments (3)

cfrancesco avatar cfrancesco commented on June 10, 2024 2

Hi, I trained the autoregressive models for about 600K steps (some less) and around the same for the forward models. This should take, if I remember correctly, about 2-3 days (on RTX 2080).

from transformertts.

tylerweitzman avatar tylerweitzman commented on June 10, 2024

I'm getting 1.7 s/it on tts training and 5.9 s/it on aligner training on a Tesla P100 16GB on Colab

I'm trying to figure out how batch size plays into this, if I had more GPU memory for example. The config file only has bucket_batch_sizes but no batch_size so I'm not sure what batch size this is running onβ€” I think bucket_batch_sizes is only for the aligner?

Also, it looks like my default config is different than yours @giymen https://github.com/as-ideas/TransformerTTS/blob/main/config/training_config.yaml shows a max step of 260,000 for example, not 900,000, and not 600,000, so, there may also be other things changed (say dimensions) that would impact the number of parameters and therefore the training time. @cfrancesco could you explain the batch size and the discrepancy in default training configuration? Thanks!

I seem to have three options for default training configs:

  1. The current one linked above in master branch
  2. The one in the colab demo commit c3405c53e435a06c809533aa4453923469081147
  3. The one in the from model.factory import tts_ljspeech import, which has 260K max steps linked from https://public-asai-dl-models.s3.eu-central-1.amazonaws.com/TransformerTTS/api_weights/ljspeech_tts_config_v1.yaml

from transformertts.

cfrancesco avatar cfrancesco commented on June 10, 2024

Hi,
batch sizes are dynamic. Samples are bucketed by duration, so the batch size depends on how many samples there are in each bin. Max sizes are specified in the bucket_batch_sizes for each interval.
The max steps have been reduced over time because of more efficient training (such as the addition of diagonality loss).

from transformertts.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.