Comments (3)
Hi, I trained the autoregressive models for about 600K steps (some less) and around the same for the forward models. This should take, if I remember correctly, about 2-3 days (on RTX 2080).
from transformertts.
I'm getting 1.7 s/it on tts training and 5.9 s/it on aligner training on a Tesla P100 16GB on Colab
I'm trying to figure out how batch size plays into this, if I had more GPU memory for example. The config file only has bucket_batch_sizes
but no batch_size
so I'm not sure what batch size this is running onβ I think bucket_batch_sizes is only for the aligner?
Also, it looks like my default config is different than yours @giymen https://github.com/as-ideas/TransformerTTS/blob/main/config/training_config.yaml
shows a max step of 260,000 for example, not 900,000, and not 600,000, so, there may also be other things changed (say dimensions) that would impact the number of parameters and therefore the training time. @cfrancesco could you explain the batch size and the discrepancy in default training configuration? Thanks!
I seem to have three options for default training configs:
- The current one linked above in master branch
- The one in the colab demo commit
c3405c53e435a06c809533aa4453923469081147
- The one in the
from model.factory import tts_ljspeech
import, which has 260K max steps linked from https://public-asai-dl-models.s3.eu-central-1.amazonaws.com/TransformerTTS/api_weights/ljspeech_tts_config_v1.yaml
from transformertts.
Hi,
batch sizes are dynamic. Samples are bucketed by duration, so the batch size depends on how many samples there are in each bin. Max sizes are specified in the bucket_batch_sizes for each interval.
The max steps have been reduced over time because of more efficient training (such as the addition of diagonality loss).
from transformertts.
Related Issues (20)
- Training in 3090 takes ~30 seconds per step HOT 2
- ERROR: while preparing training data HOT 1
- inference error HOT 1
- how can i save the audio file if i am using thee pretrained model in google colab
- Word timestamps
- layer.py TransposedCNNResNorm
- Get rid of the "robotic" sound HOT 4
- model.hdf5 file does not create
- No module named 'decorator'
- How to install on windows?
- the missing of HiFiGAN model.pt HOT 2
- Pause between sentence
- model.pt
- Alignments in PyTorch implementation
- Can't Finish PHONEMIZING On Google Colab. HOT 2
- Error : raise StopIteration StopIteration
- error to run a train_aligner.py
- how Normalized dataset
- DeepPhonemizer? (Phonemizer License Issue)
- About model structute.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from transformertts.