Comments (4)
Hi,
I haven't encountered this error before. The error msg doesn't tell much either.
However, I have a hunch to suspect this problem may be caused by downsampling and upsampling.
To verify this:
Method 1 - can you try to save a randomly initialized aalbert model with downsample_rate: 1
:
- set
downsample_rate: 1
andsave_step: 1
in your .yaml - run the training script over 1 step then ctrl+c, this will save a randomly initialized model at training step 1
- try to load this new model and see if the error still occurs.
Method 2 - in your pytorch-kaldi/nn_transfromer.py
, print and verify that the input feature length and output representation length are the same.
- add
print(x.shape)
at both the beginning and end of the forward function of yourpytorch-kaldi/nn_transfromer.py
. - execute the pytorch-kaldi
run_exp.py
- The input acoustic feature should have the shape of: (
time steps
,batch size=12
,dim=40
);
and the output representation should have the shape of (time steps
,batch size=12
,dim=768
).
Thetime steps
number should be the same.
Please let me know if this is the case!
FYI, we find that downsample_rate: 1
is more suitable for the pytorch-kaldi DNN/HMM framework.
Using a downsample rate > 1 always yield worse results.
from s3prl.
Hi,
I haven't encountered this error before. The error msg doesn't tell much either.
However, I have a hunch to suspect this problem may be caused by downsampling and upsampling.
To verify this:Method 1 - can you try to save a randomly initialized aalbert model with
downsample_rate: 1
:
- set
downsample_rate: 1
andsave_step: 1
in your .yaml- run the training script over 1 step then ctrl+c, this will save a randomly initialized model at training step 1
- try to load this new model and see if the error still occurs.
Method 2 - in your
pytorch-kaldi/nn_transfromer.py
, print and verify that the input feature length and output representation length are the same.
- add
print(x.shape)
at both the beginning and end of the forward function of yourpytorch-kaldi/nn_transfromer.py
.- execute the pytorch-kaldi
run_exp.py
- The input acoustic feature should have the shape of: (
time steps
,batch size=12
,dim=40
);
and the output representation should have the shape of (time steps
,batch size=12
,dim=768
).
Thetime steps
number should be the same.Please let me know if this is the case!
FYI, we find that
downsample_rate: 1
is more suitable for the pytorch-kaldi DNN/HMM framework.
Using a downsample rate > 1 always yield worse results.
thanks for replying, method One works for me!
Could you tell me what experiments you had done training ASR?About which structure of ASR you trained after pre-trained with aalbert/mockingjay,
I mean if aalbert acts the similar role with tdnn, then I should train ASR with simple fc layers and aalbert pretrain structure ,and ASR could aquire the same accuracy or better than kaldi tdnn ASR?
from s3prl.
I've got answers from another issue, thanks for replying!I'll close this issue.
from s3prl.
thanks for replying, method One works for me!
I've tried setting downsample to 3 to match your setting, but I did not encounter any error.
Upsampling and downsampling worked fine, not sure why there is an error in your case.
Also, I noticed that the AALBERT config file is not set according to the original paper.
I've made some adjustments and updates in this commit.
Note that the previous settings are for our new work TERA, and not AALBERT.
I suggest you use the updated default config for future AALBERT training.
Happy training!
from s3prl.
Related Issues (20)
- Feature request for Language Identification on ML-SUPERB dataset HOT 5
- Multiresolution HuBERT as a new upstream HOT 7
- No module named 's3prl.superb' HOT 1
- Is this required for the SS and SE task? assert abs(feat_list[i].size(0) - length_list[i]) < 5. I am getting this error for wav2vec HOT 6
- Different upstream and downstream learning rates HOT 1
- ValueError: mutable default <class 's3prl.upstream.roberta.roberta_model.EncDecBaseConfig'> for field encoder is not allowed: use default_factory HOT 3
- Not able to submit the results. HOT 4
- The rules for conformity for emotion recognition. HOT 5
- Potential SpecAug Issue HOT 1
- What is the accept rate in the VC task evaluation output? HOT 1
- a question about two-stage downstream task HOT 1
- ASVspoof Dateset Support HOT 2
- Requesting to add CLSRIL-23 pretrained model as new upstream HOT 6
- Cannot submit my results in the leaderboard HOT 4
- Document link broken HOT 1
- Broken link HOT 4
- How to extract weighted sum SSL representations from an audio dataset?
- 使用自己的数据进行预训练
- run vq_apc pretrain failed HOT 3
- QbE downstream HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from s3prl.