Comments (8)
This is normal, because you computed the MFCCs in a different way.
from autopst.
Thanks for your reply.
Hello, can you please tell us what the correct way to generate mfcc_stats is?
@avanitanna Just compute the mean and std of the mfcc feature.
@auspicious3000 I understand. How should I go from wav files to computing mfcc features and their mean and std? Do you have a script that we can use? I would love to use your work and cite it but it is a little difficult to get the code to work with new training data. I would appreciate your help!
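import numpy as np
import scipy.fftpack

# DCT-II basis over the 80 mel bins
dctmx = scipy.fftpack.dct(np.eye(80), type=2, axis=1, norm='ortho')
# compute mfcc stats using all spectrograms (sp_all has shape (total_frames, 80))
mfcc_all = sp_all.dot(dctmx)
mfcc_mean, mfcc_std = np.mean(mfcc_all, axis=0), np.std(mfcc_all, axis=0)
# normalize each mfcc (sp_tmp is a single spectrogram of shape (frames, 80))
cc_tmp = sp_tmp.dot(dctmx)
cc_norm = (cc_tmp - mfcc_mean) / mfcc_std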
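A small check may help clarify what the `dctmx` product does: multiplying a spectrogram by the DCT of the identity matrix should give the same result as applying the DCT directly along the mel axis. The sketch below uses a random array as a stand-in for a real mel spectrogram, which is an assumption made purely for illustration.

```python
import numpy as np
import scipy.fftpack

# Random stand-in for a (frames, 80) mel spectrogram; not real data.
rng = np.random.default_rng(0)
sp = rng.standard_normal((5, 80))

# Same matrix as in the snippet above: each row is the DCT of a unit vector.
dctmx = scipy.fftpack.dct(np.eye(80), type=2, axis=1, norm='ortho')

# By linearity, the matrix product equals the DCT applied to each frame.
via_matrix = sp.dot(dctmx)
via_dct = scipy.fftpack.dct(sp, type=2, axis=1, norm='ortho')
same = np.allclose(via_matrix, via_dct)

# With norm='ortho' the DCT-II matrix is orthonormal.
orthonormal = np.allclose(dctmx.dot(dctmx.T), np.eye(80))
print(same, orthonormal)
```

So the precomputed matrix is a convenience: it is built once and reused for every spectrogram.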
@auspicious3000 Thank you! How do you get sp_all, and what is sp_tmp? Is sp_all a concatenation of all spectrograms? How do I create it? Does the following make sense?
Say I have multiple spectrograms:
mfcc_list = []
for file_name in ['p225_003.npy', 'p225_008.npy', ...]:
    f = np.load(file_name)
    mfcc_list.append(f)
sp_all = np.concatenate(mfcc_list, axis=0)
mfcc_all = sp_all.dot(dctmx)
@avanitanna sp_all is the concatenation of all mel spectrograms; sp_tmp is the spectrogram you want to normalize.
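Putting the pieces of this thread together, the whole procedure can be sketched as two small functions. This is a minimal sketch, not the repository's own script: the function names `compute_mfcc_stats` and `normalize_mfcc` are hypothetical, and the random arrays stand in for mel spectrograms of shape (frames, 80), which in practice would come from something like `np.load` on each file.

```python
import numpy as np
import scipy.fftpack


def compute_mfcc_stats(spectrograms):
    """Hypothetical helper: per-coefficient MFCC mean and std over all frames.

    spectrograms: list of arrays, each of shape (frames, n_mels).
    """
    n_mels = spectrograms[0].shape[1]
    dctmx = scipy.fftpack.dct(np.eye(n_mels), type=2, axis=1, norm='ortho')
    # sp_all: every spectrogram stacked along the frame axis
    sp_all = np.concatenate(spectrograms, axis=0)
    mfcc_all = sp_all.dot(dctmx)
    return dctmx, np.mean(mfcc_all, axis=0), np.std(mfcc_all, axis=0)


def normalize_mfcc(sp_tmp, dctmx, mfcc_mean, mfcc_std):
    """Hypothetical helper: normalize the MFCCs of one spectrogram."""
    return (sp_tmp.dot(dctmx) - mfcc_mean) / mfcc_std


# Synthetic stand-ins for loaded spectrograms (assumed 80 mel bins).
rng = np.random.default_rng(0)
spectrograms = [rng.random((n, 80)) for n in (120, 95, 143)]

dctmx, mfcc_mean, mfcc_std = compute_mfcc_stats(spectrograms)
cc_norm = normalize_mfcc(spectrograms[0], dctmx, mfcc_mean, mfcc_std)
print(mfcc_mean.shape, mfcc_std.shape, cc_norm.shape)
```

The statistics are computed once over the whole training set and then reused for each utterance, so a single utterance will generally not have exactly zero mean and unit variance after normalization.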