mindspore-lab / mindaudio Goto Github PK
View Code? Open in Web Editor NEWA toolbox of audio models and algorithms based on MindSpore
License: Apache License 2.0
A toolbox of audio models and algorithms based on MindSpore
License: Apache License 2.0
Turn a spectrogram from the amplitude/power scale to decibel scale.
Compute the norm of complex number sequence.
Trim an audio signal to keep concecutive non-silent segment.
Create a context window from an audio signal to gather multiple time step in a single feature vector.
Returns the array with the surrounding context.
Turn a dB-scaled spectrogram to the power/amplitude scale.
执行8p训练报错,在刚开始打印训练日志时存在报错信息,但训练可以继续向下执行,并正常打印loss值等信息
Apply sliding-window cepstral mean (and optionally variance) normalization per utterance.
Create a spectral centroid from an audio signal.
Inverts channels of the audio.
Create a mel-scaled spectrogram from an audio signal.
Write a numpy array as a WAV file.
Transform stereo audios into mono audio by averaging different channels.
readwav
Split an audio signal into non-silent intervals.
Apply masking to a spectrogram in the frequency domain.
Compute delta coefficients of a spectrogram.
Short-time Fourier transform (STFT).
Performs signal rescaling to a target level.
Resample a signal from one frequency to another. A resample method can be given.
执行单卡训练时报错
Normalizes a signal to unitary average or peak amplitude.
Compute the norm of complex number sequence.
Normalize an array along a specified axis.
Update build version from 0.1.0 to 0.1.1
add reverb.
Generate Mel-frequency cepstrum coefficients (MFCC) features from input audio signal.
Convert normal STFT to STFT at the Mel scale
Apply masking to a spectrogram in the time domain.
Compute amplitude of a batch of waveforms.
np.ndarray, the time domain signal.
Create a spectrogram from an audio signal.
请在readme中补充分布式训练的场景,目前训练部分仅单卡训练的说明
add background noise.
readme关于数据预处理过程的描述
Preprocess data to get a "_wav.npy" and "_feature.npy" for each ".wav" file in your dataset folder. Set your data_path and manifest_path in wavegrad_base.yaml. You can now run the following command:
python recipes/LJSpeech/tts/wavegrad/preprocess.py --device_target CPU --device_id 0
Separate a complex-valued spectrogram with shape (..., 2) into its magnitude and phase.
A notch filter only filters a very narrow band.
Reverberate a given signal with given a Room Impulse Response (RIR). It performs convolution between RIR and signal,
but without changing the original amplitude of the signal.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.