mindspore-lab / mindaudio

A toolbox of audio models and algorithms based on MindSpore

License: Apache License 2.0

Python 18.47% Shell 0.03% Jupyter Notebook 81.50%
deep-learning speech-recognition audio speaker-verification mindspore

mindaudio's People

Contributors

daiyuxin0511, geniuspatrick, jianyunchao, kingcong, litingyu1997, lvyufeng, vigo999, yiluxiangbei, yuedongli1, zhanghuiyao

mindaudio's Issues

[API][330][context_window]

Create a context window from an audio signal to gather multiple time steps into a single feature vector.
Returns the array with the surrounding context.
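The idea can be sketched in plain Python as follows. This is only an illustration, not the mindaudio implementation: the function name `context_window_sketch`, the `left`/`right` parameters, and the zero padding at the edges are all assumptions made for this example.

```python
def context_window_sketch(frames, left, right):
    """Stack each frame with `left` previous and `right` following frames.

    frames: list of feature vectors, one per time step.
    Frames that fall outside the signal are replaced by zeros (an assumption).
    Returns one vector of length dim * (left + 1 + right) per time step.
    """
    n_frames = len(frames)
    dim = len(frames[0]) if frames else 0
    zero = [0.0] * dim
    stacked = []
    for t in range(n_frames):
        row = []
        for offset in range(-left, right + 1):
            idx = t + offset
            row.extend(frames[idx] if 0 <= idx < n_frames else zero)
        stacked.append(row)
    return stacked


frames = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0], [6.0, 7.0]]
stacked = context_window_sketch(frames, left=1, right=1)
print(len(stacked), len(stacked[0]))
```

With one frame of context on each side, every two-dimensional frame becomes a six-dimensional vector holding the previous, current, and next frame.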

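[LJSpeech-wavegrad][Ascend][GRAPH] Distributed training reports an error

8-device (8p) training reports an error. The error messages appear when the training log first starts printing, but training continues and prints loss values and other information normally.

  1. Steps to reproduce: mpirun --allow-run-as-root -n 8 python recipes/LJSpeech/tts/wavegrad/train.py --device_target Ascend --is_distributed True --context_mode graph
  2. Error screenshot: uploading screenshots is not supported at the moment; it will be sent to the developers online for discussion.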

[API][330][resample]

Resample a signal from one frequency to another. A resample method can be given.
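As a rough illustration of one possible resample method, the sketch below uses linear interpolation. It is not the mindaudio API: the name `resample_linear` and its parameters are hypothetical, and no anti-aliasing filter is applied, so a production resampler would normally low-pass filter before downsampling.

```python
def resample_linear(signal, orig_rate, new_rate):
    """Resample `signal` from orig_rate to new_rate by linear interpolation."""
    if not signal:
        return []
    n_out = int(len(signal) * new_rate / orig_rate)
    last = len(signal) - 1
    out = []
    for i in range(n_out):
        pos = i * orig_rate / new_rate   # position measured in input samples
        left = int(pos)
        right = min(left + 1, last)      # hold the last sample at the end
        frac = pos - left
        out.append((1.0 - frac) * signal[left] + frac * signal[right])
    return out


ramp = [0.0, 1.0, 2.0, 3.0]
upsampled = resample_linear(ramp, orig_rate=4, new_rate=8)
downsampled = resample_linear(ramp, orig_rate=4, new_rate=2)
print(upsampled)
print(downsampled)
```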

[API][330][mfcc]

Generate Mel-frequency cepstrum coefficients (MFCC) features from input audio signal.
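The usual MFCC pipeline (windowing, power spectrum, mel filterbank, log, DCT) can be sketched with the standard library alone. Everything here is an assumption made for illustration: the function names, the frame and filter sizes, the Hamming window, and the unnormalized DCT-II are not taken from mindaudio, and the naive DFT is far too slow for real use.

```python
import cmath
import math


def hz_to_mel(hz):
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        spec.append(abs(acc) ** 2 / n)
    return spec


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters whose edges are evenly spaced on the mel scale."""
    top = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges[m - 1], edges[m], edges[m + 1]
        bank.append([max(0.0, min((f - lo) / (mid - lo), (hi - f) / (hi - mid)))
                     for f in bin_hz])
    return bank


def mfcc_sketch(signal, sample_rate, frame_len=64, hop=32, n_filters=10, n_coeffs=5):
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    window = [0.54 - 0.46 * math.cos(2 * math.pi * i / (frame_len - 1))
              for i in range(frame_len)]
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        power = power_spectrum(frame)
        # Small constant avoids log(0) for empty filters.
        log_energy = [math.log(sum(w * p for w, p in zip(row, power)) + 1e-10)
                      for row in bank]
        # Unnormalized DCT-II of the log mel energies.
        feats.append([sum(e * math.cos(math.pi * c * (j + 0.5) / n_filters)
                          for j, e in enumerate(log_energy))
                      for c in range(n_coeffs)])
    return feats


sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(256)]
coeffs = mfcc_sketch(tone, sample_rate)
print(len(coeffs), len(coeffs[0]))
```

The result holds one row of cepstral coefficients per frame; the number of frames follows from the signal length, frame length, and hop size.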

[LJSpeech-wavegrad] Please clarify the meaning of manifest_path in the README

The README describes the data preprocessing step as follows:
Preprocess data to get a "_wav.npy" and "_feature.npy" for each ".wav" file in your dataset folder. Set your data_path and manifest_path in wavegrad_base.yaml. You can now run the following command:
python recipes/LJSpeech/tts/wavegrad/preprocess.py --device_target CPU --device_id 0

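  1. An error occurred the first time I ran data preprocessing. After talking with the developers, I learned that manifest_path in the description above is the path of the newly generated .csv file, not the path of metadata.csv from the extracted original dataset.
  2. I suggest adding a note about this path so that users do not misunderstand it.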

[API][330][magphase]

Separate a complex-valued spectrogram with shape (..., 2) into its magnitude and phase.
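A minimal sketch of the computation, assuming the last dimension stores the (real, imaginary) pair. The nested-list representation and the name `magphase_sketch` are choices made for this example and do not reflect the mindaudio signature.

```python
import math


def magphase_sketch(spec):
    """Split nested lists ending in [real, imag] pairs into magnitude and phase.

    Returns two nested lists with the same leading shape as `spec`.
    """
    if len(spec) == 2 and all(isinstance(v, (int, float)) for v in spec):
        real, imag = spec
        return math.hypot(real, imag), math.atan2(imag, real)
    pairs = [magphase_sketch(item) for item in spec]
    return [p[0] for p in pairs], [p[1] for p in pairs]


spec = [[3.0, 4.0], [0.0, 1.0], [-2.0, 0.0]]
mags, phases = magphase_sketch(spec)
print(mags)
print(phases)
```

The magnitude is the Euclidean norm of each pair and the phase is the angle given by `atan2`, in radians.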

[API][330][reverberate]

Reverberate a given signal with a given Room Impulse Response (RIR). It performs convolution between the RIR and the signal,
but without changing the original amplitude of the signal.
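The description can be sketched as a direct convolution followed by a rescaling step. The details are assumptions for illustration only: the name `reverberate_sketch`, truncating the output to the input length, and using the mean absolute value as the measure of "amplitude" are not confirmed by the issue text.

```python
def reverberate_sketch(signal, rir):
    """Convolve `signal` with `rir`, then rescale to the original amplitude."""
    n = len(signal)
    if n == 0:
        return []
    # Direct convolution, truncated to the length of the input signal.
    wet = [0.0] * n
    for i, s in enumerate(signal):
        for j, h in enumerate(rir):
            if i + j < n:
                wet[i + j] += s * h
    # Match the mean absolute amplitude of the dry signal (assumed measure).
    orig_amp = sum(abs(x) for x in signal) / n
    wet_amp = sum(abs(x) for x in wet) / n
    if wet_amp == 0.0:
        return wet
    scale = orig_amp / wet_amp
    return [x * scale for x in wet]


dry = [1.0, 0.0, 0.0, 0.0]
rir = [1.0, 0.5, 0.25]
wet = reverberate_sketch(dry, rir)
print(wet)
```

For an impulse input, the output follows the shape of the RIR, scaled so that its mean absolute amplitude equals that of the dry signal.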
