Coder Social home page Coder Social logo

cyclevae-vc-neuralvoco's Introduction

Low-latency real-time multispeaker voice conversion (VC) with cyclic variational autoencoder (CycleVAE) and multiband WaveRNN using data-driven linear prediction (MWDLP)

Requirements:

  • UNIX
  • 3.6 >= python <= 3.9
  • CUDA 11.1
  • virtualenv
  • jq
  • make
  • gcc

Installation

$ cd tools
$ make
$ cd ..

Latest version

  • 3.1 (2021/09/25)
    • Finalize VC and MWDLP Python implementations (impl.)
    • Bug fixes on C impl. to match the output of Python impl.
    • Fix input segmental convolution impl. as in original papers while allowing usage in real-time demo
    • Update MWDLP demo and samples with VCC20 dataset
    • Update VC demo and samples with VCC20 dataset

Compilable demo

Samples from compilable demo

Steps to build the models:

  1. Data preparation and preprocessing
  2. VC and neural vocoder models training [~ 2.5 and 4 days each, respectively]
  3. VC fine-tuning with fixed neural vocoder [~ 2.5 days]
  4. VC decoder fine-tuning with fixed encoder and neural vocoder [~ 2.5 days]

Steps for real-time low-latency decoding with CPU:

  1. Dump and compile models
  2. Decode

Real-time implementation is based on LPCNet.

Details

Please see egs/cycvae_mwdlp_vcc20/README.md for more details on VC + neural vocoder

or

egs/mwdlp_vcc20/README.md for more details on neural vocoder only.

References

[1] High-Fidelity and Low-Latency Universal Neural Vocoder based on Multiband WaveRNN with Data-Driven Linear Prediction for Discrete Waveform Modeling

[2] Low-latency real-time non-parallel voice conversion based on cyclic variational autoencoder and multiband WaveRNN with data-driven linear prediction

Contact

Patrick Lumban Tobing

[email protected]

[email protected]

cyclevae-vc-neuralvoco's People

Contributors

patrickltobing avatar farishijazi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.