Coder Social home page Coder Social logo

guoyang94's Projects

athena icon athena

an open-source implementation of sequence-to-sequence based speech processing engine

audiogpt icon audiogpt

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

audiomentations icon audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

awesome-diffusion-models icon awesome-diffusion-models

A collection of resources and papers on Diffusion Models and Score-based Models, a darkhorse in the field of Generative Models

bark icon bark

🔊 Text-prompted Generative Audio Model

cross-lingual-voice-cloning icon cross-lingual-voice-cloning

Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.

diffsinger icon diffsinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

dns-challenge icon dns-challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

espnet icon espnet

End-to-End Speech Processing Toolkit

fastspeech2 icon fastspeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

flowtron icon flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

hifi-gan icon hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

lpcnet icon lpcnet

Efficient neural speech synthesis

mellotron icon mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

multilingual_text_to_speech icon multilingual_text_to_speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

nemo icon nemo

NeMo: a toolkit for conversational AI

paralleltts icon paralleltts

A fast parallel text-to-speech (tts) model. Work well for English, Mandarin, Japanese, Korean, Russian and Tibetan (so far). 快速并行语音合成模型,适用于英语、普通话、日语、韩语、俄语和藏语(当前已测试)。

parallelwavegan icon parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

pits icon pits

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

pyannote-audio icon pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

resemblyzer icon resemblyzer

A python package to analyze and compare voices with deep learning

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.