charlottecuc Goto Github PK

followers: 81.0 following: 151.0 repos: 115.0 gists: 0.0

Name: Xiaomin Tang

Type: User

Company: University of Edinburgh

Bio: The University of Edinburgh | Speech Synthesis | Voice Conversion | Automatic Speech Recognition | NLP

Location: UK

Xiaomin Tang's Projects

extract_features_using_world

using world vocoder to extract features and make data for training neural networks

fac-via-ppg

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

fftnet

A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

flowavenet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

flowtron

Auto-regressive flow-based generative network for text to speech synthesis

gan-tts

A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS

glow-tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

glow_tts

An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.

gst-tacotron

A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"

gst-tacotron-1

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

hmm-for-emo-tts

:computer: A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech :speaker: from text

A Python library for creating and manipulating musical patterns, designed for use in algorithmic composition, generative music and sonification. Can be used to generate MIDI events, MIDI files, OSC messages, or custom events.

lecturecode-sp18

lecture code

lecturecode-sp19

Code created for lecture during spring 19

librosa_py3_pyin

pYIN pitch detection implementation with librosa and python 3

loop

A method to generate speech across multiple speakers

lpcnet

Efficient neural speech synthesis

lpcnet_parallel

Simulation of parallel synthesis with LPCNet vocoder

malaya-speech

Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/

mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

merlin

This is now the official location of the Merlin project.

meta-tts

mockingbird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

modulateagorademo

A demo integration between Modulate's Voice Skin SDK and Agora's Voice Chat SDK

montreal-forced-aligner

Command line utility for forced alignment using Kaldi

multi-speaker-tacotron

VCTK multi-speaker tacotron for ICASSP 2020

multi-speaker-tacotron-tensorflow

Multi-speaker Tacotron in TensorFlow.

multilingual_text_to_speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

charlottecuc Goto Github PK

Xiaomin Tang's Projects

Recommend Projects

Recommend Topics

Recommend Org