charlottecuc Goto Github PK
Name: Xiaomin Tang
Type: User
Company: University of Edinburgh
Bio: The University of Edinburgh | Speech Synthesis | Voice Conversion | Automatic Speech Recognition | NLP
Location: UK
Name: Xiaomin Tang
Type: User
Company: University of Edinburgh
Bio: The University of Edinburgh | Speech Synthesis | Voice Conversion | Automatic Speech Recognition | NLP
Location: UK
using world vocoder to extract features and make data for training neural networks
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
A PyTorch implementation of the FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Auto-regressive flow-based generative network for text to speech synthesis
A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS
A Generative Flow for Text-to-Speech via Monotonic Alignment Search
An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
:computer: A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech :speaker: from text
A Python library for creating and manipulating musical patterns, designed for use in algorithmic composition, generative music and sonification. Can be used to generate MIDI events, MIDI files, OSC messages, or custom events.
lecture code
Code created for lecture during spring 19
pYIN pitch detection implementation with librosa and python 3
A method to generate speech across multiple speakers
Efficient neural speech synthesis
Simulation of parallel synthesis with LPCNet vocoder
Speech Toolkit for bahasa Malaysia, https://malaya-speech.readthedocs.io/
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data
This is now the official location of the Merlin project.
๐AIๆๅฃฐ: 5็งๅ ๅ ้ๆจ็ๅฃฐ้ณๅนถ็ๆไปปๆ่ฏญ้ณๅ ๅฎน Clone a voice in 5 seconds to generate arbitrary speech in real-time
A demo integration between Modulate's Voice Skin SDK and Agora's Voice Chat SDK
Command line utility for forced alignment using Kaldi
VCTK multi-speaker tacotron for ICASSP 2020
Multi-speaker Tacotron in TensorFlow.
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.