Coder Social home page Coder Social logo

hcwu1993's Projects

amphion icon amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

deepvoice3_pytorch icon deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

encodec icon encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

genshinaudio icon genshinaudio

All audio extracted from Genshin Impact, music, voicelines and everything else

gpt-sovits icon gpt-sovits

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

hifi-gan icon hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

llama2.c icon llama2.c

Inference Llama 2 in one file of pure C

merlin icon merlin

This is now the official location of the Merlin project.

natspeech icon natspeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

parler-tts icon parler-tts

Inference and training library for high-quality TTS models.

parrot icon parrot

RNN-based generative models for speech.

tensorflow icon tensorflow

Computation using data flow graphs for scalable machine learning

tts icon tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

video-subtitle-extractor icon video-subtitle-extractor

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

vits icon vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

waveglow icon waveglow

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.