Coder Social home page Coder Social logo

yuekaizhang

Yuekai Zhang's Projects

accelerate icon accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

amphion icon amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio icon audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

ctc_decoder icon ctc_decoder

A ctc decoder for both online and offline asr model

espnet icon espnet

End-to-End Speech Processing Toolkit

fastchat icon fastchat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

funasr icon funasr

A Fundamental End-to-End Speech Recognition Toolkit

gss icon gss

A simple package for Guided source separation (GSS)

k2 icon k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

lhotse icon lhotse

Tools for handling speech data in machine learning projects.

minutes icon minutes

Podcast Summarizer with LLM Technology

nemo icon nemo

NeMo: a toolkit for conversational AI

nemo-guardrails icon nemo-guardrails

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

sherpa icon sherpa

Streaming and non-streaming ASR server in Python

sherpa-onnx icon sherpa-onnx

Real-time speech recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, websocket server/client, C/C++, Python, Kotlin

vall-e icon vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

wenet icon wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

wetts icon wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

whisper icon whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.