sanchit-gandhi Goto Github PK

followers: 595 following: 1 repos: 47 gists: 2

Name: Sanchit Gandhi

Type: User

Company: @huggingface

Bio: Open-Source Speech @huggingface

Location: London, UK

Sanchit Gandhi's Projects

alignment-handbook

Robust recipes to align language models with human and AI preferences

asr-rnnt

Repository to train NVIDIA NeMo RNN-T BPE models with Hugging Face Datasets and the Hugging Face Trainer 🤗. Training scripts to be migrated to Hugging Face Transformers when complete.

audio-transformers-course

The Hugging Face Course on Transformers for Audio

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

audioldm2

Text-to-Audio/Music Generation

benchmark-asr

blog

Public repo for HF blog posts

candle

Minimalist ML framework for Rust

codesnippets

course

The Hugging Face course

dalle-mini

DALL·E Mini - Generate images from a text prompt

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

dataspeech

diarizers

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

faster-whisper

Faster Whisper transcription with CTranslate2

flax

Flax is a neural network library for JAX that is designed for flexibility.

hub-docs

Frontend components, documentation and information hosted on the Hugging Face website.

insanely-fast-whisper

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

musicldm

The latent diffusion model for text-to-music generation.

notebooks

A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).

open_asr_leaderboard

optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

parler-tts

Inference and training library for high-quality TTS models.

pyannote-audio-ka

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation
