Topic: speech Goto Github
Some thing interesting about speech
Some thing interesting about speech
speech,OpenAI Whisper ASR Webservice API
User: ahmetoner
Home Page: https://ahmetoner.github.io/whisper-asr-webservice
speech,AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Organization: aigc-audio
Home Page: https://huggingface.co/spaces/AIGC-Audio/AudioGPT
speech,🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
User: avinashkranjan
Home Page: https://amazing-python-scripts.avinashranjan.com
speech,🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
User: babysor
speech,SALMONN: Speech Audio Language Music Open Neural Network
Organization: bytedance
Home Page: https://bytedance.github.io/SALMONN/
speech,MARS5 speech model (TTS) from CAMB.AI
Organization: camb-ai
Home Page: https://www.camb.ai
speech,🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Organization: coqui-ai
Home Page: http://coqui.ai
speech,Community list of startups working with AI in audio and music technology
User: csteinmetz1
Home Page: https://csteinmetz1.github.io/ai-audio-startups/
speech,DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/
Organization: delta-ml
Home Page: https://delta-didi.readthedocs.io/
speech,自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
User: dengbocong
speech,Controllable and fast Text-to-Speech for over 7000 languages!
Organization: digitalphonetics
speech,💬 SpeechGPT is a web application that enables you to converse with ChatGPT.
User: hahahumble
Home Page: https://speechgpt.app
speech,General Speech Restoration
User: haoheliu
Home Page: https://haoheliu.github.io/demopage-voicefixer/
speech,🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Organization: huggingface
Home Page: https://huggingface.co/docs/datasets
speech,Speech To Speech: an effort for an open-sourced and modular GPT4-o
Organization: huggingface
speech,A simple, high-quality voice conversion tool focused on ease of use and performance.
Organization: iahispano
Home Page: https://applio.org
speech,StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Organization: ictnlp
Home Page: https://ictnlp.github.io/StreamSpeech-site/
speech,Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Organization: idea-research
Home Page: https://arxiv.org/abs/2401.14159
speech,Free, easy, portable audio engine for games
User: jarikomppa
Home Page: http://soloud-audio.com
speech,Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
User: jianchang512
Home Page: https://pyvideotrans.com
speech,Open-Source Large Vocabulary Continuous Speech Recognition Engine
Organization: julius-speech
speech,kaldi-asr/kaldi is the official location of the Kaldi project.
Organization: kaldi-asr
Home Page: http://kaldi-asr.org
speech,A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
User: kyubyong
speech,Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Organization: linto-ai
speech,WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
User: m-bain
speech,Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
User: mahmoudashraf97
speech,Foundational model for human-like, expressive TTS
Organization: metavoiceio
Home Page: https://themetavoice.xyz/
speech,The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
User: miteshputhran
speech,An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Organization: modelscope
speech,ModelScope: bring the notion of Model-as-a-Service to life.
Organization: modelscope
Home Page: https://www.modelscope.cn/
speech,:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Organization: mozilla
speech,pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
User: mravanelli
speech,EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
User: netease-youdao
speech,Fully customizable AI chatbot component for your website
User: ovidijusparsiunas
Home Page: https://deepchat.dev
speech,Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
Organization: paddlepaddle
speech,Python library and CLI tool to interface with Google Translate's text-to-speech API
User: pndurette
Home Page: http://gtts.readthedocs.org/
speech,Praat: Doing Phonetics By Computer
Organization: praat
Home Page: http://www.praat.org
speech,Data manipulation and transformation for audio signal processing, powered by PyTorch
Organization: pytorch
Home Page: https://pytorch.org/audio
speech,WaveNet vocoder
User: r9y9
Home Page: https://r9y9.github.io/wavenet_vocoder/
speech,aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
User: readbeyond
Home Page: http://www.readbeyond.it/aeneas/
speech,Noise supression using deep filtering
User: rikorose
Home Page: https://huggingface.co/spaces/hshr/DeepFilterNet2
speech,Videos, notes and experiments to understand deep learning
User: roatienza
speech,Code examples for new APIs of iOS 10.
User: shu223
speech,Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
User: snakers4
speech,Silero VAD: pre-trained enterprise-grade Voice Activity Detector
User: snakers4
speech,SoftVC VITS Singing Voice Conversion
Organization: svc-develop-team
speech,💬 Speech recognition for your site
User: talater
Home Page: https://www.talater.com/annyang/
speech,Lingvo
Organization: tensorflow
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.