xxsuper
Type: User
Build real-time multimodal AI applications 🤖🎙️📹
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
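Ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed those of the open-source ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries.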
Code and dataset for photorealistic Codec Avatars driven from audio
🔊 Text-Prompted Generative Audio Model
Bark Voice Cloning and Voice Cloning for Chinese Speech
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simplified Inference (< 8G VRAM for 1024X768 resolution).
ChatTTS is a generative speech model for daily dialogue.
News management backend
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
A real-time conversational application built on Alibaba Cloud's TTS, LLM, and STT models
LLM based TTS model, providing inference/training/deployment full-stack ability.
Scraper for Xigua short videos
DeepFaceLab is the leading software for creating deepfakes.
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
A digital-human video player with an HTTP API that uses the gradio API to connect to Easy-Wav2Lip, Sadtalker, and GeneFacePlusPlus
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
Apache Dubbo is a high-performance, java based, open source RPC framework.
Dubbo service management and monitoring system
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
An example project for the book 'Go Programming & Concurrency in Practice, 2nd edition' (《Go并发编程实战》第2版).
Next generation face swapper and enhancer
Brand new TTS solution
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
A web service framework based on go-gin
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Grok open release