En-gui's Projects
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Singing Voice Conversion via diffusion model
FarmVibes.AI: Multi-Modal GeoSpatial ML Models for Agriculture and Sustainability
4 bits quantization of LLaMA using GPTQ
Record twitch streams live!
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion