Weirenlan's Projects
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
PyTorch package for the discrete VAE used for DALL·E.
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
A simple personal record of using OpenAI's DALL·E 2
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, Warp Markers, and JUCE processors
DDSP: Differentiable Digital Signal Processing
Third-party audio effects plugins as differentiable layers within deep neural networks.
Noise suppression using deep filtering
Speech Recognition using DeepSpeech2.
Demucs Lightning: A PyTorch Lightning version of Demucs with Hydra and TensorBoard features
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, that further improve the model's performance and generalization.
Materials for the Hugging Face Diffusion Models Course
A repo recording my practice and learning process with diffusion models
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
A PyTorch implementation of DNN-based source separation.
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
How to wait for container X before starting Y using docker-compose healthcheck
Dockerized Facebook Demucs library, packaged to make it easy to run
A Dockerfile that builds FFmpeg from source. Built on Alpine Linux.
Kaggle Python docker image
Docker basics tutorial, from scratch: Docker-Beginners-Guide teaches you how to set up Django + PostgreSQL with Docker 📝
A repo recording my practice with Docker
A repo recording tool configurations and my learning process
Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Uses the Caffe framework to fine-tune an open-source model, then chains a CNN with an SVM to diagnose ultrasound images as benign or malignant
PyTorch tutorials and best practices.
Framework to easily create LLM powered bots over any dataset.