Weirenlan's Projects
Github repo with tutorials to fine tune transformers for diff NLP tasks
A simple linebot to process the url sent from the user and translate to desired language and export to pdf
Trojan Source: Invisible Vulnerabilities
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Tutorial covering Open Source tools for Source Separation.
θͺθι³ζͺοΌε
¬ιηζ¬οΌ
UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2021/Spring 2022
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
A practice to implement media cutter from frontend
YouTube playback technology for Video.js
Perceptual Quality Estimator for speech and audio
VisualChatGPT
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Code for the paper Real-Time Neural Voice Camouflage
General Speech Restoration
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Repo for the Wasabi datasets
An efficient architecture for real-time target sound extraction.
GUI parts library for Web application using [Polymer] WebComponents
Webcam Stable-Diffusion
A cross-platform GUI for youtube-dl made in Electron and node.js
A repo to practice the react
A basic scirpt to download video from yt url with yt-dlp
A tool for creating styled YouTube subtitles