ishine's Projects
Kaldi Speech Processing Tools
Code for the paper "FFC-SE: Fast Fourier Convolution for Speech Enhancement" (published at Interspeech 2022 conference)
FFCV: Fast Forward Computer Vision (and other ML workloads!)
Audio Normalization for Python/ffmpeg
The Fastest Fourier Transform in the South
Code for FG2SEQ: EFFECTIVELY ENCODING KNOWLEDGE FOR END-TO-END TASK-ORIENTED DIALOG.
Analyzes novel text to extract plot content and character relationships
label which character spoke in fimfiction stories using Masked Language Modelling.
A Deep Reinforcement Learning Framework for Automated Trading in Quantitative Finance. NeurIPS 2020. 🔥
FinRL-Meta: A Universe for Data-Driven Financial Reinforcement Learning. 🔥
FinRL-Podracer is a cloud solution.
A simple, easy-to-understand TTS / SVS / SVC framework
A C++ standalone library for machine learning
Speech-to-text web app based on the Flask web development framework.
Flask + Tornado based NVIDIA Tacotron2 + WaveGlow TTS web app
pic-music upload
code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
Chinese novel character-name NER using the Flat-Lattice Transformer
Chinese Text Normalization and Dataset
Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf
Unofficial PyTorch implementation of "Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation" (FCP), https://ieeexplore.ieee.org/abstract/document/9622185
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis (UNDER CONSTRUCTION).