samsudinng Goto Github PK
Name: Samuel Samsudin Ng
Type: User
Name: Samuel Samsudin Ng
Type: User
Alias-free short-time Fourier transform – a robust time-frequency transform for audio processing
Digital Automatic Gain Control Module
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
The PyTorch-based audio source separation toolkit for researchers
Audio Editor
A curated list of awesome embedding models tutorials, projects and communities.
speech enhancement\speech seperation\sound source localization
simple delaysum, MVDR and CGMM-MVDR
Conferencing Speech Challenge
ConvNet training using pytorch
Python implementation of global histogram equalization
Implementation of Diffusion Convolutional Recurrent Neural Network in Tensorflow
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
Face Attribute Prediction on CelebA benchmark with PyTorch Implementation
A fast, multi-language, multi-precision fixed-point library!
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
a Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid
label smoothing PyTorch implementation
A C library for reading and writing sound files containing sampled audio data.
Gender prediction in movie audio
Repository for Machine Learning resources, frameworks, and projects. Managed by the DLSU Machine Learning Group.
BiDAF baseline with additional character-level embedding
Books, papers, anything related to NLP
The open Master Hearing Aid (openMHA)
PortAudio is a cross-platform, open-source C language library for real-time audio input and output.
Pre-compiled shared libraries for PortAudio
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.