samsudinng,Samuel Samsudin Ng,github

accessing-and-modifying-different-layers-of-a-pretrained-model-in-pytorch

afstft

Alias-free short-time Fourier transform – a robust time-frequency transform for audio processing

aps

A personal toolkit for single/multi-channel speech recognition & enhancement & separation.

asteroid

The PyTorch-based audio source separation toolkit for researchers

awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

beamforming-for-speech-enhancement

simple delaysum, MVDR and CGMM-MVDR

cv_histogram_equalization

Python implementation of global histogram equalization

dcrnn

Implementation of Diffusion Convolutional Recurrent Neural Network in Tensorflow

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.