mwang-lifesize,Min Wang,github

acoustic-simulator

Implementation of audio degradation processes

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

awesome-kaldi

A list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)

background-matting

Background Matting: The World is Your Green Screen

cppcoro

A library of C++ coroutine abstractions for the coroutines TS

deepcorrect

Text and Punctuation correction with Deep Learning

deepsegment

A sentence segmenter that actually works!

deepspeech

A TensorFlow implementation of Baidu's DeepSpeech architecture

dejavu

Audio fingerprinting and recognition in Python

facial-similarity-with-siamese-networks-in-pytorch

Implementing Siamese networks with a contrastive loss for similarity learning

frugally-deep

Header-only library for using Keras models in C++.

keras-sincnet

Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.

pytorch_speaker_verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

raw-audio-gender-classification

Machine learning experiment to perform gender classification from raw audio.

rnnoise-wasm

rnnoise noise suppression library as a WASM module

simple_bodypix_python

A simple and minimal BodyPix inference in Python

sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

sincnet-1

Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)

speaker-identification

A program for automatic speaker identification using deep learning techniques.

speaker-identification-python

Speaker Identification System (up to 100% accuracy); built using Python 2.7 and the python_speech_features library

speaker-identification-using-gmms

Uses GMMs to train a speaker identification model. Training and testing were done on a subset (34 speakers) of the VoxForge data corpus.

speaker-recognition-3d-cnn

Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"

speaker-recognition-py3

Based on MFCC and GMM (speech recognition based on MFCC and Gaussian mixture models)

speakerid_challenge

A recipe for creating a Speaker Identification system built on Kaldi.

speakeridentificationneuralnetworks

The Speaker Recognition System consists of two phases: Feature Extraction and Recognition. In the Extraction phase, the speaker's voice is recorded and a typical number of features are extracted to form a model. During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. The highlight of the system is that it can also identify the speaker's voice in a multi-speaker environment. A Multi-layer Perceptron (MLP) neural network based on the error backpropagation training algorithm was used to train and test the system. The system response time was 74 µs with an average efficiency of 95%.

speakerrecognition_tutorial

Simple d-vector based Speaker Recognition using PyTorch


Min Wang's Projects
