Coder Social home page Coder Social logo

yehuangcn / cnn-for-single-channel-speech-enhancement Goto Github PK

View Code? Open in Web Editor NEW

This project forked from zhr1201/cnn-for-single-channel-speech-enhancement

0.0 1.0 0.0 406 KB

Convolutional neural nets for single channel speech enhancement

Python 100.00%

cnn-for-single-channel-speech-enhancement's Introduction

CNN-for-single-channel-speech-enhancement

A tensorflow implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement https://arxiv.org/abs/1609.07132 A processed sample can be found in audiosample/

Requirements

  • tensorflow r0.11
  • librosa
  • numpy

File documentation

  • SENN.py: The structure of the network.
  • audio_reader.py: Find the speech and noise in the files and enqueue the audios that have been read into tf.queue.
  • SENN_train.py: Train the net.
  • SENN_audio_eval: Use a noisy sample to evaluate the net/

Training procedure

  • Orgnize your clean speech files and noise files in different directories.
  • Change their dir in SENN_train.py and train the net.
  • Mix your own samples and test use SENN_audio_eval.py

Some other things

The original paper use per sample pre-whitening and we also use that in this piece of code, but it turns out better to use global mean and var to do the pre-whitening. We didn't use the skip connections and our tests show that the most important factor leading to good performance is the size of the noise data set. The model is very likely to overfit if only 100 types of noise are provided.

cnn-for-single-channel-speech-enhancement's People

Contributors

zhr1201 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.