Coder Social home page Coder Social logo

thesis-bss's Introduction

thesis-bss

list of matlab / octave script used on master thesis by bagustris. The theme of thesis is binaural sound sources separation. Pdf file is available here

In this thesis, I evaluated some common methods in binaural sound separation: ICA (with max likelihood estimation, ICA with Binary Mask (ICABM), binural model using phase difference channel weighting [4], and my-proposed-method FastICA with Binary Mask (FastICABM).

This is the source code for underdetermined separation of instaneous speech mixtures with FastICA and binary mask and the comparison for benchmark.

The algorithm is described in

  1. Michael Syskind Pedersen, DeLiang Wang, Jan Larsen and Ulrik Kjems: Two-microphone Separation of Speech Mixtures, 2006, Submitted for publication.
  2. Michael Syskind Pedersen, DeLiang Wang, Jan Larsen and Ulrik Kjems, Overcomplete Blind Source Separation by Combining ICA and Binary Time-Frequency Masking, IEEE International workshop on Machine Learning for Signal Processing, pp. 15-20, 2005
  3. Hyvärinen, A., Erkki, H. 2000. Independent Component Analysis: Algorithm and Applications. Neural Networks, 13(4-5):411-430, 2000
  4. C. Kim, K. Kumar, B. Raj, , and R. M. Stern, “Signal separation for robust speech recognition based on phase difference information obtained in the fre- quency domain,” INTERSPEECH, pp. 2495–2498, 2009.

All files should be in the same directory. The algorithm is run by calling each icabm.m and fasticabm.m. For ICA algoritrh, can be directy run from worskpace and for PDCW can be obtained from the source.

A number of parameters can be specified in those files.

  • N : Number of sources in mixture
  • NFFT : DFT length
  • winnumber : Selects window function
  • k : Window length is NFFT/k
  • noverlapfactor : Overlap between consecutive windows
  • th : Mask threshold?
  • TC1 : Merge finalstereo signals if correlation is above TC1
  • TC2 : Merge finalstereo and enerstereo if correlation is above TC2
  • stopthresholdini : One source if condition number is above this value
  • thepow : tau_E (see [1])

All codes is copyrighted by its own author. The codes from me are licensed GNU/LGPL v2. Run main.m to get the demo. You can change the input file by your own data.

thesis-bss's People

Contributors

bagustris avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.