Coder Social home page Coder Social logo

sr_with_kaldi's Introduction

Speaker embeddings for Text-independent speaker verification using TensorFlow, with Kaldi

This is a slightly modified TensorFlow implementation of the model presented by David Snyder in Deep Neural Network Embeddings for Text-Independent Speaker Verification.

In the paper, this algorithm is a little worse than i-vector. My test show similar output. Also, in my test, shallow network was a very little worse than deep network (This is dependency of DB).

In this code, there are many hard cording such folder location and some parameter related database. If I have database well-known SR database, I try to it. but I only have private database.

I hope this code helps researcher.

Credits

Original paper:

  • Snyder's paper:
@unknown{unknown,
author = {Snyder, David and Garcia-Romero, Daniel and Povey, Daniel and Khudanpur, Sanjeev},
title = {Deep Neural Network Embeddings for Text-Independent Speaker Verification},
year = {2017}
}

Also, use the part of code:

Features

  • Supports kaldi input&output style(input : mfcc scp-ark pair, output : embedding scp-ark pair)
    • This code can replace i-vector train - extraction part in kaldi egs/SRE10/v1.
  • Instead of concatenate VAD frame, I use orginal frame contain non-speech frame.
    • Training case, Many frame was used to train. Test case, max power frame to test. Detail is in the process_data_kaldi.py load_dataset function
    • This part depend on your opinion.
  • Adding input layer mean normalization instead of exptional block.
  • Adding some layer dropout and Batch normalized.
  • Adding L2 loss in last layer.

Requirements

  • Python (2.7)
  • NumPy
  • TensorFlow (I tried only 1.3 version)
  • Database

Usage

Preperation:

  1. Clone the repository recursively to get all folder and subfolders
  2. Prepare Database(I use private DB. If you need, the script needs to be modified)
  3. Use Kaldi-recipe extracing MFCC and VAD in SRE10/v1/run.sh

Running:

  1. run Training_kaldi function in make_dvec.py.
    after, run embedding_kaldi function.(Some function was written hard cording. Change you file location)
  2. use kaldi-recipe calculating mean vector and PLDA scoring.
    Maybe, you only run after /local/extract_ivectors.sh --stage 2 each folder.

Authors

[email protected]( or [email protected])

sr_with_kaldi's People

Contributors

qqueing avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

sr_with_kaldi's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.