# speaker-cluster
This code is a simple Bayesian Information Criterion (BIC) clusterer for speaker diarisation. Clustering based on Euclidean distance and Arithmetic Harmonic Sphericity (AHS) is also included, and the usage message lists a `gish` distance option as well. The `md-eval.pl` script performs scoring and the `htkmfc.py` script reads HTK feature files.
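
For readers new to BIC clustering, here is a minimal sketch of the usual delta-BIC merge test between two segments. It is not taken from `speakercluster.py`: the function names `diag_log_det` and `delta_bic` are hypothetical, it assumes diagonal covariances and plain lists of feature vectors, and it assumes that the `penalty` argument plays the role of the usual BIC penalty weight.

```python
import math


def diag_log_det(frames):
    """Log-determinant of the diagonal maximum-likelihood covariance of frames."""
    n = float(len(frames))
    dim = len(frames[0])
    total = 0.0
    for k in range(dim):
        mean = sum(f[k] for f in frames) / n
        var = sum((f[k] - mean) ** 2 for f in frames) / n
        total += math.log(max(var, 1e-10))  # floor the variance to avoid log(0)
    return total


def delta_bic(seg_a, seg_b, penalty=1.0):
    """Delta-BIC between modelling two segments separately or as one Gaussian.

    Positive values favour keeping the segments apart (different speakers);
    negative values favour merging them. Hypothetical helper, diagonal case only.
    """
    n_a, n_b = len(seg_a), len(seg_b)
    n = n_a + n_b
    dim = len(seg_a[0])
    # Log-likelihood gain of two Gaussians over a single shared Gaussian.
    gain = 0.5 * (n * diag_log_det(seg_a + seg_b)
                  - n_a * diag_log_det(seg_a)
                  - n_b * diag_log_det(seg_b))
    # A diagonal Gaussian has dim means and dim variances.
    n_params = 2 * dim
    return gain - penalty * 0.5 * n_params * math.log(n)
```

In an agglomerative clusterer of this kind, the pair of clusters with the lowest delta-BIC is typically merged first, and merging stops once no pair scores below zero (or below a chosen threshold).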

$ python2 speakercluster.py 
usage: speakercluster.py [-h] [-d {ahs,bic,euclid,gish}] [-c {diagonal,full}]
                         [-p PENALTY] [-t THRESHOLD] [-m MAX_CLUSTERS]
                         [-s SHORT_TIME] [-f FOLDER] [-r REFERENCE] [--trace]
                         [--collar COLLAR]
                         scp
                         
The script requires an scp file in which each line points to a segment (a frame range) within an HTK feature file:

CORPUS-FILE1_000000_000332.mfc=/path/to/CORPUS-FILE1.mfc[0,332]
CORPUS-FILE1_000554_000599.mfc=/path/to/CORPUS-FILE1.mfc[554,599]
CORPUS-FILE1_000613_000770.mfc=/path/to/CORPUS-FILE1.mfc[613,770]
CORPUS-FILE1_000770_001027.mfc=/path/to/CORPUS-FILE1.mfc[770,1027]
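
The scp lines above can be split into a label, a feature file path, and a frame range. The following is a small sketch of such a parser, assuming every line follows the `label=path[start,end]` pattern shown; the names `Segment`, `SCP_LINE`, and `parse_scp_line` are hypothetical and do not come from the repository.

```python
import re
from collections import namedtuple

# Hypothetical container for one scp entry.
Segment = namedtuple("Segment", "label path start end")

# label=path[start,end], with start and end given as frame indices.
SCP_LINE = re.compile(
    r"^(?P<label>[^=]+)=(?P<path>.+)\[(?P<start>\d+),(?P<end>\d+)\]$"
)


def parse_scp_line(line):
    """Parse one scp line into a Segment, raising ValueError if it is malformed."""
    match = SCP_LINE.match(line.strip())
    if match is None:
        raise ValueError("unrecognised scp line: %r" % line)
    return Segment(
        match.group("label"),
        match.group("path"),
        int(match.group("start")),
        int(match.group("end")),
    )
```

For example, `parse_scp_line("CORPUS-FILE1_000554_000599.mfc=/path/to/CORPUS-FILE1.mfc[554,599]")` yields a segment with path `/path/to/CORPUS-FILE1.mfc`, start frame 554, and end frame 599.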


The reference file should be in the NIST format below, and the script outputs a hypothesis file in the same format:
;; SPEAKER FILENAME CHANNEL BEGIN(secs) DUR(secs) <NA> <NA> SPEAKERLABEL <NA>
SPEAKER CORPUS-FILE1 1 0.000000 3.320000 <NA> <NA> SPKR0 <NA>
SPEAKER CORPUS-FILE1 1 5.540000 0.450000 <NA> <NA> SPKR1 <NA>
SPEAKER CORPUS-FILE1 1 6.130000 1.570000 <NA> <NA> SPKR0 <NA>
SPEAKER CORPUS-FILE1 1 7.700000 2.570000 <NA> <NA> SPKR2 <NA>
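
The times in these lines match the frame ranges in the scp example if one assumes 100 frames per second (a 10 ms frame shift): frames 554 to 599 become a start of 5.54 s and a duration of 0.45 s. Below is a sketch of that conversion; the function name `nist_speaker_line` and the `frames_per_sec` default are assumptions made for illustration, not part of the script's documented interface.

```python
def nist_speaker_line(file_id, start_frame, end_frame, speaker,
                      frames_per_sec=100.0, channel=1):
    """Format one SPEAKER line from a frame range (hypothetical helper).

    Assumes a fixed frame rate; 100 frames per second is inferred from the
    examples, not confirmed by the script.
    """
    begin = start_frame / frames_per_sec
    duration = (end_frame - start_frame) / frames_per_sec
    return "SPEAKER %s %d %.6f %.6f <NA> <NA> %s <NA>" % (
        file_id, channel, begin, duration, speaker)


print(nist_speaker_line("CORPUS-FILE1", 0, 332, "SPKR0"))
```

If the features were extracted with a different frame shift, `frames_per_sec` would need to change accordingly.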
