Coder Social home page Coder Social logo

a2lsv's Introduction

A2LSV - Automatic Audio Labeler for Speaker Verification

This project will make it very easy to create speaker verification datasets for all languages. Audios will be automatically downloaded with 'youtube-dl'. Speakers in the audio will be pre-labeled automatically with GE2E encoder. Labeling can be done very efficently with keyboard shortcuts. For web interface, benefitted from this project. For labelling interface, benefitted from this project.

Labeling Screenshot

Shortcuts

Shortcut Description
CTRL + Space Play/Pause current audio.
Right Arrow Load next audio.
Left Arrow Load previous audio.
CTRL + Right Arrow Forward audio
CTRL + Left Arrow Backward audio.
CTRL + Up Arrow Set speed to 2x.
CTRL + Down Arrow Set speed to 1x.
a Add new speaker.
1, 2, 3, 4, .. , 9 Label speaker as according to input number.
Delete Delete this audio.

Setup

Need to install and configure apache kafka and mongoDB. To install apache kafka, you can follow this blog post. To install mongoDB server, you can follow offical documentation.

configs.json

Need to get a valid GCP API developer key. Default values for kafka port and mongoDb address are below. Change them if you need.

{
	"kafkaPort": 9092,
	"mongoDbAddress" : "127.0.0.1:27017",
	"googleAPIDeveloperKey" : "your_developer_key_here"	
}

Installing ffmpeg

sudo apt install ffmpeg

Creating environment

pip install pipenv
pipenv --python 3.6

Activating environment

pipenv shell

Installing python packages

pip install -r requirements.txt

Making migrations

cd a2lsv_web
python manage.py makemigrations web_interface
python manage.py migrate

Loading some language records

python manage.py loaddata fixtures.json

Running server

python manage.py runserver

Starting Kafka Consumers and Producers

Open new terminal window and activate environment for every script.

youtubeSearch

python youtubeSearch.py

youtubeAudioDownloader

python youtubeAudioDownloader.py

speakerDiarization

python speakerDiarization.py

Accessing final dataset files

You can find final dataset files in “a2lsv_web/static/datasets/(dataset_name)/final_dataset” directory. Folder hierarchy is like speaker id => youtube video id => audio file.

Documents

You can download Installation Guide, Software Design Document and User Guide.

a2lsv's People

Contributors

aliagdeniz avatar

Stargazers

 avatar ismet avatar Yakov Filin avatar Recep Ahmet SARITEKIN avatar  avatar twisted avatar Ümit Kaan Usta avatar Emin Can OGUZ avatar

Watchers

James Cloos avatar  avatar ismet avatar Muhammed Çağrı Kurt avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.