View Code? Open in Web Editor NEW

AnKaS: Development and Analysis of the Database of Livvi-Karelian Speech Annotations [INTERSPEECH 2024]

HTML 15.11% CSS 7.51% JavaScript 77.38%

ankas's Introduction

AnKaS

The official repository for "AnKaS", INTERSPEECH 2024 (submitted)

Abstract

This paper presents a new Livvi-Karelian corpus, addressing challenges encountered in low-resource language research. The main research goal was to collect and annotate new speech data, as well as to create a transcription dictionary. The corpus includes transcripts from radio broadcasts, featuring samples from 17 speakers (7 males and 10 females). Covering about 4.5 hours of audio recordings, it contains 32037 words, thus being a valuable tool for linguistic research. Among the peculiarities of the presented corpus are instances of code-switching between Livvi-Karelian and Russian. The baseline experiments were carried out with the Kaldi toolkit. Hybrid DNN/HMMs with factorized time-delay neural networks were utilized for acoustic modeling, while trigram and LSTM-based models were used for language modeling. The proposed model allowed achieving the Word Error Rate (WER) of 26%.

Acknowledgments

Parts of this project page were adopted from the Nerfies page.

Website License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Recommend Projects

karelianspeech / ankas Goto Github PK

ankas's Introduction

AnKaS

Abstract

Acknowledgments

Website License

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent