I need to create an acoustic model for Julius in English. The README says that Julius

Make sure you have the right site for HTK (<a href="http://htk.eng.cam.ac.uk/" rel="no

Tools to create acoustic models about julius HOT 4 OPEN

julius-speech commented on May 25, 2024

Tools to create acoustic models

from julius.

Comments (4)

colbec commented on May 25, 2024

Make sure you have the right site for HTK (http://htk.eng.cam.ac.uk/) - there is a note about the recently released beta 3.5 version which happened Dec 2015, so your source seems to be out of date.

I run openSUSE Leap 42.1 64 bit and have no problem compiling HTK. You might have to specify what error you see.

There are several tools for building models in ARPA and converting between formats. They tend to have minor differences in output, it depends what you are looking for. Try a google search for "language model generator."

Voxforge is occasionally slow, but right now it is ok for me. I selected a 4 MB file from the downloads section and it completed in 16 seconds on my slow connection.

from julius.

palles77 commented on May 25, 2024

I agree with colbec. HTK is old in some places, however there is a beta version available. I have been using HTK for years now for both language and acoustic modelling. Best way is to follow HTK tutorials provided in Voxforge for acoustic modeling and HTK tutorials for language modeling.

You need to be aware that creating a decent acoustic model is a non trivial process and you need to consider how much effort you are prepared to put into it. I myself have a few English UK models from my own experiments in the past, but their quality is not the best (around 25% WER).

from julius.

pdtwonotes commented on May 25, 2024

Since colbec reported that VoxForge downloads worked, on a hunch I created a VPN tunnel out of my local area and tried again. I was able to download the English model in just a few seconds. So something is wrong with my local ISP.

At first glance the VoxForge model appeared to work, and a quite large pronounciation dictionary was included. Unfortunately, the hmmdef file is missing many of the triphones that the dictionary uses.

from julius.

colbec commented on May 25, 2024

One of the downsides of a phone based model is that triphone possibilities are of the order of N^3; in English this might mean 40^3 or 64000 triphone candidates. It is really hard to exercise them all, even the most commonly used ones unless you are working with a very large audio database. This is made harder by trying to get a wide variety of voices. Sometimes you can bend your requirements for additional words by substituting phones that the model is aware of.

from julius.

Recommend Projects

Tools to create acoustic models about julius HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent