Comments (3)
from espeak-ng.
Huh, I think that's the true extent of this bug; the dakuten itself isn't the cause.
Since there are overlapping rules for both るう and うぃ, espeak-ng converts the former first, and fails upon encountering ぃ.
From what I can glean, espeak-ng consumes the longest possible grapheme sequence specified in the rules sequentially, i.e. a greedy algorithm.
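To make that reading concrete, here is a minimal Python sketch of a greedy longest-match pass over a toy rule table. It is only a model of the behaviour described above, not espeak-ng's actual matcher; the `RULES` table, the `greedy_translate` helper, and the phoneme strings are all hypothetical placeholders.

```python
# Toy rule table: grapheme sequence -> phoneme string.
# Hypothetical entries for illustration; not copied from espeak-ng's ja_rules.
RULES = {
    "る": "ru",
    "う": "u",
    "るう": "ru:",
    "うぃ": "wi",
}


def greedy_translate(text, rules):
    """Consume the longest matching rule at each position, left to right.

    Returns the phonemes matched so far and whatever text could not be matched.
    """
    max_len = max(len(key) for key in rules)
    phonemes = []
    pos = 0
    while pos < len(text):
        # Try the longest candidate first, then shorter ones.
        for length in range(min(max_len, len(text) - pos), 0, -1):
            chunk = text[pos:pos + length]
            if chunk in rules:
                phonemes.append(rules[chunk])
                pos += length
                break
        else:
            # No rule starts at this position: stop and report the remainder.
            return phonemes, text[pos:]
    return phonemes, ""


matched, leftover = greedy_translate("るうぃ", RULES)
print(matched, repr(leftover))
```

In this model るう is consumed first, so the うぃ rule never gets a chance and ぃ is left over with no rule of its own.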
If that is the case, we could handle these anomalies by specifying all possible corner cases:
かあぁ -> ka a:
しいぃ -> s\\i i:
つうぅ -> t_su u:
ねえぇ -> ne e:
...
Or alternatively, we could just add rules for all the small versions of the vowels and call it a day:
ぁ -> a
ぁー -> a:
ぃ -> i
...
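As a rough check of this alternative, the sketch below runs a toy greedy longest-match pass with stand-alone small-kana rules added. It assumes the greedy reading described earlier and is not espeak-ng's real matcher; `RULES`, `greedy_translate`, and the phoneme strings for る, う, るう and うぃ are hypothetical placeholders, while the small-kana entries mirror the list above.

```python
# Toy rule table: grapheme sequence -> phoneme string (hypothetical entries).
RULES = {
    "る": "ru",
    "う": "u",
    "るう": "ru:",
    "うぃ": "wi",
    # Stand-alone small kana, mirroring the list above.
    "ぁ": "a",
    "ぁー": "a:",
    "ぃ": "i",
}


def greedy_translate(text, rules):
    """Consume the longest matching rule at each position, left to right."""
    max_len = max(len(key) for key in rules)
    phonemes = []
    pos = 0
    while pos < len(text):
        for length in range(min(max_len, len(text) - pos), 0, -1):
            chunk = text[pos:pos + length]
            if chunk in rules:
                phonemes.append(rules[chunk])
                pos += length
                break
        else:
            # No rule starts at this position: return the unmatched remainder.
            return phonemes, text[pos:]
    return phonemes, ""


matched, leftover = greedy_translate("るうぃ", RULES)
print(matched, repr(leftover))
```

In this model the pass no longer fails, but るう is still consumed first, so the result is the long vowel followed by a bare small-kana sound rather than る followed by うぃ. Whether that is an acceptable pronunciation is a separate question from whether the lookup succeeds.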
Of course, this still leaves the problem of the dakuten (and handakuten), which by definition doesn't have a fixed sound.
I propose a mixed strategy: remove the separation of dakuten/handakuten and treat graphemes such as ば as one grapheme.
Then we could add the smaller versions separately.
We would still need to rewrite most rules, but I think this would minimize the work necessary.
from espeak-ng.
Hmm, this isn't limited to small kana, either. The long vowel indicator (chōonpu) ー causes this too:
$ espeak-ng -v ja とおー -X
Translate 'とおー'
36 と [to]
57 とお [to:]
Translate 'と'
36 と [to]
Translate 'お'
36 お [o]
Translate 'ー'
Found: '_ja' [dZ'ap@ni:z]
t'o 'o _:(en)dZ'ap@ni:z(ja)l'et@
Unlike the above samples which are admittedly pretty niche, this is a very common combination.
from espeak-ng.