Openutau Yaml Dictionaries for Arpasing, En VCCV, Xsampa phonemizers, En Enunu, and En EnunuX/EnunuX to support Japanese, Korean, Chinese, and French (more languages to support soon)
📍 When using these dictionaries with Arpasing banks I recommend using my Syllable-Based ARPAsing Phonemizer because these dictionaries contains not so common [C V]
, [V C]
, and specially [C C]
aliases, also it has a fallback feature for missing aliases and phonemes such as [ax]
, [dx]
, and [zh]
English to Japanese
, English to Korean
, English to Chinese
, English to Cantonese
,English to French
, and English to Spanish
(languages auxiliary support for now) for Arpasing, En VCCV, Xsampa banks, and Enunu/Enunux English banks for Openutau .yaml dictionaries, tweaked based on Slidingwall's synth dictionary.
📍 As of now, the bigger the .yaml
file, the slower Openutau changes the phonemes or any changes in general, as of Ou doesn't supports yet the dictionary picking like on Synthv, for now, you can choose and download the single language dictionary or the compiled one. You can customize the dictionary as you please.
credits for mmemim for the Synthv French .json Dictionary. Credits to Halo/BagelHero & Mim for the Ou En VCCV Custom Dictionary and credits to Lotte's Spanish cmudict for the en2sp base.
- if there's any issue on the dictionary, and language request to support Openutau's dictionary, you can contact me through my Twitter. Let me know if there's a problem tehee.
📍 There may be wrong entries and mispronunciation in the dictionary, please report them if so. Making your english voicebanks sings in different languages will still require phoneme editing as this lexicon is by no means a plug-and-play solution especially Korean, Chinese, French and the Spanish.
⭐ The dictionaries are updated regularly so feel free to download the latest one here or in this mirror link. ⭐
- How to download and install dictionaries on Windows
- How to use the dictionary + the suffix support
- How to edit dictionary to fit a specific voicebank
- To download any .yaml dictionary, click on the filename and then click on the download icon in the upper left corner of the file browser, as shown in the image below.
- If it doesn't show up like that, click Raw...
- Then right click, and click save as...
- Rename the extension to
.yaml
not.txt
then save
- Open your Openutau folder and go to
Plug-ins
folder. Drag and drop the yaml there and you are ready to go
- Reload the english voicebank by choosing any phonemizers then back to the english one or changing voicebanks then going back again to the desired voicebank and you should now be able utilize the dictionary. If it doesn't load up, close and reopen OpenUtau
- You can input
Hiragana
,Katakana
andromaji c v
in the notes to utilize the japanese dictionary. It also supports the usage ofっ
eg:っば
,っいぇ
,っしょ
and glottal stops on vowels eg:あ2
,い2
,う2
,え2
,お2
,ん2
- You can input
Hanguul
andRomaja
with.
after the romaja word in the notes to utilize the Korean dictionary. Some have alternate pronunciations by inputing1
,2
, and3
after the word. Note that korean consonant assimilation is not supported because of the complex character-to-phoneme system with theHangul
input, so usingRomaja
to place consonants such asss.
,p.
,bb.
and so on the note.
- Only
Pinyin
characters with/
after the word to utilize the Chinese dictionary. Multiple pronunciations now supported. eg:ra/1
as[zh, ah]
,chun/
as[ch, w, ax, n]
,chun/1
as[ch, uw, n]
- Multiple pronunciation variations
- Only
Jyutping
characters with\
after the word to utilize the Cantonese dictionary. Multiple pronunciations aresupported. eg:yuk\
as[y, ih, k]
,yuk\1
as[y, k]
,byui\1
as[b, y, ih]
,byui\1
as[b, y, iy]
.
- You can input
French
, with*
after the word in the notes to utilize the French dictionary. Based with mmemim's Synthv French .json Dictionary, If you want to say words like l'année or j'en you'll have to split them in two notes l' + année and j' + en as shown below.
- In French, you also have liaisons, and some words might not always be pronounced the same way. Because of that, you may sometimes need to put
word(2)
,word(3)
or evenword(4)
. For example,absentes
has 4 entries.
-
You can input
español
with~
after the spanish word in the notes to utilize the Spanish dictionary. Note that rolled r's orrr
doesn't support by default on the dictionary itself, if your voibecank have a rolled r extras, you can add them to the dictionary by specifying the phoneme to the symbol list and replacing thedx, dx
phoneme with the rolled r phoneme. If your vb doesn't have rolled r extras, you can find a way to do them, mine I just add as manyr q r q
orr/dx
then glottals to the rolled r part of the word -
also words without accent doesn't work eg:
útiles~
and you typedutiles~
, ou will output the english pronunciation eg:utiles~
to[y, uw, t, ah/ax, l, z]
so it is important to add the symbol accents. If this feature is annoys you (also myself as well lmao) just let me know, I'll make a variant without the symbol accents
-
Some voicebanks come with extra phonemes especially with En VCCV and Xsampa voicebanks, test out first the dictionary so you will know if its there are errors you will encounter like on this picture below:
-
Editing the
.yaml
file via notepad or any other editing software you have to edit your desired phoneme per voicebank. I recommended to copy first one language then edit with the instructions inside the.yaml
file -
If it doesn't work, batch rename your vb's oto or wait for the dev team to make an phoneme fallback to fix the issue.