different behavior dealing with Japanese Kanji by different model #204

Answered by jongwook
unlock2000 asked this question in Q&A

This is similar to the "punctuation mode" and "no-punctuation mode" observed in #194: the model can sample a particular writing style and then keep using it, because subsequent transcriptions are conditioned on the previous outputs.

I think the model was more likely to fall into this "no-kanji mode" given the tone and content of the story, which appears to be aimed at children.

As mentioned in the comments in #194, you could supply a hypothetical sentence that could have come before the audio as the initial prompt, for example --initial_prompt "次はヒロが見た変な夢の物語です。" (roughly, "Next is the story of a strange dream Hiro had."), to nudge the model toward the style you want.
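
For anyone using the Python API rather than the command line, the same idea can be sketched roughly as below. This is only an illustration: it assumes the openai-whisper package is installed, the "medium" model name and the audio path are placeholders, and `kanji_ratio` and `transcribe_with_prompt` are hypothetical helper names added here to make it easy to see whether the output has slipped into "no-kanji mode".

```python
# Prompt from the answer above; it contains kanji, which is the style we want.
PROMPT = "次はヒロが見た変な夢の物語です。"


def kanji_ratio(text: str) -> float:
    """Fraction of non-whitespace characters in the CJK Unified Ideographs
    block (U+4E00..U+9FFF). A value near 0 suggests kana-only output."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    kanji = sum(1 for c in chars if "\u4e00" <= c <= "\u9fff")
    return kanji / len(chars)


def transcribe_with_prompt(audio_path: str, prompt: str = PROMPT,
                           model_name: str = "medium") -> str:
    """Transcribe Japanese audio, passing `prompt` as Whisper's initial_prompt.

    The model name is an assumption; use whichever model you were testing.
    """
    # Imported lazily so kanji_ratio() can be used without whisper installed.
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(audio_path, language="ja", initial_prompt=prompt)
    return result["text"]


# Example usage (placeholder file name):
#   text = transcribe_with_prompt("story.wav")
#   print(text, kanji_ratio(text))
```

Comparing `kanji_ratio` for runs with and without the prompt gives a quick, rough signal of whether the nudge worked; it does not guarantee the style holds for the whole file.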

Answer selected by jongwook