Skip to content

Possible to keep repeated words #304

Answered by jianfch
section33 asked this question in Q&A
Discussion options

You must be logged in to vote

The decoding heuristics does not appear to suppress repeated words. So the "filtering" is performed by the model. Typical transcripts done by human omit repeated words. As a result, it has likely learned this from its training data. One way to "bypass this filtering" is to fine-tune the model on data that does not omit repeating words.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by jongwook
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants