Skip to content

Strange messages in transcript - possible presence of bad inputs for learning Italian? #293

Answered by jongwook
asterbini asked this question in Q&A
Discussion options

You must be logged in to vote

This is an example of regurgitation/hallucination that our data preprocessing unfortunately didn't catch. Retraining all 5 models with a new filter would be the most surefire way to fix it (and there's no separate "Italian model"), but before that happens, you could choose to replace any phrase that contains amara.org with a blank string, as it seems to happen more frequently when the segment is silent or near the beginning and the end of the audio.

If you have a long sequence without speech (like an intro/outro animation) in the beginning/end of your video, Whisper may behave better if you supply trimmed audio without those parts; alternatively you could try combining Whisper with VAD (#29

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@benjaminbellamy
Comment options

Answer selected by jongwook
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants