Is it possible to get the actual verbatim transcript? #712
-
Hi, when I use Whisper with the stock parameters, the transcriptions are always clean, i.e. without "uh", "erm", repetitions, etc. This is good because it makes for awesome transcripts. But sometimes I really want exactly what was said, as in "Uh, I don't know, uh, maybe 100, 153, or or even more". Currently Whisper outputs "I don't know, maybe 153 or even more", and I wonder whether there is an option to get the actual sentence with all those noisy words. Thanks in advance!
Replies: 2 comments 4 replies
-
This is somewhat steerable using prompts, such as #625 (reply in thread)
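As a rough illustration of what prompt steering can look like with the `openai-whisper` Python package: `transcribe()` accepts an `initial_prompt` string, and a prompt that itself contains fillers and repeated words tends to nudge the model toward keeping them. The prompt text, the helper name `transcribe_with_prompt`, and the file name below are all hypothetical and not taken from #625; results vary by model size and audio, so treat this as a sketch rather than a guaranteed fix.

```python
# Hypothetical prompt written in the disfluent style we want Whisper to imitate.
# The exact wording is an assumption; experiment with your own.
HYPOTHETICAL_PROMPT = (
    "Uh, so, um, I I think, erm, you know, it's, uh, maybe maybe more."
)


def transcribe_with_prompt(audio_path, prompt=HYPOTHETICAL_PROMPT, model_name="base"):
    """Transcribe audio_path, passing prompt as Whisper's initial_prompt."""
    # Imported lazily so this module can be loaded without whisper installed.
    import whisper

    model = whisper.load_model(model_name)
    # initial_prompt is fed to the decoder as preceding context for the
    # first window, which biases the style of the output text.
    result = model.transcribe(audio_path, initial_prompt=prompt)
    return result["text"]


if __name__ == "__main__":
    # "interview.wav" is a placeholder file name.
    print(transcribe_with_prompt("interview.wav"))
```

The prompt only conditions the first 30-second window directly; later windows are conditioned on previously decoded text, so the effect may fade on long recordings.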
-
Please check out this Whisper variant, which was fine-tuned specifically with filler detection and verbatim transcription in mind: