-
Hi, is it possible to force whisper to not use punctuation at all? I would like to do this manually in post-processing. I have the following problem. When I say: "Hello comma nice to meet you exclamation mark" Whisper most of the time does this then: "Hello comma, nice to meet you exclamation mark." In post, I remove all the punctuation (",", ".") that whisper did automatically and replace the manual punctuation. It looks like this then: "Hello, nice to meet you!" However, sometimes whisper replaces "comma" with "," which sucks for my manual punctuation because now I can not decide if it was set automatically or not. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 6 replies
-
Common punctuation symbols like commas, periods, exclamation points, etc. are not included in the default list of symbols that are suppressed during decoding: Line 248 in eff383b You could try adding them back in this list to obtain output without any of those punctuation symbols. |
Beta Was this translation helpful? Give feedback.
-
You can use
|
Beta Was this translation helpful? Give feedback.
Common punctuation symbols like commas, periods, exclamation points, etc. are not included in the default list of symbols that are suppressed during decoding:
whisper/whisper/tokenizer.py
Line 248 in eff383b
You could try adding them back in this list to obtain output without any of those punctuation symbols.