Fix infinite loop caused by incorrect timestamp tokens prediction #914

andrewchernyh · 2023-02-01T09:12:55Z

#810

openai#810

jongwook · 2023-02-02T00:19:53Z

Thank you!

Jeronymous

The infinite loop can still happen, so I suggest to go further with this bugfix

Jeronymous · 2023-02-21T12:59:13Z

whisper/decoding.py

+            timestamps = sampled_tokens[sampled_tokens.ge(self.tokenizer.timestamp_begin)]
+            if timestamps.numel() > 0:
+                # timestamps shouldn't decrease; forbid timestamp tokens smaller than the last
+                logits[k, self.tokenizer.timestamp_begin : timestamps[-1]] = -np.inf


This is not enough to prevent the infinite loop (see discussion #924) because it is not preventing the model to always output <|0.00|>

Suggestion:

timestamp_last = max(timestamps[-1], self.tokenizer.timestamp_begin + 1) # Avoid to emit <|0.00|> again logits[k, self.tokenizer.timestamp_begin : timestamp_last] = -np.inf

I think a better suggestion is:

if last_was_timestamp and not penultimate_was_timestamp: timestamp_last = timestamps[-1] else: timestamp_last = timestamps[-1] + 1 logits[k, self.tokenizer.timestamp_begin : timestamp_last] = -np.inf

to force that timestamps are strictly increasing after a speech segment / increasing between the end of a speech segment and the start of the next one.

Is anyone looking at this?

Great solution @Jeronymous! I checked it and it works.

With your permission, Im gonna create a new PR to speed up this change. Im gonna mention your suggestion.

Changes from openai/whisper#914

@Jeronymous

Following the suggestions of @Jeronymous in openai#914 and openai#924, it solves the problem of endless loop.

New fix for endless loop problem. I also created a PR for official Whisper: openai/whisper#1155 It is explained in openai/whisper#914 and openai/whisper#924

@Jeronymous

* Update decoding.py Following the suggestions of @Jeronymous in #914 and #924, it solves the problem of endless loop. * Removed blank line and whitespaces in empty lines. * Suggested changes according to the linter --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

…enai#914) * Fix infinite loop caused by incorrect timestamp tokens prediction openai#810 * Update decoding.py --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

@Jeronymous

* Update decoding.py Following the suggestions of @Jeronymous in openai#914 and openai#924, it solves the problem of endless loop. * Removed blank line and whitespaces in empty lines. * Suggested changes according to the linter --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

…enai#914) * Fix infinite loop caused by incorrect timestamp tokens prediction openai#810 * Update decoding.py --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

@Jeronymous

* Update decoding.py Following the suggestions of @Jeronymous in openai#914 and openai#924, it solves the problem of endless loop. * Removed blank line and whitespaces in empty lines. * Suggested changes according to the linter --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

…enai#914) * Fix infinite loop caused by incorrect timestamp tokens prediction openai#810 * Update decoding.py --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

@Jeronymous

* Update decoding.py Following the suggestions of @Jeronymous in openai#914 and openai#924, it solves the problem of endless loop. * Removed blank line and whitespaces in empty lines. * Suggested changes according to the linter --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

@Jeronymous

* Update decoding.py Following the suggestions of @Jeronymous in openai/whisper#914 and https://github.com/openai/whisper/discussions/924, it solves the problem of endless loop. * Removed blank line and whitespaces in empty lines. * Suggested changes according to the linter --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

@Jeronymous

* Update decoding.py Following the suggestions of @Jeronymous in openai/whisper#914 and openai/whisper#924, it solves the problem of endless loop. * Removed blank line and whitespaces in empty lines. * Suggested changes according to the linter --------- Co-authored-by: Jong Wook Kim <jongwook@openai.com>

andrewchernyh and others added 2 commits February 1, 2023 16:04

Fix infinite loop caused by incorrect timestamp tokens prediction

e5f3168

openai#810

Update decoding.py

25d5ccc

jongwook merged commit 7858aa9 into openai:main Feb 1, 2023

Pikauba mentioned this pull request Feb 17, 2023

AssertionError: non-negative timestamp expected m-bain/whisperX#85

Closed

Jeronymous reviewed Feb 21, 2023

View reviewed changes

FernanOrtega added a commit to FernanOrtega/whisperX that referenced this pull request Mar 24, 2023

Update decoding.py

33dd3b9

Changes from openai/whisper#914

FernanOrtega mentioned this pull request Mar 24, 2023

Update decoding.py m-bain/whisperX#148

Merged

FernanOrtega added a commit to FernanOrtega/whisper that referenced this pull request Mar 27, 2023

Update decoding.py

fc181e9

Following the suggestions of @Jeronymous in openai#914 and openai#924, it solves the problem of endless loop.

FernanOrtega mentioned this pull request Mar 27, 2023

Update decoding.py #1155

Merged

FernanOrtega added a commit to FernanOrtega/whisperX that referenced this pull request Mar 27, 2023

Update decoding.py

79de666

New fix for endless loop problem. I also created a PR for official Whisper: openai/whisper#1155 It is explained in openai/whisper#914 and openai/whisper#924

FernanOrtega mentioned this pull request Mar 27, 2023

Update decoding.py m-bain/whisperX#154

Closed

bygreencn mentioned this pull request Dec 4, 2023

fix: remove hallucinations from silent audio ggerganov/whisper.cpp#1588

Closed

kyakuno mentioned this pull request Jan 9, 2024

Update whisper decoding algorithm axinc-ai/ailia-models#1355

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix infinite loop caused by incorrect timestamp tokens prediction #914

Fix infinite loop caused by incorrect timestamp tokens prediction #914

andrewchernyh commented Feb 1, 2023

jongwook commented Feb 2, 2023

Jeronymous left a comment

Jeronymous Feb 21, 2023

Jeronymous Mar 15, 2023

FernanOrtega Mar 27, 2023

Fix infinite loop caused by incorrect timestamp tokens prediction #914

Fix infinite loop caused by incorrect timestamp tokens prediction #914

Conversation

andrewchernyh commented Feb 1, 2023

jongwook commented Feb 2, 2023

Jeronymous left a comment

Choose a reason for hiding this comment

Jeronymous Feb 21, 2023

Choose a reason for hiding this comment

Jeronymous Mar 15, 2023

Choose a reason for hiding this comment

FernanOrtega Mar 27, 2023

Choose a reason for hiding this comment