-
I've an audio that is The audio has 4 phrases. |
Beta Was this translation helpful? Give feedback.
Answered by
jongwook
Sep 26, 2022
Replies: 1 comment 2 replies
-
We could add a post-processing step so that the timestamps are upper-bounded by the audio length. This wasn't my priority because the timestamps tend to become more accurate with the larger models and a prolonged last timestamp didn't hurt much practically. |
Beta Was this translation helpful? Give feedback.
2 replies
Answer selected by
jongwook
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
We could add a post-processing step so that the timestamps are upper-bounded by the audio length. This wasn't my priority because the timestamps tend to become more accurate with the larger models and a prolonged last timestamp didn't hurt much practically.