-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Synchronization offset #14
Comments
@skittlesvampir |
Oh my god, now it works!! Thank you so much. Just two small details:
|
I think it would be very hard to do a good job when guessing timestamps interpolations. In-between texts could be partially fast or slow and may include some sub-parts without spoken text. For point 2: the real solution is to improve the Whisper recognition. This can be obtained with WhisperHallu. For both points 1 and 2: I'm currently working on a solution using word-level timestamps and some complementary pre-/post-processing around WhisperHallu. I don't plan to release it fully open-source. We can discuss about it if you have a budget. |
I will check WhisperHallu out, it seems cool. Unfortunately, I don't have a budget, I'm just synchronizing my own shows so I can understand them better. Anyways, I think the errors are acceptable, so thank you for your work! I wish your business much success in the future! |
Problem description: openai/whisper#1770 (comment)
I've uploaded the data at: https://ben.ist-toll.xyz/k/whisper-test-files/
The text was updated successfully, but these errors were encountered: