Preparing scientific audio text for whisper fine-tuning #2148
Unanswered
kojomensahonums
asked this question in
Q&A
Replies: 3 comments 9 replies
-
Yes, avoid any symbols. |
Beta Was this translation helpful? Give feedback.
0 replies
-
@kojomensahonums Hello, I believe my tool (I am still adding features, and solving bugs) for creating synthetic audio datasets must be useful for your project, you can translate and edit audios to match your desired format. |
Beta Was this translation helpful? Give feedback.
4 replies
-
@gongouveia any progress so far? |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am currently working on training whisper for scientific and mathematical audio datasets. In preparing the ground truth text data, what is the best way to go about preparing it? How should equations be written down, should they be plainly put in text without symbols? How should SI units be written? Assuming there is a quadratic equation, example x^2-y^2=25, what's the best way to put this in text so whisper can follow through to transcribe? These are just few examples I am thinking through.
Beta Was this translation helpful? Give feedback.
All reactions