Replies: 2 comments 1 reply
-
I think we are having the same issue. It seems to be the shape of the audio file tensor returned by torchaudio.load() is different from what whisper.transcribe() is expecting. I worked around it but I am not sure if there is a better solution. Here is my code:
tensor squeeze: https://pytorch.org/docs/stable/generated/torch.squeeze.html |
Beta Was this translation helpful? Give feedback.
-
If this is still relevant to anyone, I got this error with my use of the "large" model and got past it by specifying language = "en" in my call to model.transcribe(), i.e. |
Beta Was this translation helpful? Give feedback.
-
Hello, I am trying to transcribe audio from a Tensor got using torchaudio library but it is not working. I am using Flask to load the audio given an endpoint. Any solution? Here is the code:
The error displayed is:
decode_options["language"] = max(probs, key=probs.get) AttributeError: 'list' object has no attribute 'get'
in transcribe function.Thanks in advance.
Beta Was this translation helpful? Give feedback.
All reactions