Add deployment support wav2vec2.0 via torchaudio #3609
Conversation
@@ -199,6 +199,48 @@ loss = model(input_values, labels=labels).loss
loss.backward()
```

## Deploying wav2vec 2.0 with torchaudio

`torchaudio` has added a wav2vec 2.0 model definition that supports TorchScript, along with functions to import model instances from `fairseq` or 🤗 Transformers. By using TorchScript, you can deploy your wav2vec 2.0 model to ONNX Runtime, [C++](https://github.com/pytorch/audio/tree/master/examples/libtorchaudio/speech_recognition), [iOS](https://github.com/pytorch/ios-demo-app/tree/master/SpeechRecognition), and [Android](https://github.com/pytorch/android-demo-app/tree/master/SpeechRecognition).
Examples for iOS and Android are being updated to use torchaudio.
- torchaudio based wav2vec2 with no model input length limit pytorch/android-demo-app#141
- updated script and iOS code to use torchaudio 0.9 based wav2vec2 model with no input limit pytorch/ios-demo-app#53
ONNX support via TorchScript is reported to work here
looks great!
examples/wav2vec/README.md (outdated)
from torchaudio.models.wav2vec2.utils import import_fairseq_model

original, _, _ = fairseq.checkpoint_utils.load_model_ensemble_and_task(
    ["wav2vec_small_960h.pt"], arg_overrides={'data': "<DIRECTORY_WITH_DICTIONARY>"})
in newer models, the dictionary is actually stored inside the model checkpoint rather than externally. we could probably convert older checkpoints to follow this new format if it will simplify things
@alexeib I removed the `arg_overrides`. Did you update the public checkpoints? My concern is that users who just want to try the published models (if those are not converted to the new format) will encounter an issue when loading them.
no, i have not updated the old checkpoints yet. maybe we can make the data dir an optional argument, rather than requiring users to always provide it (or always omit it)?
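The optional-data-dir idea above could look roughly like this; `build_arg_overrides` is a hypothetical helper written only to illustrate the suggestion, not code from this PR or from fairseq.

```python
def build_arg_overrides(data_dir=None):
    """Build fairseq-style arg_overrides for loading a wav2vec 2.0 checkpoint.

    Newer checkpoints bundle the dictionary, so no override is needed;
    older checkpoints still need a directory containing the dictionary.
    (Hypothetical helper sketching the suggestion above.)
    """
    return None if data_dir is None else {"data": data_dir}


# Usage sketch (requires fairseq; commented out for illustration):
# original, _, _ = fairseq.checkpoint_utils.load_model_ensemble_and_task(
#     ["wav2vec_small_960h.pt"], arg_overrides=build_arg_overrides(data_dir))

print(build_arg_overrides())              # → None (new-format checkpoint)
print(build_arg_overrides("/data/dict"))  # → {'data': '/data/dict'}
```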
This pull request has been automatically marked as stale. If this pull request is still relevant, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize reviewing it yet. Your contribution is very much appreciated.
Closing this pull request after a prolonged period of inactivity. If this issue is still present in the latest release, please ask for this pull request to be reopened. Thank you!
What does this PR do?
In the upcoming PyTorch 1.9 / torchaudio 0.9 release, torchaudio supports TorchScript-able wav2vec 2.0 model definitions. This PR adds an illustration of how to convert models from `fairseq` and `transformers` into a deployable package.

cc @myleott @alexeib