Fix Onnx Export for Composer HuggingFaceModels #1557
Conversation
@nik-mosaic Thank you for hunting this down. I think a better solution would be if the
Also, while we are at it, we may want to assign the output names as well. HF models, depending on a config value, return a dict of outputs: https://github.com/huggingface/transformers/blob/main/src/transformers/models/bert/modeling_bert.py#L1373-L1378
See the implementation of this function as well: https://github.com/huggingface/transformers/blob/02b176c4ce14340d26d42825523f406959c6c202/src/transformers/onnx/convert.py#L84
Could you also check whether the output type (dict or tuple of tensors) is the same between the eager model output and the ONNX exported model?
Wouldn't this require writing Hugging Face-specific code for the output names? Can we do this and remain agnostic to the model type? I think the model/onnx_config config has output_names; I'm not sure whether that is a model-specific or ONNX-specific config.
They are not; the ONNX output is just a list of arrays, while the eager output is a dict of tensors.
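For illustration, here is a minimal, self-contained sketch of that difference (the toy module and file names are hypothetical stand-ins for Composer's HuggingFaceModel wrapper; assumes onnxruntime is installed):

```python
import torch
from torch import nn
import onnxruntime as ort

class ToyDictModel(nn.Module):
    """Stand-in for a wrapper whose forward takes a dict batch and returns a dict of outputs."""
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 2)

    def forward(self, batch):
        return {"logits": self.linear(batch["features"])}

model = ToyDictModel().eval()
batch = {"features": torch.randn(1, 4)}

with torch.no_grad():
    eager_out = model(batch)  # eager output: a dict of tensors, e.g. eager_out["logits"]

# The trailing {} keeps the dict batch positional (see the fix in the PR description below).
# The exporter flattens the dict output into a flat list of graph outputs.
torch.onnx.export(model, (batch, {}), "toy.onnx",
                  input_names=["features"], output_names=["logits"], opset_version=13)

session = ort.InferenceSession("toy.onnx")
onnx_out = session.run(None, {"features": batch["features"].numpy()})  # ONNX output: a list of arrays

torch.testing.assert_close(eager_out["logits"], torch.from_numpy(onnx_out[0]))
```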
I am OK keeping this as is (i.e., without output names) for now. However, someone directly consuming the ONNX output downstream where they previously used the eager output may run into issues, since the eager output is a dict. Could you create a task for this problem?
Also, a side comment for a future PR: are we better off using
Yes, for HuggingFace models I think we are better off calling this function. They do a better job with their OnnxConfig class and have utilities to convert a HuggingFace config to an OnnxConfig, so the inputs and outputs are named well for many classes of models.
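For reference, a rough sketch of that path (assumes a transformers version that still ships transformers.onnx.export and FeaturesManager; the checkpoint name and output path are placeholders):

```python
from pathlib import Path

from transformers import AutoModelForSequenceClassification, AutoTokenizer
from transformers.onnx import FeaturesManager, export

model_name = "bert-base-uncased"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Look up the OnnxConfig that transformers maintains for this architecture/feature.
_, onnx_config_ctor = FeaturesManager.check_supported_model_or_raise(
    model, feature="sequence-classification"
)
onnx_config = onnx_config_ctor(model.config)

# transformers generates dummy inputs and assigns input/output names and dynamic axes itself.
onnx_inputs, onnx_outputs = export(
    tokenizer, model, onnx_config, onnx_config.default_onnx_opset, Path("model.onnx")
)
print(onnx_inputs, onnx_outputs)  # e.g. ['input_ids', 'attention_mask', ...] and ['logits']
```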
Due to how torch.onnx.export processes arguments that are dictionaries (and since our HuggingFaceModel inputs are generally dictionaries), ONNX export for HuggingFaceModels is currently broken. This PR fixes that.
We fix this by calling
export_for_inference(..., sample_input=(input_batch, {}))
instead of
export_for_inference(..., sample_input=(input_batch,)).
This works for both tensor inputs (ResNets, etc.) and dictionary inputs.
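For context, this is the torch.onnx.export behavior that motivates the empty dict (a minimal sketch with a toy dict-input module, not Composer's actual wrapper):

```python
import torch
from torch import nn

class DictForward(nn.Module):
    # Takes the whole batch dict positionally, like HuggingFaceModel.forward(batch).
    def forward(self, batch):
        return batch["x"] + 1

model = DictForward()
batch = {"x": torch.zeros(1, 3)}

# Broken: torch.onnx.export treats a trailing dict in `args` as keyword arguments,
# so the model would be called roughly as model(x=...), which does not match forward(batch).
# torch.onnx.export(model, (batch,), "broken.onnx")

# Fixed: appending an empty dict marks "no keyword arguments", so `batch` stays positional.
torch.onnx.export(model, (batch, {}), "fixed.onnx", opset_version=13)
```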
In addition to the above change, we move the model and sample_input to the CPU, since the model and inputs cannot be on the GPU during ONNX export. We add a test to verify this works.
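A minimal sketch of the kind of recursive move this involves (hypothetical helper; Composer's actual implementation may differ):

```python
import torch

def _to_cpu(x):
    """Recursively move tensors (possibly nested in dicts, lists, or tuples) onto the CPU."""
    if isinstance(x, torch.Tensor):
        return x.cpu()
    if isinstance(x, dict):
        return {k: _to_cpu(v) for k, v in x.items()}
    if isinstance(x, (list, tuple)):
        return type(x)(_to_cpu(v) for v in x)
    return x

# Hypothetical usage before calling torch.onnx.export:
# model = model.cpu()
# sample_input = _to_cpu(sample_input)
```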
We also manually set the ONNX export opset_version to 13. This is the default version for PyTorch 1.12.1, but not for older versions of PyTorch (opset_version 9 was the previous default), which was causing an error. Ideally we would not specify this manually, but I cannot find a way to query the latest supported opset_version. Once we stop supporting PyTorch < 1.12.1, we should stop setting this manually.
Finally, we add an optional dynamic_axes argument, which is required for older PyTorch CPU versions (e.g., 1.10.2) when exporting HuggingFaceModels. If users in this scenario do not provide dynamic_axes for their inputs, they may see the following UserWarning, and their exported model may not be correct.
UserWarning: Type cannot be inferred, which might cause exported graph to produce incorrect results.
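For example, a dynamic_axes mapping for a typical HuggingFace text batch might look like the following (the names are illustrative; the mapping is passed through to torch.onnx.export's dynamic_axes argument):

```python
# Mark the batch and sequence dimensions as dynamic so the exported graph
# is not fixed to the shapes of the sample input.
dynamic_axes = {
    "input_ids": {0: "batch_size", 1: "sequence_length"},
    "attention_mask": {0: "batch_size", 1: "sequence_length"},
    "output": {0: "batch_size"},
}
```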