
OnnxFloatToFloat16: Use ort float16 converter #1132

Merged

merged 4 commits into main from jambayk/phi-and-fp16 on May 3, 2024
Conversation

jambayk
Contributor

@jambayk jambayk commented May 1, 2024

Describe your changes

  • onnxconverter-common's float16 converter tool is not regularly maintained, and it also cannot handle large models (>2 GB). Use the float16 converter from onnxruntime instead, which is a modified version of the previous tool and has more features.
  • Added unit test for pass.

Checklist before requesting a review

  • Add unit tests for this change.
  • Make sure all tests can pass.
  • Update documents if necessary.
  • Lint and apply fixes to your code by running lintrunner -a
  • Is this a user-facing change? If yes, give a description of this change to be included in the release notes.
  • Is this PR including examples changes? If yes, please remember to update example documentation in a follow-up PR.

(Optional) Issue link

@jambayk jambayk changed the title OnnxFloatToFloat16: use ort float16 converter, OrtTransformersOptimization: warning for phi torchscript model Use ort float16 converter, warning for phi torchscript model in transformers optimization pass May 1, 2024
@jambayk jambayk changed the title Use ort float16 converter, warning for phi torchscript model in transformers optimization pass Use ort float16 converter, Warning for phi torchscript model May 1, 2024
@jambayk jambayk marked this pull request as draft May 2, 2024 19:43
@jambayk
Contributor Author

jambayk commented May 2, 2024

Converting to draft. Will revert warning changes.

@jambayk jambayk changed the title Use ort float16 converter, Warning for phi torchscript model OnnxFloatToFloat16: Use ort float16 converter May 2, 2024
@jambayk jambayk marked this pull request as ready for review May 2, 2024 22:36
@jambayk jambayk merged commit 3120697 into main May 3, 2024
35 checks passed
@jambayk jambayk deleted the jambayk/phi-and-fp16 branch May 3, 2024 16:23
DavitGrigoryan132 pushed a commit to DavitGrigoryan132/Olive that referenced this pull request Aug 14, 2024
2 participants