Add correct int type to the quantized weights matrix in MatMulInteger #29

peterbohm · 2023-02-12T01:04:25Z

Assignment of the quantized weights to B matrix in MatMulInteger, was casting them to the same int type as the quantized input (uint8).
This created an issue for models with weights quantized to int8 (e.g. using onnxruntime.quantization.quantize_dynamic()).
This commit checks for the actual data type of the weights and then uses that during assignment to B.

src/nodes/matmulinteger.h

kraiskil · 2023-02-13T08:37:00Z

Thanks for the PR.

ONNX specifies the two inputs to be of different types, so this is a clear bug in onnx2c. Good to merge.

kraiskil · 2023-02-13T09:55:12Z

Thanks :)

Add correct int type to the quantized weights matrix in MatMulInteger

6b69953

kraiskil reviewed Feb 13, 2023

View reviewed changes

src/nodes/matmulinteger.h Outdated Show resolved Hide resolved

Updated formatting to tabs

14a829b

kraiskil merged commit bdfe0f7 into kraiskil:master Feb 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add correct int type to the quantized weights matrix in MatMulInteger #29

Add correct int type to the quantized weights matrix in MatMulInteger #29

peterbohm commented Feb 12, 2023

kraiskil commented Feb 13, 2023

kraiskil commented Feb 13, 2023

Add correct int type to the quantized weights matrix in MatMulInteger #29

Add correct int type to the quantized weights matrix in MatMulInteger #29

Conversation

peterbohm commented Feb 12, 2023

kraiskil commented Feb 13, 2023

kraiskil commented Feb 13, 2023