You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
same issue encountered while installing on a m1 device.
To install, I upgraded the version in setup.py and to run it, I had to remove the parameter activation_type in quantize_dynamic().
quantize_dynamic(
model_input=model_name,
model_output=output_model_name,
per_channel=True,
reduce_range=True, # should be the same as per_channel
# activation_type=QuantType.QUInt8,
weight_type=QuantType.QInt8, # per docs, signed is faster on most CPUs
optimize_model=False,
) # op_types_to_quantize=['MatMul', 'Relu', 'Add', 'Mul' ],
Is it possible to upgrade the ONNX runtime dependency to the latest? The old version has some bugs (e.g. doesn't work on aws lambda arm64)
The text was updated successfully, but these errors were encountered: