- QAT with plugins
- After the QAT ONNX model is generated, replace the corresponding op with the plugin node.
- If the plugin sits at the tail of the network (i.e., the plugin's output is the network's output), the op it replaces does not need to participate in quantization-aware training, since no downstream layer consumes its output.
For more, see https://github.com/lix19937/pytorch-quantization/tree/main/pytorch_quantization/nn
and https://github.com/lix19937/auto_qat
Did you start with a pretrained model without QAT? If yes, does the FP32 (unquantized) model also show instability?
How did you add the QDQ nodes, and how did you determine the scales (what software did you use? did you perform calibration? was the calibration dataset large enough?)?
Did you perform fine-tuning after adding fake quantization? Did you observe the loss vs. accuracy curves? Did you check that you did not overfit?
Intuitively, you should verify that your model is not overfitting: an overfitted model tends to be unstable once we introduce noise from quantization and limited-precision arithmetic (in floating-point arithmetic, different operation orderings can produce small differences in output).
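To make the calibration/fake-quantization questions above concrete, here is a stdlib-only sketch of the underlying idea, not the actual pytorch-quantization API: max calibration derives an `amax` (and thus a scale) from the calibration dataset, and a QDQ pair then injects quantization noise by rounding and clamping to int8 before dequantizing. The function names are hypothetical.

```python
# Hedged sketch of max calibration + symmetric int8 fake quantization.
# Function names are illustrative, not a real library API.

def amax_calibrate(samples):
    """Max calibration: scale comes from the largest absolute value seen.
    A too-small calibration dataset can miss outliers and misestimate amax."""
    return max(abs(v) for batch in samples for v in batch)

def fake_quantize(x, amax, num_bits=8):
    """Simulate a Q->DQ pair: quantize to a symmetric int grid, then dequantize.
    Values beyond amax saturate, which is one source of quantization noise."""
    qmax = 2 ** (num_bits - 1) - 1          # 127 for int8
    scale = amax / qmax
    q = max(-qmax - 1, min(qmax, round(x / scale)))
    return q * scale

amax = amax_calibrate([[0.1, -0.9], [0.5, 1.0]])   # amax = 1.0
print(fake_quantize(0.5, amax))    # small rounding error around 0.5
print(fake_quantize(2.0, amax))    # saturates to amax -> 1.0
```

During QAT fine-tuning, this fake-quantization noise is present in the forward pass so the weights can adapt to it; the loss/accuracy curves then tell you whether the model recovers or overfits.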
- yolox
- yolov7
- centernet (lidar seg)
- lidar od
- resnet
- hrnet
- hourglass