Conversation
@@ -216,7 +216,7 @@ def save_params(fname, arg_params, aux_params, logger=None):
     if exclude_first_conv:
         excluded_sym_names += ['resnetv10_conv0_fwd']
     elif args.model.find('resnet') != -1 and args.model.find('v2') != -1:
-        excluded_sym_names += ['resnetv20_flatten0_flatten0']
+        excluded_sym_names += ['resnetv20_flatten0_flatten0', 'resnetv20_stage1_batchnorm0_fwd']
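The exclusion logic in this hunk can be sketched as a small standalone helper. Note this is an illustration, not code from the PR: `build_excluded_sym_names` is a hypothetical function name, and the exact condition guarding the v1 branch is an assumption, since the surrounding context of the diff is truncated. The layer names are the Gluon model-zoo names used in the diff.

```python
def build_excluded_sym_names(model, exclude_first_conv=True):
    """Collect symbol names that should be skipped during quantization.

    Mirrors the PR's logic: for resnet v2 models, the flatten layer and the
    first stage's BatchNorm are excluded. Per the review discussion, the BN
    layer is excluded for accuracy; quantizing it drops top-1 accuracy.
    """
    excluded = []
    # Assumed v1 condition: the diff only shows the body of this branch.
    if model.find('resnet') != -1 and model.find('v1') != -1:
        if exclude_first_conv:
            excluded += ['resnetv10_conv0_fwd']
    elif model.find('resnet') != -1 and model.find('v2') != -1:
        excluded += ['resnetv20_flatten0_flatten0',
                     # excluded for accuracy (see review comments below)
                     'resnetv20_stage1_batchnorm0_fwd']
    return excluded
```

The returned list would be passed to MXNet's quantization entry point via its `excluded_sym_names` argument so those layers stay in FP32.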
why exclude the first one?
This is for the sake of accuracy: if this layer is not excluded, top-1 accuracy drops to 52.3. The reason for this accuracy drop is under investigation.
LGTM. Just add a code comment noting that the BN layer is excluded for accuracy purposes.
Thanks for the contribution.
LGTM, merging now.
Description
Add uint8 batchnorm: MKL-DNN implementation and test.
@PatricZhao @ZhennanQin
Details
Usage
See the doc at https://github.com/apache/incubator-mxnet/tree/master/example/quantization/README.md for how to quantize models and run inference.
Quantized BN is used automatically whenever a BN operator cannot be fused.
Performance
In most cases BN can be fused, so quantized BN is not introduced. In resnet50 v2, however, some BN operators are standalone; quantizing them gives the following performance: