Improve the performance of fake quantize OP #40491

leo0519 · 2022-03-13T14:10:35Z

PR types

Performance optimization

PR changes

OPs

Describe

The modifications to the fake quantize op are as follows.

* (moving average) Move the computation of the moving average scale to the device (80 us -> 3.5 us)
* (find max abs) Use a register to hold the local maximum within each thread
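For readers unfamiliar with the two pieces touched here, the following is a rough CPU-side sketch in Python, not the kernel code from this PR. It assumes the moving average scale follows the usual accumulator/state recurrence (`state = rate * state + 1`, `accum = rate * accum + abs_max`, `scale = accum / state`); the function names, the `rate` default, and the `num_threads` parameter are hypothetical and chosen only for illustration. The abs-max function mimics a strided per-thread loop in which each simulated thread keeps its running maximum in a local variable (standing in for a register) and writes it out once at the end.

```python
def update_moving_average_scale(accum, state, cur_abs_max, rate=0.9):
    """Assumed recurrence for the moving average scale (illustrative only)."""
    state = rate * state + 1.0
    accum = rate * accum + cur_abs_max
    return accum, state, accum / state


def find_abs_max(values, num_threads=4):
    """Simulate a strided abs-max reduction with one local maximum per thread."""
    partial = []
    for tid in range(num_threads):
        local_max = 0.0  # stands in for a per-thread register
        for i in range(tid, len(values), num_threads):
            local_max = max(local_max, abs(values[i]))
        # Each simulated thread writes its result once, after its loop ends.
        partial.append(local_max)
    # Final reduction over the per-thread results.
    return max(partial)


if __name__ == "__main__":
    cur = find_abs_max([1.0, -7.5, 3.0, 2.0, -0.5])
    accum, state, scale = update_moving_average_scale(0.0, 0.0, cur)
    print(cur, accum, state, scale)
```

In a real kernel the point of the second change would be that the inner loop updates a register rather than repeatedly writing to slower memory; the sketch only shows the access pattern, not the speedup.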

paddle-bot-old · 2022-03-13T14:10:39Z

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

qingqing01

LGTM

ceci3

LGTM

* Move the computation of moving average scale to device
* Use register to save local maximum in a thread

leo0519 added 2 commits March 13, 2022 07:00

Move the computation of moving average scale to device

2b59fef

Use register to save local maximum in a thread

cacfbec

leo0519 added the NVIDIA label Mar 13, 2022

leo0519 changed the title ~~[Paddle-TRT] Improve the performance of fake quantize OP~~ Improve the performance of fake quantize OP Mar 13, 2022

qingqing01 requested review from wanghaoshuang, yghstill, qingqing01 and ceci3 March 14, 2022 06:01

qingqing01 approved these changes Mar 14, 2022


ceci3 approved these changes Mar 14, 2022


Wangzheee merged commit 827b6a0 into PaddlePaddle:develop Mar 17, 2022

liqitong-a pushed a commit to liqitong-a/Paddle that referenced this pull request Mar 17, 2022

Improve the performance of fake quantize OP (PaddlePaddle#40491)

925ebb4

* Move the computation of moving average scale to device
* Use register to save local maximum in a thread
