
update doc for pr #3488 (quantization speedup tool) #3512

Merged

merged 8 commits into microsoft:master on Apr 9, 2021

Conversation

linbinskn
Contributor

No description provided.

which increases the difficulty of deploying deep neural network model. Quantization is a
fundamental technology which is widely used to reduce memory footprint and speed up inference
process. Many frameworks begin to support quantization, but few of them support mixed precision
quantiation. Frameworks like `HAQ: Hardware-Aware Automated Quantization with Mixed Precision <https://arxiv.org/pdf/1811.08886.pdf>`__\, only support simulated mixed precision quantization which will
Contributor

quantiation -> quantization?

Contributor Author

Thanks, have fixed.

For complete examples please refer to :githublink:`the code <examples/model_compress/quantization/mixed_precision_speedup_mnist.py>`.


For more parameters about the class 'TensorRTModelSpeedUp', you can refer to :githublink:`the code <nni/compression/pytorch/speedup/quantization_speedup/integrated_tensorrt.py>`.
Contributor

better to refer to API doc instead of source code

Contributor Author

Yeah, have done.
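
For readers of this archived thread, a minimal sketch of the mixed precision speedup flow the quoted docs describe is given below. The class name `TensorRTModelSpeedUp` is taken from the snippet above; the constructor arguments, the `compress()`/`inference()` methods, and the calibration config layout are assumptions for illustration only and should be checked against the API doc or the full example at examples/model_compress/quantization/mixed_precision_speedup_mnist.py.

```python
# Hypothetical sketch only: the TensorRTModelSpeedUp constructor signature,
# the compress()/inference() methods, and the config layout below are
# assumptions, not the confirmed NNI API.
import torch
import torch.nn as nn

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(1, 8, kernel_size=3)
        self.fc = nn.Linear(8 * 26 * 26, 10)

    def forward(self, x):
        x = torch.relu(self.conv(x))
        return self.fc(torch.flatten(x, 1))

model = TinyNet().eval()
dummy_input = torch.rand(32, 1, 28, 28)

# Mixed precision: different bit widths per layer (format assumed).
calibration_config = {
    'conv': {'weight_bits': 8, 'activation_bits': 8},
    'fc': {'weight_bits': 16, 'activation_bits': 16},
}

# Speculative usage of the class referenced in the quoted snippet:
# from nni.compression.pytorch.speedup.quantization_speedup.integrated_tensorrt import TensorRTModelSpeedUp
# engine = TensorRTModelSpeedUp(model, input_shape=(32, 1, 28, 28), config=calibration_config)
# engine.compress()                                  # build the TensorRT engine
# output, latency = engine.inference(dummy_input)    # run the quantized engine
```

As the reviewer notes, the authoritative parameter list lives in the API documentation rather than the source file path quoted above.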

@linbinskn requested a review from QuanluZhang on Apr 9, 2021, 02:35
@SparkSnail merged commit 26207d1 into microsoft:master on Apr 9, 2021