[Inference] update fakequant support #9047
Conversation
Commits:
* 1. add a8w8(fp8) a8w8c8(int8) quant_type support 2. add llama3.1 and qwen2 ptq config 3. update quantization.md
* …nto add_new_fakequant_type
Thanks for your contribution!
Codecov Report: all modified and coverable lines are covered by tests ✅

@@            Coverage Diff             @@
##           develop    #9047      +/-   ##
===========================================
- Coverage    53.76%   53.65%    -0.12%
===========================================
  Files          652      652
  Lines       104507   104867     +360
===========================================
+ Hits         56190    56264      +74
- Misses       48317    48603     +286

View full report in Codecov by Sentry.
@@ -1,4 +1,4 @@
- # 大模型量化教程
+ p# 大模型量化教程
There is an extra 'p' here.
LGTM
* 1. add a8w8(fp8) a8w8c8(int8) quant_type support 2. add llama3.1 and qwen2 ptq config 3. update quantization.md
* fix load_quant_model bug
* fix load quant bug
* update ll/README.md
* remove useless code
* update quant observer config
* resolve wrong modify
* fix prepare_qconfig
* remove unuse files
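For context on what "fakequant" means in commits like these: fake quantization simulates low-precision arithmetic (e.g. the int8 used by a8w8c8) by quantizing a float tensor to the integer grid and immediately dequantizing it back, so the quantization error is visible while everything stays in float. The sketch below is a generic illustration of that technique, not this PR's actual API; the function name `fake_quant_int8` and the symmetric per-tensor scaling are assumptions for the example.

```python
import numpy as np

def fake_quant_int8(x: np.ndarray) -> np.ndarray:
    """Simulate int8 quantization: quantize to the int8 grid, then
    dequantize back to float.

    The returned tensor stays in float, but carries the rounding and
    clipping error that real int8 inference would introduce; this is
    what PTQ calibration observes. (Illustrative sketch only, not the
    PR's implementation.)
    """
    scale = np.abs(x).max() / 127.0          # symmetric per-tensor scale
    q = np.clip(np.round(x / scale), -128, 127)  # values on the int8 grid
    return q * scale                          # "dequantize" back to float

x = np.array([0.5, -1.27, 2.54])
print(fake_quant_int8(x))
```

The round trip preserves each non-clipped value to within half a quantization step (`scale / 2`), which is why per-tensor calibration focuses on choosing a good `scale`.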
PR types
Bug fixes
PR changes
Others
Description