weight_quantize api support gpu #59090

wwbitejotunn · 2023-11-17T05:40:03Z

PR types

Function optimization

PR changes

OPs

Description

Add quanted weight gpu kernel for weight only int8 to speed up the model convert. sm70 premute kernel is contributed by @MARD1NO
Pcard-71502

paddle-bot · 2023-11-17T05:40:08Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

MARD1NO · 2023-11-21T02:24:07Z

paddle/phi/kernels/impl/weight_quantize_kernel_impl.h

@@ -30,14 +30,14 @@

 #pragma once

-#include "paddle/phi/backends/cpu/cpu_context.h"
+// #include "paddle/phi/backends/cpu/cpu_context.h"


注释删下

yuanlehome

LGTM

* weight quant gpu wint8 * fix scale type of weight ony quant * fix cpu quant weight * update * float scale for weight-only8 gpu quant * fix infermeta * fix include * add sm70 * fixup and andd sm70 * code_style fix

MARD1NO approved these changes Nov 21, 2023

View reviewed changes

yuanlehome previously approved these changes Nov 21, 2023

View reviewed changes

wwbitejotunn added 9 commits November 22, 2023 13:48

weight quant gpu wint8

ef5b846

fix scale type of weight ony quant

2b95077

fix cpu quant weight

941c9e8

update

56dae78

float scale for weight-only8 gpu quant

4b9184b

fix infermeta

3ae2825

fix include

af4c02e

add sm70

9a913e9

fixup and andd sm70

1ac2c63

wwbitejotunn dismissed yuanlehome’s stale review via 1ac2c63 November 23, 2023 02:14

wwbitejotunn force-pushed the develop_quanted_weight_gpu branch from 2166380 to 1ac2c63 Compare November 23, 2023 02:14

MARD1NO approved these changes Nov 23, 2023

View reviewed changes

code_style fix

eee6974

wwbitejotunn closed this Nov 23, 2023

wwbitejotunn reopened this Nov 23, 2023

yuanlehome approved these changes Nov 24, 2023

View reviewed changes

yuanlehome merged commit 3aff2c0 into PaddlePaddle:develop Nov 24, 2023
28 checks passed

yuanlehome changed the title ~~Add quanted weight gpu kernel~~ weight_quantize api support gpu Nov 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

weight_quantize api support gpu #59090

weight_quantize api support gpu #59090

wwbitejotunn commented Nov 17, 2023 •

edited

Loading

paddle-bot bot commented Nov 17, 2023

MARD1NO Nov 21, 2023

wwbitejotunn Nov 23, 2023

yuanlehome left a comment

weight_quantize api support gpu #59090

weight_quantize api support gpu #59090

Conversation

wwbitejotunn commented Nov 17, 2023 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Nov 17, 2023

MARD1NO Nov 21, 2023

Choose a reason for hiding this comment

wwbitejotunn Nov 23, 2023

Choose a reason for hiding this comment

yuanlehome left a comment

Choose a reason for hiding this comment

wwbitejotunn commented Nov 17, 2023 •

edited

Loading