[xpu] multi_encoder_xpu support int8 #61212

Merged 1 commit on Jan 30, 2024

Contributor

@NeroLoh NeroLoh commented Jan 26, 2024

PR types

New features

PR changes

OPs

Description

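  1. XPU now supports the old PaddleSlim quantization format (fake_xxx_ops and similar ops).
  2. The fused multi_encoder_xpu operator now supports int8 quantized models. Supported modes are mul quantization, mul + matmul quantization, and per_channel quantization.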
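
For readers unfamiliar with the term, per_channel quantization can be sketched as below. This is a minimal, illustrative example and not the code from this PR: it assumes symmetric int8 quantization with one scale per output channel (here, one per row of a 2-D weight), and the names `quantize_per_channel` and `dequantize_per_channel` are hypothetical helpers, not Paddle APIs.

```python
def quantize_per_channel(weight, num_bits=8):
    """Symmetric per-channel quantization of a 2-D weight (a list of rows).

    Assumption for illustration: each row is one output channel and
    gets its own scale, derived from that row's largest absolute value.
    """
    qmax = 2 ** (num_bits - 1) - 1  # 127 for int8
    quantized, scales = [], []
    for row in weight:
        max_abs = max(abs(v) for v in row)
        # Guard against an all-zero channel to avoid division by zero.
        scale = max_abs / qmax if max_abs > 0 else 1.0
        # Round to the nearest integer and clamp to [-qmax, qmax].
        q_row = [max(-qmax, min(qmax, round(v / scale))) for v in row]
        quantized.append(q_row)
        scales.append(scale)
    return quantized, scales


def dequantize_per_channel(quantized, scales):
    """Map integer values back to floats using each channel's scale."""
    return [[q * s for q in row] for row, s in zip(quantized, scales)]


# Two channels with very different ranges: each keeps its own scale.
weight = [[0.5, -1.0, 0.25], [10.0, 5.0, -2.5]]
q, scales = quantize_per_channel(weight)
restored = dequantize_per_channel(q, scales)
```

Because each channel has its own scale, the small-valued first row is not forced onto the coarse grid that the large-valued second row needs, which is the usual motivation for per-channel over per-tensor scales.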

zhupengyang previously approved these changes Jan 29, 2024
Contributor

@zhupengyang zhupengyang left a comment

LGTM

Contributor

@vivienfanghuagood vivienfanghuagood left a comment

LGTM for update OP inputs

Contributor

@XiaoguangHu01 XiaoguangHu01 left a comment

LGTM

@zhupengyang zhupengyang merged commit d6cf7e3 into PaddlePaddle:develop Jan 30, 2024
30 checks passed
Labels: contributor (External developers)
5 participants