
[Feature] fused mixtral wint4 #9013

Merged: 2 commits into PaddlePaddle:develop on Aug 27, 2024

Conversation

penPenf28 (Contributor)

PR types

New features

PR changes

Models

Description

Support wint4 (weight-only int4) quantization for the Mixtral model.
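For context, a minimal sketch (not this PR's code) of what the wint4 path does with a fused FFN weight, using Paddle's generic weight-only quantization utilities. Sizes are illustrative; int4 support depends on the Paddle build and on running on a supported GPU.

```python
# Minimal sketch, not this PR's code: the "wint4" path applied to a fused
# FFN weight via Paddle's generic weight-only quantization utilities.
# Sizes are illustrative; int4 support requires a Paddle build and GPU
# architecture that provide it.
import paddle
from paddle.nn.quant import weight_only_linear, weight_quantize

# Fused gate+up (ffn1) weight: [embed_dim, dim_feedforward * 2].
w = paddle.randn([4096, 28672], dtype="float16")

# Offline quantization: 4-bit values are packed two-per-byte into int8
# storage, with weight scales returned alongside.
qw, scale = weight_quantize(w, algo="weight_only_int4")

# At inference time the fused kernel dequantizes on the fly.
x = paddle.randn([1, 4096], dtype="float16")
y = weight_only_linear(x, qw, weight_scale=scale, weight_dtype="int4")
print(y.shape)  # [1, 28672]
```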


paddle-bot bot commented Aug 26, 2024

Thanks for your contribution!


codecov bot commented Aug 26, 2024

Codecov Report

Attention: Patch coverage is 0% with 3 lines in your changes missing coverage. Please review.

Project coverage is 54.12%. Comparing base (154928a) to head (2f3c818).
Report is 229 commits behind head on develop.

Files with missing lines                                 Patch %   Lines
...erimental/transformers/fused_transformer_layers.py    0.00%     3 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9013      +/-   ##
===========================================
+ Coverage    54.06%   54.12%   +0.05%     
===========================================
  Files          650      650              
  Lines       103883   103874       -9     
===========================================
+ Hits         56164    56221      +57     
+ Misses       47719    47653      -66     


Diff under review (excerpt, truncated):

-    ffn1_quanted_weight_list_i.reshape([self.transformer_block.config.embed_dim, -1])
+    ffn1_quanted_weight_list_i.reshape(
+        [self.transformer_block.embed_dim, self.transformer_block.dim_feedforward * 2]
+        if self.transformer_block.config.quant_type == "weight_only_int8"
Collaborator

Suggested change:

-    if self.transformer_block.config.quant_type == "weight_only_int8"
+    if self.quant_type == "weight_only_int8"

This should do it.

Contributor Author

done
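For readers outside GitHub's inline view: the quant_type branch exists because wint4 packs two 4-bit values into each stored int8 byte, so the quantized buffer holds half as many bytes as its wint8 counterpart and the reshape target must shrink accordingly (the same applies to the ffn2 reshape below). A minimal sketch of the shape arithmetic, with illustrative sizes and the packed axis assumed to be the trailing one:

```python
# Minimal sketch (illustrative sizes, packed axis assumed): why the reshape
# target for the fused gate+up (ffn1) weight depends on quant_type.
embed_dim, dim_feedforward = 4096, 14336

def ffn1_byte_shape(quant_type):
    cols = dim_feedforward * 2  # gate and up projections are fused side by side
    if quant_type == "weight_only_int4":
        cols //= 2              # two 4-bit values packed into each int8 byte
    return (embed_dim, cols)

print(ffn1_byte_shape("weight_only_int8"))  # (4096, 28672)
print(ffn1_byte_shape("weight_only_int4"))  # (4096, 14336)
```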

Diff under review (excerpt, truncated):

-    ffn2_quanted_weight_list_i.reshape([-1, self.transformer_block.config.embed_dim])
+    ffn2_quanted_weight_list_i.reshape(
+        [self.transformer_block.dim_feedforward, self.transformer_block.embed_dim]
+        if self.transformer_block.config.quant_type == "weight_only_int8"
Collaborator

Same as above.

Contributor Author

done
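As with ffn1 above, the ffn2 down-projection reshape is [dim_feedforward, embed_dim] bytes under wint8, with one axis halved under wint4's two-values-per-byte packing; which axis is halved is a detail of Paddle's kernel layout and is an assumption here, not taken from this diff.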

Contributor

@DesmonDay left a comment

LGTM

@wawltor wawltor merged commit f6fc7ff into PaddlePaddle:develop Aug 27, 2024
10 of 12 checks passed
lixcli pushed a commit to lixcli/PaddleNLP that referenced this pull request Aug 28, 2024

* [Feature] fused mixtral wint4

* [Refactor] refine code
Mangodadada pushed a commit to Mangodadada/PaddleNLP that referenced this pull request Sep 10, 2024
* [Feature] fused mixtral wint4

* [Refactor] refine code