
fix llama3 static run #8849

Merged
Merged 1 commit on Aug 28, 2024

Conversation

yuanlehome
Collaborator

@yuanlehome yuanlehome commented Jul 31, 2024

PR types

Bug fixes

PR changes

Others

Description

Fixes a series of issues with llama3 static-graph inference using individual (non-fused) ops; accuracy is now correct.


paddle-bot bot commented Jul 31, 2024

Thanks for your contribution!


codecov bot commented Jul 31, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 53.87%. Comparing base (34a71c8) to head (10d3e95).
Report is 227 commits behind head on develop.

Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8849      +/-   ##
===========================================
+ Coverage    53.81%   53.87%   +0.06%     
===========================================
  Files          652      652              
  Lines       104356   104356              
===========================================
+ Hits         56155    56220      +65     
+ Misses       48201    48136      -65     


@yuanlehome yuanlehome force-pushed the fix_llama3_static_run branch from e63c9b6 to 02680fa Compare August 27, 2024 06:27
DesmonDay
DesmonDay previously approved these changes Aug 27, 2024
Contributor

@DesmonDay DesmonDay left a comment


LGTM

@@ -1407,6 +1407,7 @@ def _post_process_(
# compute next_tokens
if use_top_p:
logits = logits / temperature
probs = paddle.cast(probs, paddle.float32)
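The added line casts the probabilities to fp32 before top-p sampling. As a minimal, framework-free sketch of what that sampling step consumes, here is nucleus (top-p) filtering in NumPy with the analogous fp32 cast; `top_p_filter` is a hypothetical helper for illustration, not code from this PR:

```python
import numpy as np

def top_p_filter(probs, top_p):
    # Cast to float32 first -- analogous to the paddle.cast added in this PR,
    # which worked around a sampling kernel without bf16 registration.
    probs = probs.astype(np.float32)
    # Sort probabilities in descending order.
    order = np.argsort(-probs)
    sorted_p = probs[order]
    # Keep the smallest prefix of tokens whose cumulative mass reaches top_p:
    # token i survives if the cumulative mass *before* it is still below top_p.
    cum = np.cumsum(sorted_p)
    keep = (cum - sorted_p) < top_p
    # Zero out the tail and renormalize the surviving mass.
    filtered = np.zeros_like(probs)
    filtered[order[keep]] = sorted_p[keep]
    return filtered / filtered.sum()
```

For example, with `top_p=0.9` and probabilities `[0.5, 0.3, 0.15, 0.05]`, the last token falls outside the nucleus and is zeroed before renormalization.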
Collaborator


Is this cast here to support top_p_sampling with the bf16 kernel?

Collaborator Author


The bf16 kernel implementation itself supports it; the cast to fp32 is here because without it an error is raised.

Collaborator


What exactly causes the error?

Collaborator Author

@yuanlehome yuanlehome Aug 28, 2024


I double-checked: the kernel indeed had no bf16 registration. I have added bf16 support on the Paddle side, so the cast logic added here has been removed, and verification passes.

Contributor

@DesmonDay DesmonDay left a comment


LGTM

@DesmonDay DesmonDay merged commit 2f567e6 into PaddlePaddle:develop Aug 28, 2024
10 of 12 checks passed
Mangodadada pushed a commit to Mangodadada/PaddleNLP that referenced this pull request Sep 10, 2024