[FIX DDP] fix ddp #8549

ZHUI · 2024-06-05T07:52:30Z

PR types

Others

PR changes

Others

Description

Others

paddle-bot · 2024-06-05T07:52:35Z

Thanks for your contribution!

codecov · 2024-06-05T08:27:32Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 54.96%. Comparing base (79e8b6e) to head (5d34b49).
Report is 243 commits behind head on develop.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8549      +/-   ##
===========================================
+ Coverage    53.86%   54.96%   +1.10%     
===========================================
  Files          620      620              
  Lines        97090    97109      +19     
===========================================
+ Hits         52298    53377    +1079     
+ Misses       44792    43732    -1060

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

DesmonDay · 2024-06-07T06:37:13Z

paddlenlp/trainer/trainer.py

@@ -1795,17 +1795,8 @@ def _wrap_model(self, model, training=True):
        in_cp_parallel_mode = self.args.context_parallel_degree > 1

        # Multi-gpu training


这个地方需要合入到2.8吗？

DesmonDay

LGTM

* enable trainer tests.

* [Safetensors] Fix fast safe open slice. (#8512) * [FIX DDP] fix ddp (#8549)

* [Safetensors] Fix fast safe open slice. (PaddlePaddle#8512) * [FIX DDP] fix ddp (PaddlePaddle#8549)

* quick fix from pretrained. (#8487) * quick fix os.path.split (#8508) * Cp/fix (#8569) * [Safetensors] Fix fast safe open slice. (#8512) * [FIX DDP] fix ddp (#8549) * [BUG] Fix build train valid test datasets (#8823) * Update causal_dataset.py * Add twenty redundant data in post pretrain (#8777) * 给dataset再添加20条数据,防止blend dataset出现错误 * num_samples向下去整,防止数据集的溢出 (#8691) * update release_grads (#8834) * update release_grads (#8834) * [Trainer] Fix release_grads (#9085) * fix pp release_grads * add dataloader_drop_last to evaldataloader (#8773) * bugfix * Fix eval hang (#9052) * fix pipeline eval * fix eval dataloader_num_workers --------- Co-authored-by: Zhong Hui <zhonghui.net@gmail.com> Co-authored-by: yujun <50394665+JunnYu@users.noreply.github.com> Co-authored-by: gongel <ainlp88@qq.com>

fix ddp

e94599f

fix

9f70bba

ZHUI added 3 commits June 6, 2024 14:46

Merge branch 'develop' into fix/ddp

1418330

enable trainer tests.

cb131d6

Merge remote-tracking branch 'zhui/fix/ddp' into fix/ddp

5d34b49

DesmonDay reviewed Jun 7, 2024

View reviewed changes

DesmonDay approved these changes Jun 7, 2024

View reviewed changes

ZHUI merged commit f89c91d into PaddlePaddle:develop Jun 7, 2024
8 of 12 checks passed

ZHUI deleted the fix/ddp branch June 7, 2024 06:53

ZHUI added a commit to ZHUI/PaddleNLP that referenced this pull request Jun 7, 2024

[FIX DDP] fix ddp (PaddlePaddle#8549)

257a5bc

* enable trainer tests.

DesmonDay pushed a commit that referenced this pull request Jun 7, 2024

Cp/fix (#8569)

c628f12

* [Safetensors] Fix fast safe open slice. (#8512) * [FIX DDP] fix ddp (#8549)

DesmonDay pushed a commit to DesmonDay/PaddleNLP that referenced this pull request Sep 5, 2024

Cp/fix (PaddlePaddle#8569)

c18d7b7

* [Safetensors] Fix fast safe open slice. (PaddlePaddle#8512) * [FIX DDP] fix ddp (PaddlePaddle#8549)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FIX DDP] fix ddp #8549

[FIX DDP] fix ddp #8549

ZHUI commented Jun 5, 2024

paddle-bot bot commented Jun 5, 2024

codecov bot commented Jun 5, 2024 •

edited

Loading

DesmonDay Jun 7, 2024

DesmonDay left a comment

		@@ -1795,17 +1795,8 @@ def _wrap_model(self, model, training=True):
		in_cp_parallel_mode = self.args.context_parallel_degree > 1

		# Multi-gpu training

[FIX DDP] fix ddp #8549

[FIX DDP] fix ddp #8549

Conversation

ZHUI commented Jun 5, 2024

PR types

PR changes

Description

paddle-bot bot commented Jun 5, 2024

codecov bot commented Jun 5, 2024 • edited Loading

Codecov Report

DesmonDay Jun 7, 2024

Choose a reason for hiding this comment

DesmonDay left a comment

Choose a reason for hiding this comment

codecov bot commented Jun 5, 2024 •

edited

Loading