support save load optimizer master_weights #60027

pangengzheng · 2023-12-14T13:44:53Z

PR types

Others

PR changes

Others

Description

card-78318
support flatten state_dict, save load optimizer master_weights and deduplicate tensor when save state_dict

… develop

… flatten_and_dedup_for_save_load

paddle-bot · 2023-12-14T13:45:00Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

… flatten_and_dedup_for_save_load

zhiqiu · 2023-12-26T03:28:40Z

python/paddle/distributed/checkpoint/utils.py

@@ -61,5 +61,47 @@ def compute_local_shape_and_global_offset(


 def flatten_state_dict(state_dict):


Does it support multiple level, i.e., {'model': {'m': {'w': xxx}}} ?

zhiqiu · 2023-12-26T03:44:02Z

python/paddle/optimizer/optimizer.py

-                )
-
-                tensor.set(load_para_np, framework._current_expected_place())
+                var.set_value(state_dict[var_tmp.name])


why change here？

代码复用，set_value api的行为包含这部分删掉的代码逻辑，且支持设置distributed tensor赋值

zhiqiu · 2023-12-26T03:49:47Z

python/paddle/distributed/checkpoint/save_state_dict.py

@@ -74,6 +73,16 @@ def dedup_storage_metadata(global_storage_metadata):
    return out


+def dedup_tensor(state_dict, local_storage_metadata, dedup_storage_metadata):


add some comments

ok，不过这个方法可以算是此文件的私有方法，不对外。comment已加

XieYunshen

LGTM

zhiqiu

LGTM

* exclude xpu * dedup tensor in state_dict * polish * support flatten and unflatten state_dict * test flatten * rename test * fix dedup tensor test * fix test * fix load state dict * rename * fix test * support save load optimizer master weights * add comment

pangengzheng added 25 commits June 25, 2023 15:36

exclude xpu

fc3b3c0

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

e291552

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

7a13c0b

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

d81f305

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

cd6e4fb

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

9d27f27

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

5037694

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

ef695ee

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

23aa6ff

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

f7615b7

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

6605dff

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

767835d

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

f756bc6

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

2ffd709

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

04e9851

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

f319eb8

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

0a6997b

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

91174c2

… develop

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

90646b5

… develop

dedup tensor in state_dict

8dc5096

polish

966e0b4

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

2d8cf78

… flatten_and_dedup_for_save_load

support flatten and unflatten state_dict

bc91233

test flatten

15c0e77

rename test

a43938f

pangengzheng added 4 commits December 15, 2023 11:01

fix dedup tensor test

6129d00

merge develop

e4dd5f0

fix test

0f8148e

fix load state dict

949b420

pangengzheng added 4 commits December 18, 2023 17:04

rename

75ea59e

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

2747df3

… flatten_and_dedup_for_save_load

fix test

f0986f3

support save load optimizer master weights

fe1cc22

pangengzheng changed the title ~~Flatten and dedup for save load~~ support save load optimizer master_weights Dec 19, 2023

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

10e3360

… flatten_and_dedup_for_save_load

zhiqiu reviewed Dec 26, 2023

View reviewed changes

pangengzheng added 2 commits December 26, 2023 15:26

add comment

ea5b683

merge dev

2bc1d50

XieYunshen approved these changes Dec 28, 2023

View reviewed changes

zhiqiu approved these changes Dec 28, 2023

View reviewed changes

zhiqiu merged commit 76ce9bb into PaddlePaddle:develop Dec 28, 2023
29 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support save load optimizer master_weights #60027

support save load optimizer master_weights #60027

pangengzheng commented Dec 14, 2023 •

edited

Loading

paddle-bot bot commented Dec 14, 2023

zhiqiu Dec 26, 2023

pangengzheng Dec 26, 2023

zhiqiu Dec 26, 2023

pangengzheng Dec 26, 2023

zhiqiu Dec 26, 2023

pangengzheng Dec 26, 2023 •

edited

Loading

XieYunshen left a comment

zhiqiu left a comment

		@@ -61,5 +61,47 @@ def compute_local_shape_and_global_offset(


		def flatten_state_dict(state_dict):

		@@ -74,6 +73,16 @@ def dedup_storage_metadata(global_storage_metadata):
		return out


		def dedup_tensor(state_dict, local_storage_metadata, dedup_storage_metadata):

support save load optimizer master_weights #60027

support save load optimizer master_weights #60027

Conversation

pangengzheng commented Dec 14, 2023 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Dec 14, 2023

zhiqiu Dec 26, 2023

Choose a reason for hiding this comment

pangengzheng Dec 26, 2023

Choose a reason for hiding this comment

zhiqiu Dec 26, 2023

Choose a reason for hiding this comment

pangengzheng Dec 26, 2023

Choose a reason for hiding this comment

zhiqiu Dec 26, 2023

Choose a reason for hiding this comment

pangengzheng Dec 26, 2023 • edited Loading

Choose a reason for hiding this comment

XieYunshen left a comment

Choose a reason for hiding this comment

zhiqiu left a comment

Choose a reason for hiding this comment

pangengzheng commented Dec 14, 2023 •

edited

Loading

pangengzheng Dec 26, 2023 •

edited

Loading