
[New Features] add llm pretrain & lora & sft & prefix_tuning testing scripts #7056

Merged
9 commits merged into PaddlePaddle:develop on Sep 20, 2023

Conversation

@wj-Mcat (Contributor) commented on Sep 18, 2023

PR types

New features

PR changes

Others

Description

Add finetuning unit-test scripts under the LLM directory for the following models (a sketch of the shared test fixtures follows the list):

  • llama
  • chatglm
  • chatglm2
  • bloom
  • opt
  • gpt-3
  • ernie-se-3.5
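
For context, each model above is mapped to a tiny checkpoint under the __internal_testing__ namespace in the shared test fixtures. Below is a minimal sketch of that mapping, written as a Python dict purely for illustration; the two entries are copied from the fixture YAML quoted in the review threads further down, and the other models follow the same pattern:

# Tiny checkpoints used by the unit tests; entries copied from the fixture
# YAML quoted in the review below. The dict form itself is only illustrative.
MODEL_FIXTURES = {
    "llama": {"model_name_or_path": "__internal_testing__/tiny-random-llama"},
    "chatglm": {"model_name_or_path": "__internal_testing__/tiny-fused-chatglm"},
}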

codecov bot commented on Sep 18, 2023

Codecov Report

Merging #7056 (5e2c23a) into develop (321faf3) will decrease coverage by 0.07%.
Report is 6 commits behind head on develop.
The diff coverage is n/a.

@@             Coverage Diff             @@
##           develop    #7056      +/-   ##
===========================================
- Coverage    59.91%   59.84%   -0.07%     
===========================================
  Files          556      557       +1     
  Lines        82035    82150     +115     
===========================================
+ Hits         49148    49161      +13     
- Misses       32887    32989     +102     

see 6 files with indirect coverage changes

@wj-Mcat changed the title from "[New Features] add llm finetune testing" to "[New Features] add llm pretrain & lora & sft & prefix_tuning testing scripts" on Sep 18, 2023
@wj-Mcat marked this pull request as ready for review on September 19, 2023, 12:09
Comment on lines 2 to 29
"dataset_name_or_path": "./data",
"output_dir": "./checkpoints/chatglm_sft_ckpts",
"per_device_train_batch_size": 4,
"gradient_accumulation_steps": 4,
"per_device_eval_batch_size": 8,
"eval_accumulation_steps":16,
"num_train_epochs": 3,
"learning_rate": 3e-05,
"warmup_steps": 30,
"logging_steps": 1,
"evaluation_strategy": "epoch",
"save_strategy": "epoch",
"src_length": 1024,
"max_length": 2048,
"fp16": true,
"fp16_opt_level": "O2",
"do_train": true,
"do_eval": true,
"disable_tqdm": true,
"load_best_model_at_end": true,
"eval_with_do_generation": false,
"metric_for_best_model": "accuracy",
"recompute": true,
"save_total_limit": 1,
"tensor_parallel_degree": 4,
"pipeline_parallel_degree": 1
}
wj-Mcat (Contributor, Author):
This formatting was applied by pre-commit.
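
As a side note, here is a minimal sketch of how a CI unit test could reuse a fixture like this while shrinking it to a single-card, single-epoch run; the file path and the overridden values are assumptions for illustration, not what this PR actually does:

import json

# Hypothetical path to an SFT fixture like the one quoted above.
CONFIG_PATH = "llm/chatglm/sft_argument.json"

with open(CONFIG_PATH, "r", encoding="utf-8") as f:
    config = json.load(f)

# Illustrative overrides so the run fits a small CI job.
config.update({
    "num_train_epochs": 1,
    "per_device_train_batch_size": 1,
    "tensor_parallel_degree": 1,
    "max_length": 128,
})

with open("sft_argument_ci.json", "w", encoding="utf-8") as f:
    json.dump(config, f, ensure_ascii=False, indent=2)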

Comment on lines +29 to +30
llama:
model_name_or_path: __internal_testing__/tiny-random-llama
Collaborator:
Please also add a baichuan entry, the variant with alibi.

wj-Mcat (Contributor, Author):
Let's leave that one to @wtmlon to add.

llama:
model_name_or_path: __internal_testing__/tiny-random-llama
chatglm:
model_name_or_path: __internal_testing__/tiny-fused-chatglm
Collaborator:
What does the "fused" in tiny-fused-chatglm refer to?

wj-Mcat (Contributor, Author):
Because the head_dim of a fused model must be one of 10, 26, 32, 64, 128, ..., a new tiny-random model was created specifically for the fused / non-fused unit tests, so that the existing unit tests are not affected.
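
To make the constraint concrete, a tiny guard along these lines captures it; the whitelist values come from this comment (and are not exhaustive), while the function itself is hypothetical rather than code from the PR:

# Head dims accepted by the fused kernels, per the comment above (non-exhaustive).
FUSED_SUPPORTED_HEAD_DIMS = {10, 26, 32, 64, 128}


def supports_fused_attention(hidden_size: int, num_attention_heads: int) -> bool:
    # A checkpoint can exercise the fused path only if its head_dim is in the
    # supported set; otherwise the tiny-random model is used instead.
    head_dim = hidden_size // num_attention_heads
    return head_dim in FUSED_SUPPORTED_HEAD_DIMS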

tests/llm/test_lora.py (outdated review comment, resolved)
pipeline_parallel_degree: 1
default:
llama:
model_name_or_path: __internal_testing__/tiny-random-llama
Collaborator:
What is the difference between random and fused here?

wj-Mcat (Contributor, Author):
Because the head_dim of a fused model must be one of 10, 26, 32, 64, 128, ..., a new tiny-random model was created specifically for the fused / non-fused unit tests, so that the existing unit tests are not affected.

model_type: llama
model_name_or_path: __internal_testing__/tiny-random-llama
chatglm:
model_type: chatglm
Collaborator:
Does chatglm have a pretraining pipeline?

wj-Mcat (Contributor, Author):
No, it doesn't, and chatglm isn't configured for fine-tuning here either; I can remove it for now.

LLMTest.tearDown(self)
shutil.rmtree(self.data_dir)

def test_pretrain(self):
Collaborator:
This function looks like it is doing finetuning.

wj-Mcat (Contributor, Author):
It was copied over and not updated yet; I'll adjust it.

self.data_dir = tempfile.mkdtemp()
sys.path.insert(0, self.model_dir)

# Run pretrain
Collaborator:
The comment here looks a bit off.

wj-Mcat (Contributor, Author):
I'll fix it.

use_few_examples("dev.json")
use_few_examples("validation.json")

def tearDown(self) -> None:
Collaborator:
Besides deleting the data files, do the model files need to be deleted as well?

wj-Mcat (Contributor, Author):
Since they are loaded with from_pretrained, they are cached under the .paddlenlp/models directory, so there should be no need to delete them.
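
In that case the tearDown only needs to drop the per-test temporary data, as in the snippet quoted above. A self-contained sketch of that lifecycle (the class name and the placeholder test are illustrative, not the PR's test class):

import os
import shutil
import tempfile
import unittest


class DataCleanupSketch(unittest.TestCase):
    # Illustrative only: mirrors the setUp/tearDown pattern quoted above.

    def setUp(self):
        self.data_dir = tempfile.mkdtemp()

    def tearDown(self) -> None:
        # Remove only the temporary data; weights fetched via from_pretrained
        # stay cached under .paddlenlp/models and are reused by later runs.
        shutil.rmtree(self.data_dir)

    def test_data_dir_exists(self):
        self.assertTrue(os.path.isdir(self.data_dir))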

self.data_dir = os.path.join(self.data_dir, "data")
self.use_small_datasets()

def use_small_datasets(self):
Collaborator:
Check whether this function can be shared and reused.
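
If it is factored out, a shared helper might look roughly like this; the truncation count is an assumption, and the file names come from the use_few_examples calls quoted earlier in this conversation:

import os


def use_few_examples(data_dir: str, filename: str, num_examples: int = 10) -> None:
    # Hypothetical shared helper: keep only the first few JSON-lines examples
    # of a dataset file so the finetune/pretrain unit tests run quickly.
    path = os.path.join(data_dir, filename)
    with open(path, "r", encoding="utf-8") as f:
        lines = f.readlines()
    with open(path, "w", encoding="utf-8") as f:
        f.writelines(lines[:num_examples])


def use_small_datasets(data_dir: str) -> None:
    # Trim every split referenced by the tests (names taken from the snippets).
    for filename in ("dev.json", "validation.json"):
        if os.path.exists(os.path.join(data_dir, filename)):
            use_few_examples(data_dir, filename)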

self.data_dir = os.path.join(self.data_dir, "data")
self.use_small_datasets()

def use_small_datasets(self):
Collaborator:
Same as above.


merge()

if self.model_dir not in ["chatglm2"]:
Collaborator:
Does this mean chatglm2 cannot be converted from dynamic to static graph and then run inference? Please leave a TODO here.

wj-Mcat (Contributor, Author):
chatglm2 does not support fine-grained ops yet, so it needs to be excluded here.
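
A sketch of how the exclusion plus the requested TODO could read, based on the quoted condition (the helper function is illustrative, not the exact code in the PR):

def should_run_static_inference(model_dir: str) -> bool:
    # TODO(chatglm2): fine-grained ops are not supported yet, so the
    # dynamic-to-static export and inference step is skipped for chatglm2.
    return model_dir not in ["chatglm2"]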

@sijunhe merged commit a150627 into PaddlePaddle:develop on Sep 20, 2023
7 of 9 checks passed
@wj-Mcat deleted the add-llm-chatglm-testing branch on September 20, 2023, 12:56