[NewFeature] add paddlenlp command #3538
Conversation
Cool features! 🎉
Would it be cleaner to rename the `commands` directory to `cli`?
Cool, agreed. I'll rename it to `cli` then.
@guoshengCS ready for review
paddlenlp/cli/converter.py
Outdated
def convert_from_online_model(model_name: str, cache_dir: str, output_dir):
    """convert the model which is not maintained in paddlenlp community, eg: vblagoje/bert-english-uncased-finetuned-pos

    TODO(wj-Mcat): to deeply test this method

    Args:
        model_name (str): the name of model
        cache_dir (str): the cache_dir to save pytorch model
        output_dir (_type_): the output dir
    """
Due to time constraints, we are not testing this function for now; once the model hub work is finished we will come back and thoroughly test the online_convert related functionality.
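For reference, a rough sketch of what such an online conversion flow could look like once it is tested. The import path follows this PR's paddlenlp/cli/converter.py, but downloading via transformers and handing the weights to convert_from_local_file are assumptions made purely for illustration, not the PR's actual implementation:

import os

import torch
from transformers import AutoModel

# convert_from_local_file is added elsewhere in this PR; delegating to it here
# is an assumption for illustration only.
from paddlenlp.cli.converter import convert_from_local_file


def convert_from_online_model_sketch(model_name: str, cache_dir: str, output_dir: str):
    """Download a model that is not maintained in the paddlenlp community
    (eg: vblagoje/bert-english-uncased-finetuned-pos) and convert its weights."""
    # 1. pull the pytorch checkpoint into cache_dir
    model = AutoModel.from_pretrained(model_name, cache_dir=cache_dir)

    # 2. persist the weights so the local-file converter can pick them up
    weight_file = os.path.join(cache_dir, "pytorch_model.bin")
    torch.save(model.state_dict(), weight_file)

    # 3. reuse the local conversion path added in this PR
    convert_from_local_file(weight_file, output_dir)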
logger.info("download community model configuration from server ...")
remote_community_model_path = os.path.join(
    COMMUNITY_MODEL_PREFIX, COMMUNITY_MODEL_CONFIG_FILE_NAME)
cache_dir = os.path.join(MODEL_HOME)
Cache the community_models.json file under the ~/.paddlenlp/models directory.
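A minimal sketch of that caching behaviour, assuming COMMUNITY_MODEL_PREFIX is a base URL and using only the standard library for the download; the real code presumably goes through paddlenlp's own downloader utilities:

import json
import os
import urllib.request

# MODEL_HOME resolves to ~/.paddlenlp/models; the literal value here is an assumption
MODEL_HOME = os.path.expanduser("~/.paddlenlp/models")
COMMUNITY_MODEL_CONFIG_FILE_NAME = "community_models.json"


def fetch_community_model_config(community_model_prefix: str) -> dict:
    """Download community_models.json once and cache it under MODEL_HOME."""
    os.makedirs(MODEL_HOME, exist_ok=True)
    cached_file = os.path.join(MODEL_HOME, COMMUNITY_MODEL_CONFIG_FILE_NAME)
    if not os.path.exists(cached_file):
        remote_path = "/".join([community_model_prefix, COMMUNITY_MODEL_CONFIG_FILE_NAME])
        urllib.request.urlretrieve(remote_path, cached_file)
    with open(cached_file, "r", encoding="utf-8") as f:
        return json.load(f)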
if model_config_file is not None:
    with open(model_config_file, 'r', encoding='utf-8') as f:
        config = json.load(f)
    config = self.convert_config(config)
It looks like convert_config uses the HF config directly by default. Can the current config be used from HF as-is without any adjustment? Do fields like version need adjusting?
For future models, the parameters should in principle stay consistent with HF, and both sides use PretrainedConfig, so the config can be used without any adjustment.
For old models, the convert_config function does need to be overridden to map the config onto the parameter list of the BaseModel constructor.
That said, version should indeed be removed; I'll also go through and check for other fields that need removing.
I've added a remove_transformer_unused_fields method to filter out the unneeded fields.
In addition, if the parameter names stay consistent with the HF model, only the config_fields_to_be_removed field needs adjusting to get automatic conversion of the configuration.
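A minimal sketch of that filtering approach; the method and attribute names follow the comment above (remove_transformer_unused_fields, config_fields_to_be_removed), but the concrete list of removed keys is an assumption for illustration only:

class Converter:
    """Illustrative converter base: drop HF-only config fields, keep the rest."""

    # config keys that exist in Hugging Face configs but are not accepted by the
    # paddle model constructors; subclasses only need to extend this list (assumed values)
    config_fields_to_be_removed = ["version", "transformers_version", "use_cache"]

    def remove_transformer_unused_fields(self, config: dict) -> dict:
        """Filter out config keys that have no counterpart in the paddle model."""
        return {
            key: value
            for key, value in config.items()
            if key not in self.config_fields_to_be_removed
        }

    def convert_config(self, config: dict) -> dict:
        """Default conversion: when the parameter names already match HF,
        removing the unused fields is enough to reuse the config as-is."""
        return self.remove_transformer_unused_fields(config)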
paddlenlp/utils/converter.py
Outdated
self.compare_model_state_dicts(paddle_model, pytorch_model,
                               name_mappings)
del paddle_model, pytorch_model
Could we del the paddle model right after its run finishes and only then run the pytorch one, to avoid extra GPU memory usage?
Yes, indeed, that would make better use of the GPU memory.
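A minimal sketch of that ordering, running the paddle forward pass first, releasing the model, and only then running the torch model so that a single model occupies device memory at a time (the function and input handling below are simplified assumptions, not this PR's exact comparison logic):

import gc

import numpy as np
import paddle
import torch


def compare_outputs_sequentially(paddle_model, torch_model, input_ids: np.ndarray) -> bool:
    """Compare forward outputs while keeping only one model in memory at a time.
    Assumes both models return a single tensor; adapt for tuple/ModelOutput returns."""
    # paddle forward pass first
    paddle_model.eval()
    with paddle.no_grad():
        paddle_logits = paddle_model(paddle.to_tensor(input_ids)).numpy()
    del paddle_model
    gc.collect()  # release paddle's memory before running the torch model

    # then the torch forward pass
    torch_model.eval()
    with torch.no_grad():
        torch_logits = torch_model(torch.tensor(input_ids)).numpy()
    del torch_model
    gc.collect()

    return np.allclose(paddle_logits, torch_logits, atol=1e-4)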
LGTM
LGTM
paddlenlp/cli/converter.py
Outdated
def convert_from_local_file(weight_file_path: str, output: str):
    """convert from the local weitht file
weitht -> weight
PR types
New features

PR changes
APIs

Description
Add the `paddlenlp` command tools to do more fancy things, e.g. search models, download models, and so on.