[utc] fix loading local model in taskflow #4505

LemonNoel · 2023-01-17T06:23:33Z

PR types

Bug fixes

PR changes

APIs

Description

Fix bug when load local checkpoints from task_path in taskflow.

paddle-bot · 2023-01-17T06:23:37Z

Thanks for your contribution!

…nto utc

codecov · 2023-01-17T06:40:16Z

Codecov Report

Merging #4505 (7f51676) into develop (689428a) will increase coverage by 0.01%.
The diff coverage is 79.41%.

@@             Coverage Diff             @@
##           develop    #4505      +/-   ##
===========================================
+ Coverage    41.27%   41.29%   +0.01%     
===========================================
  Files          432      432              
  Lines        61705    61733      +28     
===========================================
+ Hits         25468    25491      +23     
- Misses       36237    36242       +5

Impacted Files	Coverage Δ
...addlenlp/taskflow/zero_shot_text_classification.py	`18.25% <33.33%> (+0.79%)`	⬆️
paddlenlp/utils/serialization.py	`88.05% <75.00%> (-0.23%)`	⬇️
paddlenlp/transformers/t5/modeling.py	`84.82% <86.95%> (-0.26%)`	⬇️
paddlenlp/transformers/nezha/modeling.py	`20.60% <0.00%> (-0.34%)`	⬇️
paddlenlp/transformers/activations.py	`78.68% <0.00%> (+1.63%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

linjieccc · 2023-01-17T06:57:44Z

paddlenlp/taskflow/zero_shot_text_classification.py

@@ -105,7 +105,7 @@ def _construct_model(self, model):
        if self.from_hf_hub:
            model_instance = UTC.from_pretrained(self._task_path, from_hf_hub=self.from_hf_hub)
        else:
-            model_instance = UTC.from_pretrained(model)
+            model_instance = UTC.from_pretrained(self._task_path)


@LemonNoel 改成from_pretrained("{local_path}")的形式后，需要定义resource_files_names和resource_files_urls并在__init__ 中增加 self._check_task_files()，可以参考这里https://github.com/PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/taskflow/information_extraction.py#L115

关于例如from_pretrained("utc_large")调用后模型不能再通过from_pretrained("{local_path}")方式加载的问题也请 @wj-Mcat 帮忙看下，我们后续看看能不能解决一下这里的gap

如果是改成这样，没必要if了，直接一行model_instance = UTC.from_pretrained(self._task_path, from_hf_hub=self.from_hf_hub) 就行了。
@linjieccc UTC taskflow有做from_pretrained以外的文件下载吗？像这种已经整合pretrained config的模型和taskflow, 建议下载功能全部由from_pretrained承载，不要再做分开的下载逻辑了

@sijunhe 嗯嗯，这里确实改成下载功能全部通过from_pretrained承载好些，也可以避免模型文件重复下载的情况，后续会针对这块处理进行统一升级

关于Taskflow内非transformers类的模型，例如像GRU-CRF，目前模型是放在$PPNLP_HOME/.taskflow/{task_name}/{model_name}，是否这部分模型后续也统一放在$PPNLP_HOME/.paddlenlp/models管理，模型加载的代码在Taskflow中实现

sijunhe · 2023-01-17T08:28:36Z

任何涉及LEGACY_CONFIG_NAME的都可以删除，因为UTC在ernie升级pretrainded config之后，不涉及向后兼容 @LemonNoel

LemonNoel · 2023-01-17T08:56:50Z

任何涉及LEGACY_CONFIG_NAME的都可以删除，因为UTC在ernie升级pretrainded config之后，不涉及向后兼容 @LemonNoel

已删除

linjieccc

LGTM

[utc] fix loading local model in taskflow

7d6d008

[utc] fix task_path in deployment

7134f4c

LemonNoel requested a review from linjieccc January 17, 2023 06:26

Merge branch 'develop' of https://github.com/PaddlePaddle/PaddleNLP i…

c4fff60

…nto utc

linjieccc reviewed Jan 17, 2023

View reviewed changes

[utc] add resource_urls for default model

afbaaa7

[utc] update taskflow

7f51676

LemonNoel requested review from sijunhe and linjieccc January 17, 2023 08:57

linjieccc approved these changes Jan 17, 2023

View reviewed changes

LemonNoel merged commit 82a303f into PaddlePaddle:develop Jan 17, 2023

LemonNoel mentioned this pull request Jan 18, 2023

[Bug]: UTC分类模型设置task_path和不设置的预测结果一样。疑似未加载模型。 #4500

Closed

1 task

LemonNoel mentioned this pull request Feb 17, 2023

PaddleNLP 2.5.1 Release Note Candidate #4852

Closed

LemonNoel deleted the utc branch March 23, 2023 13:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[utc] fix loading local model in taskflow #4505

[utc] fix loading local model in taskflow #4505

LemonNoel commented Jan 17, 2023

paddle-bot bot commented Jan 17, 2023

codecov bot commented Jan 17, 2023 •

edited

Loading

linjieccc Jan 17, 2023

sijunhe Jan 17, 2023

linjieccc Jan 17, 2023

sijunhe commented Jan 17, 2023 •

edited

Loading

LemonNoel commented Jan 17, 2023

linjieccc left a comment

[utc] fix loading local model in taskflow #4505

[utc] fix loading local model in taskflow #4505

Conversation

LemonNoel commented Jan 17, 2023

PR types

PR changes

Description

paddle-bot bot commented Jan 17, 2023

codecov bot commented Jan 17, 2023 • edited Loading

Codecov Report

linjieccc Jan 17, 2023

Choose a reason for hiding this comment

sijunhe Jan 17, 2023

Choose a reason for hiding this comment

linjieccc Jan 17, 2023

Choose a reason for hiding this comment

sijunhe commented Jan 17, 2023 • edited Loading

LemonNoel commented Jan 17, 2023

linjieccc left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 17, 2023 •

edited

Loading

sijunhe commented Jan 17, 2023 •

edited

Loading