[Models] Add Llama-3.2 #9199

DrownFish19 · 2024-09-26T02:15:24Z

PR types

New features

PR changes

Models

Description

Add Llama-3.2.

meta-llama/Llama-3.2-1B
meta-llama/Llama-3.2-1B-Instruct
meta-llama/Llama-3.2-3B
meta-llama/Llama-3.2-3B-Instruct
meta-llama/Llama-Guard-3-1B

DrownFish19 · 2024-09-26T02:16:56Z

README.md

@@ -85,7 +86,7 @@ Unified Checkpoint 大模型存储格式在模型参数分布上支持动态扩
 |     [Qwen2](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)     | Qwen/Qwen2-0.5B, Qwen/Qwen2-0.5B-Instruct, Qwen/Qwen2-1.5B, Qwen/Qwen2-1.5B-Instruct, Qwen/Qwen2-7B, Qwen/Qwen2-7B-Instruct, Qwen/Qwen2-72B, Qwen/Qwen2-72B-Instruct, Qwen/Qwen2-57B-A14B, Qwen/Qwen2-57B-A14B-Instruct                                                                                                                                                                       |
 |  [Qwen2-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)   | Qwen/Qwen2-Math-1.5B, Qwen/Qwen2-Math-1.5B-Instruct, Qwen/Qwen2-Math-7B, Qwen/Qwen2-Math-7B-Instruct, Qwen/Qwen2-Math-72B, Qwen/Qwen2-Math-72B-Instruct, Qwen/Qwen2-Math-RM-72B                                                                                                                                                                                                               |
 |    [Qwen2.5](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)    | Qwen/Qwen2.5-0.5B, Qwen/Qwen2.5-0.5B-Instruct, Qwen/Qwen2.5-1.5B, Qwen/Qwen2.5-1.5B-Instruct, Qwen/Qwen2.5-3B, Qwen/Qwen2.5-3B-Instruct, Qwen/Qwen2.5-7B, Qwen/Qwen2.5-7B-Instruct, Qwen/Qwen2.5-14B, Qwen/Qwen2.5-14B-Instruct, Qwen/Qwen2.5-32B, Qwen/Qwen2.5-32B-Instruct, Qwen/Qwen2.5-72B, Qwen/Qwen2.5-72B-Instruct                                                                     |
-| [Qwen2.5-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)  | Qwen/Qwen2.5-Math-1.5B, Qwen/Qwen2.5-Math-1.5B-Instruct, Qwen/Qwen2.5-Math-7B, Qwen/Qwen2.5-Math-7B-Instruct, Qwen/Qwen2.5-Math-72B, Qwen/Qwen2.5-Math-72B-Instruct, Qwen/Qwen2.5-Math-RM-72B                                                                                                                                                                                                                                |
+| [Qwen2.5-Math](https://github.com/PaddlePaddle/PaddleNLP/tree/develop/llm/config/qwen/)  | Qwen/Qwen2.5-Math-1.5B, Qwen/Qwen2.5-Math-1.5B-Instruct, Qwen/Qwen2.5-Math-7B, Qwen/Qwen2.5-Math-7B-Instruct, Qwen/Qwen2.5-Math-72B, Qwen/Qwen2.5-Math-72B-Instruct, Qwen/Qwen2.5-Math-RM-72B                                                                                                                                                                                                 |


空格变化，内容没有变化

DrownFish19 · 2024-09-26T02:17:33Z

README.md

@@ -96,9 +97,6 @@ Unified Checkpoint 大模型存储格式在模型参数分布上支持动态扩
 |:---------------------:|:--------:|:------------:|:--------:|:------------:|:------:|:------:|:----------:|
 |                       |          |   基础能力   | 序列并行 |    stage1    | stage2 | stage3 |            |
 |         Llama         |    ✅     |      ✅       |    ✅     |      ✅       |   ✅    |   ✅    |     ✅      |


统一LLaMA和Llama不同版本

codecov · 2024-09-26T02:48:30Z

Codecov Report

Attention: Patch coverage is 47.72727% with 23 lines in your changes missing coverage. Please review.

Project coverage is 53.02%. Comparing base (cd4e816) to head (68a5cb1).
Report is 11 commits behind head on develop.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/llama/modeling_pp.py	40.00%	12 Missing ⚠️
paddlenlp/transformers/llama/modeling.py	54.16%	11 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #9199      +/-   ##
===========================================
- Coverage    53.06%   53.02%   -0.05%     
===========================================
  Files          656      656              
  Lines       106147   106181      +34     
===========================================
- Hits         56324    56299      -25     
- Misses       49823    49882      +59

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

add llama3.2

6619c7a

DrownFish19 commented Sep 26, 2024

View reviewed changes

ZHUI previously approved these changes Sep 26, 2024

View reviewed changes

update for llama3.2

4c30595

DrownFish19 dismissed ZHUI’s stale review via 4c30595 September 26, 2024 03:21

fix jamba

68a5cb1

ZHUI approved these changes Sep 27, 2024

View reviewed changes

ZHUI merged commit db80bdd into PaddlePaddle:develop Sep 27, 2024
7 of 12 checks passed

DrownFish19 deleted the dev_20240926_add_llama3.2 branch September 27, 2024 10:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Models] Add Llama-3.2 #9199

[Models] Add Llama-3.2 #9199

DrownFish19 commented Sep 26, 2024

DrownFish19 Sep 26, 2024

DrownFish19 Sep 26, 2024

codecov bot commented Sep 26, 2024 •

edited

Loading

[Models] Add Llama-3.2 #9199

[Models] Add Llama-3.2 #9199

Conversation

DrownFish19 commented Sep 26, 2024

PR types

PR changes

Description

DrownFish19 Sep 26, 2024

Choose a reason for hiding this comment

DrownFish19 Sep 26, 2024

Choose a reason for hiding this comment

codecov bot commented Sep 26, 2024 • edited Loading

Codecov Report

codecov bot commented Sep 26, 2024 •

edited

Loading