This project is dedicated to learning and developing large language models (LLMs). It encompasses various stages of training and testing, including pretraining, supervised fine-tuning (SFT), and reinforcement learning (RL).
forked from hengjiUSTC/learn-llm
-
Notifications
You must be signed in to change notification settings - Fork 0
xuyongfu/learn-llm
About
从零预训练LLAMA3的完整指南:一个文件,探索Scaling Law
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Jupyter Notebook 91.4%
- Python 8.6%