This project is dedicated to learning and developing large language models (LLMs). It encompasses various stages of training and testing, including pretraining, supervised fine-tuning (SFT), and reinforcement learning (RL).
| Name | Name | Last commit date | ||
|---|---|---|---|---|
This project is dedicated to learning and developing large language models (LLMs). It encompasses various stages of training and testing, including pretraining, supervised fine-tuning (SFT), and reinforcement learning (RL).