1 file changed: +3 -4 lines changed
@@ -43,15 +43,14 @@ What you can expect:
 - ✅ **Learning Algorithms** - GRPO (Group Relative Policy Optimization), SFT (Supervised Fine-Tuning), and DPO (Direct Preference Optimization).
 - ✅ **Multi-Turn RL** - multi-turn generation and training for RL with tool use, games, etc.
 - ✅ **Large Model Support** - Native PyTorch support for models up to 32B parameters.
-- ✅ **Advanced Parallelism** - FSDP2, TP, and SP for efficient training.
+- ✅ **Advanced Parallelism** - PyTorch native FSDP2, TP, and SP for efficient training.
 - ✅ **Worker Isolation** - Process isolation between RL Actors (no worries about global state).
 - ✅ **Environment Isolation** - Dependency isolation between components.
 
-- 🔜 **(Even) Larger Model Support** - Native PyTorch & Megatron.
 - 🔜 **Improved Native Performance** - Improve training time for Native Pytorch Models.
-- 🔜 **Megatron Policy** - Support advanced parallelism in training with Megatron Core.
-- 🔜 **Megatron Inference** - Support Megatron Inference for day-0 support for new megatron models.
+- 🔜 **(even) Larger Model Support with Long(er) Sequence** - Support advanced parallelism in training with Megatron Core.
 - 🔜 **MoE Models** - Support DeepseekV3 and Llama4.
+- 🔜 **Megatron Inference** - Support Megatron Inference for day-0 support for new megatron models.
 
 ## Prerequisites
 