* 📊 View the release run metrics on [Google Colab](https://colab.research.google.com/drive/15kpesCV1m_C5UQFStssTEjaN2RsBMeZ0?usp=sharing) to get a head start on your experimentation.
* [5/14/2025] [Reproduce DeepscaleR with NeMo RL!](docs/guides/grpo-deepscaler.md)
* 📊 View the release run metrics on [Google Colab](https://colab.research.google.com/drive/1o14sO0gj_Tl_ZXGsoYip3C0r5ofkU1Ey?usp=sharing) to get a head start on your experimentation.
## Features
-✅ _Available now_ | 🔜 _Coming in v0.3_
+✅ _Available now_ | 🔜 _Coming in v0.4_
- ✅ **Fast Generation** - vLLM backend for optimized inference.
-- ✅ **HuggingFace Integration** - Works with 1-32B models (Qwen2.5, Llama).
-- ✅ **Distributed Training** - Fully Sharded Data Parallel (FSDP) support and Ray-based infrastructure.
+- ✅ **HuggingFace Integration** - Works with 1-70B models (Qwen, Llama).
+- ✅ **Distributed Training** - Fully Sharded Data Parallel (FSDP2) support and Ray-based infrastructure.
- ✅ **Environment Support** - Support for multi-environment training.