v3.3.0.post1
What's Changed
- Fix sampling and rft by @tastelikefeet in #3847
- Fix incorrect retry count check in `LazyLLMDataset.__getitem__` by @IamLihua in #3845
- support internvl3 by @hjh0119 in #3842
- fix grpo filter overlong by @hjh0119 in #3844
- dapo-bug by @Evilxya in #3846
- support agent packing by @Jintao-Huang in #3853
- Fix internvl2.5/3 deepspeed packing by @Jintao-Huang in #3855
- fix multimodal target_modules by @Jintao-Huang in #3856
- Fix multimodal target modules by @Jintao-Huang in #3858
- Update FAQ by @slin000111 in #3841
- fix grpo completion length equal zero by @hjh0119 in #3857
New Contributors
Full Changelog: v3.3.0...v3.3.0.post1