Skip to content

v0.5.1

Latest

Choose a tag to compare

@pan-x-c pan-x-c released this 12 Feb 10:16
· 5 commits to main since this release
c3d356c

Overview

  1. Enhanced support for multi-modal models (including Qwen2.5 VL, Qwen3 VL and Kimi-VL-A3B-Thinking series)
  2. Refactored trinity command line interface using typer
  3. Added a log management tool and fixed bugs in the logging system.
  4. Added Jensen-Shannon Divergence for on-policy distillation.
  5. Fixed bugs in model weight synchronization and over-rollout.

What's Changed

New Contributors

Full Changelog: v0.5.0...v0.5.1