Skip to content

Tunix v0.1.2: Expanded Model Support and Enhanced Flexibility

Choose a tag to compare

@wang2yn84 wang2yn84 released this 10 Oct 18:14
· 691 commits to main since this release

This release of Tunix introduces support for new models, enhances core functionalities for more flexible and efficient workflows, and includes several important fixes.

Highlights

  • Expanded Model Support: We've added a configuration for qwen-8b and ported the Llama3 example to the Tunix CLI. Additionally, GRPO disaggregated llama3.1-70b is now supported through MaxText, including checkpoint saving.
  • Enhanced Flexibility: Users can now specify a different data type for the rollout model and take advantage of more flexible PyTree support in the checkpoint manager. This release also introduces flexible collect modes and tokenization support, along with support for multiple EOS tokens in the vanilla sampler.

Other Changes

  • Downgraded Jax version to 0.7.1 in prod mode due to performance regression, dev mode still supports Jax v0.7.2
  • Fixes to the front page pip install command and GRPO examples.
  • Improvements to the checkpoint manager and resharding library.
  • Added a backward compatibility test for Orbax checkpoint restoration.
  • Various code simplifications, refactoring, and documentation updates.

Full Changelog: v0.1.1...v0.1.2

What's Changed

Full Changelog: v0.1.1...v0.1.2