v0.4.1

@pan-x-c pan-x-c released this 16 Jan 08:37
· 14 commits to main since this release
c9168b6

Overview

⭐️ The Trinity-RFT repository has moved to the agentscope-ai organization. You can find our new repository here🔗 and documentation here📚.

This is a minor release that includes several bug fixes, feature improvements, and dependency upgrades.

Explorer

  1. Add OpenAI API support for the Tinker backend, so users can now run agentic RL examples with the Tinker backend.
  2. Enhance the AgentScope Workflow Adapter to support AgentScope Tuner features.

Trainer

  1. Update veRL to v0.7.0, which includes various performance improvements and bug fixes.
  2. Fix bugs in multi-stage resume and last checkpoint saving.
  3. Stop preserving checkpoints that are written only for weight synchronization, which reduces storage usage.

Buffer

  1. Fix a batch size mismatch in the SQL buffer.

Others

  1. Introduce R3L, a systematic reflect-then-retry RL mechanism with efficient language-guided exploration and stable off-policy learning. [github repo], [paper]
  2. Improve documentation.

🚨 Breaking Changes

veRL has been upgraded to v0.7.0, and v0.5.0 is no longer supported.
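
Because of this breaking change, it may help to confirm the installed veRL version before launching a run. The following is a minimal sketch, not part of Trinity-RFT: the helper names (`parse_version`, `meets_minimum`, `installed_verl_version`) are hypothetical, and it assumes veRL is installed under the distribution name `verl`. Pre-release suffixes such as `rc1` are ignored by this simple parser.

```python
from importlib import metadata

# Minimum version taken from the breaking-change note above.
MIN_VERL = "0.7.0"


def parse_version(text):
    """Turn '0.7.0' or 'v0.7.0' into (0, 7, 0), ignoring non-numeric suffixes."""
    parts = []
    for piece in text.lstrip("v").split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at parts like 'post1' or 'dev0'
        parts.append(int(digits))
    while len(parts) < 3:
        parts.append(0)  # pad so '0.7' compares equal to '0.7.0'
    return tuple(parts)


def meets_minimum(installed, minimum=MIN_VERL):
    """Return True if the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)


def installed_verl_version(dist_name="verl"):
    """Look up the installed version; 'verl' as the distribution name is an assumption."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_verl_version()
    if found is None:
        print("veRL does not appear to be installed")
    elif meets_minimum(found):
        print(f"veRL {found} satisfies the v{MIN_VERL} requirement")
    else:
        print(f"veRL {found} is older than v{MIN_VERL}; upgrade before using this release")
```

Running the script prints one of three messages depending on what is installed; the comparison helpers can also be imported and called from an environment setup check.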

Full Changelog: v0.4.0...v0.4.1