Skip to content

v0.3.2

Choose a tag to compare

@pan-x-c pan-x-c released this 06 Nov 11:07
· 76 commits to main since this release
30ca213

Overview

This is a bug fix update that addresses many bugs present in versions 0.3. We recommend all users currently using versions 0.3.0 and 0.3.1 upgrade to this new version.

Buffer

  1. Support task scheduler and selector in task dataset
  2. Add BOTS: Online RL task selection for efficient LLM fine-tuning (paper).
  3. Extract the PriorityFunction of the replay buffer as a customizable module.

Explorer

  1. Update vLLM to v0.11.0
  2. Fix logprobs, top_k, top_p, temperature mismatch when using vLLM's OpenAI API server
  3. Fix torch cache conflicts when enforce_eager is False
  4. Simplify Workflow Inferface

Other Modules

  1. Optimize monitor metrics organization
  2. Optimize and simplify Config and config manager
  3. Add more algorithm and examples

What's Changed

New Contributors

Full Changelog: v0.3.1...v0.3.2