Skip to content

Releases: TIGER-AI-Lab/verl-tool

Release of v0.2.0

24 Dec 08:40
c25276a

Choose a tag to compare

Summary of Change

In this version, we have updated the verl version to 0.6.0 and vllm to 0.11.0 to support more models and incorporate latest features of verl. It's able to train Qwen-3-VL, Qwen3-Omni with your customized tools.

Below are the key changes and instructions for upgrading your existing VerlTool setup to be compatible with these new versions.

  • verl-tool's codebase has been completely re-organized. Thanks to the verl's agent loop abstraction design, we are able to put all the verl-tool's agentic logic in a single file verl_tool/agent_loop/verltool_agent_loop.py, with the main agent loop logic less than 200 lines of code. This greatly improves the modularity and maintainability of the codebase. Please refer to the new code structure when making any custom modifications.
  • verl-tool keeps its support for both text-only LLMs and multi-modal models training, with math_tir and pixel_reasoner as examples correspondingly.
  • We strictly force the "tokens-in" and "tokes-out" design to avoid potential off-policy issues brought by tokenization.
  • We put all the reference verl-tool's custom replacement of classes and functions in verl_tool/trainer/ppo/ray_trainer.py for better maintainability. If you are trying to understand how verl-tool replaces verl's default implementations, please refer to this file.
  • The step records are saved via verl's native trainer.rollout_data_dir argument. (e.g. trainer.rollout_data_dir=$(pwd)/verl_step_records/$run_name ). You need to set it in your training scripts to save the rollout data.
  • The verl-tool now supports hybrid training with tool and without tool. When preparing the data, simply set the use_tool field in the data samples to indicate whether the sample requires tool usage. The agent loop will automatically decide whether to call the tool server based on this field.
  • The old verl_tool with verl 0.4.1.dev is archived in the verl-0.4.1 branch for backward compatibility.
VERLTOOL Ver. VERL Ver. VLLM Ver. Modality Support Main Code Lines
0.1.0 0.4.1 0.8.4 Text, Image, Video ~1300
0.2.0 0.6.0 0.11.0 Text, Image, Video, Audio ~500

What's Changed

New Contributors

Read more

Release of VerlTool v0.1.0

10 Nov 21:22
a7dd605

Choose a tag to compare

This release is to archive existing verl-tool codebase. The new re-organized and verl-tool lite vesrion will come later.