Calc-X Example

This example demonstrates training a mathematical reasoning agent using Agent-lightning with the VERL algorithm and the AutoGen framework. The agent solves math problems by calling a calculator tool through the Model Context Protocol (MCP). The example is compatible with Agent-lightning v0.2 or later.

Requirements

This example requires a single node with at least one 40GB GPU. Follow the installation guide to install Agent-lightning and the VERL-related dependencies.

Additionally, ensure that uv and the MCP calculator server are properly installed. The agent relies on MCP to access calculator functionality during problem-solving.

pip install "autogen-agentchat" "autogen-ext[openai]" "mcp>=1.10.0"

Dataset

Download the Calc-X dataset in parquet format from here and extract it to the data folder:

unzip calc-x-data.zip -d data

The dataset contains mathematical problems with ground truth solutions for training and evaluation.
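After extraction, it can help to confirm that the files the training command expects are actually in place. The following is a minimal sketch, assuming the archive unpacks directly into `data/train.parquet` and `data/test.parquet` (the paths used by the training command in this README); the helper name `missing_dataset_files` is hypothetical and not part of this example's code.

```python
from pathlib import Path

# Assumption: these file names are taken from the training command in this
# README; the archive layout itself is not documented here.
EXPECTED_FILES = ("train.parquet", "test.parquet")


def missing_dataset_files(data_dir="data", expected=EXPECTED_FILES):
    """Return the expected file names that are not present in data_dir."""
    root = Path(data_dir)
    return [name for name in expected if not (root / name).is_file()]
```

Calling `missing_dataset_files()` from the example directory returns an empty list when both files are present; any names it returns point to a wrong extraction path or an unexpected archive layout.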

Included Files

| File/Directory | Description |
| --- | --- |
| `calc_agent.py` | Math problem-solving agent using AutoGen and MCP calculator tool |
| `train_calc_agent.py` | Training script using VERL algorithm with configurable hyperparameters |
| `eval_utils.py` | Evaluation utilities for assessing agent accuracy on math problems |
| `data/` | Directory containing training and test datasets in parquet format |
| `tests/` | Test files including MCP calculator verification script |
| `legacy_calc_agent.py` | Legacy agent implementation compatible with Agent-lightning v0.1.x (deprecated) |
| `legacy_calc_agent_debug.py` | Legacy debugging script compatible with Agent-lightning v0.1.x (deprecated) |
| `legacy_train.sh` | Legacy training script compatible with Agent-lightning v0.1.x (deprecated) |

Running Examples

Training

The training process uses distributed Ray workers to run agent rollouts in parallel while the training server optimizes the model. Start Ray before launching the training:

bash ../../scripts/restart_ray.sh

If you want to track experiments with Weights & Biases, set the WANDB_API_KEY environment variable before starting Ray.

Then run the training script:

python train_calc_agent.py --train-file data/train.parquet --val-file data/test.parquet

The script automatically launches agent workers and the training server. The agent workers execute math problem rollouts using the MCP calculator, while the training server applies the VERL algorithm to improve the model based on rewards.

Debugging

To test the agent interactively without training:

python calc_agent.py

This runs the agent on sample problems to verify that the MCP calculator integration and AutoGen setup work correctly. The test requires a reachable OpenAI service: set the OPENAI_API_KEY environment variable to the service's API key and the OPENAI_API_BASE environment variable to its base URL.
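A quick way to catch a missing variable before launching the agent is a small check like the one below. This is only a sketch: the helper name `missing_env_vars` is hypothetical, and it only tests that the two variables named above are set and non-empty, not that the key or URL is valid.

```python
import os

# The two variables the debugging run relies on, as named in this README.
REQUIRED_VARS = ("OPENAI_API_KEY", "OPENAI_API_BASE")


def missing_env_vars(names=REQUIRED_VARS, environ=None):
    """Return the variable names that are unset or empty.

    `environ` defaults to the process environment; passing a dict makes
    the check easy to exercise without touching real settings.
    """
    env = os.environ if environ is None else environ
    return [name for name in names if not env.get(name)]
```

If `missing_env_vars()` returns a non-empty list, export those variables in the shell before running `python calc_agent.py`.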

A common issue is that the agent hangs indefinitely when the environment is not properly configured. Verify that uv and the MCP calculator server are correctly installed by running:

python tests/test_mcp_calculator.py
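Because the failure mode described above is a hang rather than an error, it can be useful to run the verification script under a time limit. The sketch below wraps any command with a timeout using only the standard library; the helper names `tool_on_path` and `check_with_timeout` and the 60-second default are assumptions for illustration, not part of this example's code.

```python
import shutil
import subprocess
import sys


def tool_on_path(name):
    """Return True if an executable called `name` is found on PATH."""
    return shutil.which(name) is not None


def check_with_timeout(cmd, timeout_s=60.0):
    """Run cmd and report 'passed', 'failed', or 'timed out'.

    subprocess.run kills the child process when the timeout expires,
    so a hanging check does not block the shell indefinitely.
    """
    try:
        result = subprocess.run(
            cmd, capture_output=True, text=True, timeout=timeout_s
        )
    except subprocess.TimeoutExpired:
        return "timed out"
    return "passed" if result.returncode == 0 else "failed"
```

For instance, `tool_on_path("uv")` checks that uv is discoverable, and `check_with_timeout([sys.executable, "tests/test_mcp_calculator.py"])` reports `"timed out"` instead of hanging if the MCP calculator server never responds.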
