Examples Catalog

!!! tip "Want to Contribute?"

We welcome contributions to the examples catalog! Please refer to the [Contributing](../community/contributing.md) guide for more details.

:material-robot:{ .lg .middle } APO room selector

Prompt-optimize a room-booking agent with the built-in APO algorithm, then contrast it with the write-your-own algorithm and debugging workflows in the tutorials. Pairs well with the [Train the First Agent how-to]({{ src("docs/how-to/train-first-agent.md") }}) and the [Write the First Algorithm guide]({{ src("docs/how-to/write-first-algorithm.md") }}).

[:octicons-repo-24: Browse source]({{ src("examples/apo") }})
:material-cloud-sync:{ .lg .middle } Azure OpenAI SFT

Run a supervised fine-tuning loop against Azure OpenAI: roll out the capital-lookup agent, turn traces into JSONL, launch fine-tunes, and redeploy the resulting checkpoints through Azure CLI.

[:octicons-repo-24: Browse source]({{ src("examples/azure") }})
:material-calculator:{ .lg .middle } Calc-X VERL math

VERL-based reinforcement learning setup for a math-reasoning agent that uses AutoGen plus an MCP calculator tool to solve Calc-X problems end to end.

[:octicons-repo-24: Browse source]({{ src("examples/calc_x") }})
:material-chart-box:{ .lg .middle } ChartQA vision-language RL

LangGraph-powered workflow for answering chart questions end to end: rollout the multi-modality agent with GPT or vLLM, and train with VERL/GRPO plus self-refinement loops.

[:octicons-repo-24: Browse source]({{ src("examples/chartqa") }})
:material-code-braces:{ .lg .middle } Claude Code SWE-bench

Instrumented driver that runs Anthropic's Claude Code workflow on SWE-bench instances while streaming traces through Agent-lightning—supports hosted vLLM, official Anthropic, or any OpenAI-compatible backend and emits datasets for downstream tuning.

[:octicons-repo-24: Browse source]({{ src("examples/claude_code") }})
:material-view-grid:{ .lg .middle } Minimal building blocks

Bite-sized scripts that isolate Agent-lightning primitives (e.g., LightningStore usage, LLM proxying, minimal vLLM host) so you can study each part before composing larger workflows.

[:octicons-repo-24: Browse source]({{ src("examples/minimal") }})
:material-book-open-page-variant:{ .lg .middle } RAG (MuSiQue)

Retrieval-Augmented Generation pipeline that preps a Wikipedia retriever via MCP and trains a MuSiQue QA agent with GRPO. Documented for historical reference (verified on Agent-lightning v0.1.x).

[:octicons-repo-24: Browse source]({{ src("examples/rag") }})
:material-database:{ .lg .middle } Spider SQL agent

LangGraph-powered text-to-SQL workflow for the Spider benchmark, combining LangChain tooling with Agent-lightning rollouts; follow along with the [how-to for training SQL agents]({{ src("docs/how-to/train-sql-agent.md") }}).

[:octicons-repo-24: Browse source]({{ src("examples/spider") }})
:material-thought-bubble:{ .lg .middle } Tinker integration

Adapter package ([agl_tinker]({{ src("examples/tinker/agl_tinker") }})) with Tinker plus sample CrewAI/OpenAI agents that feed Agent-lightning traces into Tinker’s reinforcement-learning backend for both toy and 20-Questions-style workflows.

[:octicons-repo-24: Browse source]({{ src("examples/tinker") }})
:material-fast-forward:{ .lg .middle } Unsloth SFT

Supervised fine-tuning loop that ranks math-agent rollouts, fine-tunes with Unsloth’s 4-bit LoRA stack, and mirrors the [Fine-tune with Unsloth recipe]({{ src("docs/how-to/unsloth-sft.md") }}).

[:octicons-repo-24: Browse source]({{ src("examples/unsloth") }})

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Examples Catalog

FilesExpand file tree

examples-catalog.md

Latest commit

History

examples-catalog.md

File metadata and controls

Examples Catalog