WIP: DBOS docs

qianl15 · qianl15 · commit a4364f377caa · 2025-08-27T18:13:36.000-07:00
diff --git a/docs/dbos.md b/docs/dbos.md
@@ -0,0 +1,150 @@
+# Durable Execution with DBOS
+
+!!! note
+    Durable execution support is in beta and the public interface is subject to change based on user feedback. We expect it to be stable by the release of Pydantic AI v1 at the end of August. Questions and feedback are welcome in [GitHub issues](https://github.com/pydantic/pydantic-ai/issues) and the [`#pydantic-ai` Slack channel](https://logfire.pydantic.dev/docs/join-slack/).
+
+Pydantic AI allows you to build durable agents that can preserve their progress across transient API failures and application errors or restarts, and handle long-running, asynchronous, and human-in-the-loop workflows with production-grade reliability. Durable agents have full support for [streaming](agents.md#streaming-all-events) and [MCP](mcp/client.md), with the added benefit of fault tolerance.
+
+[DBOS](https://www.dbos.dev/) is a lightweight [durable execution](https://docs.dbos.dev/architecture) library that's natively supported by Pydantic AI.
+The integration only uses Pydantic AI's public interface, so it can also serve as a reference for how to integrate with other durable execution systems.
+
+## Durable Execution
+
+In DBOS's durable execution implementation, a program that crashes or encounters an exception while interacting with a model or API will retry until it can successfully complete.
+
+DBOS relies primarily on a replay mechanism to recover from failures.
+As the program makes progress, DBOS saves key inputs and decisions, allowing a re-started program to pick up right where it left off.
+
+The key to making this work is to separate the application's repeatable (deterministic) and non-repeatable (non-deterministic) parts:
+
+1. Deterministic pieces, termed [**workflows**](https://docs.dbos.dev/python/tutorials/workflow-tutorial), execute the same way when re-run with the same inputs.
+2. Non-deterministic pieces, termed [**steps**](https://docs.dbos.dev/python/tutorials/step-tutorial), can run arbitrary code, performing I/O and any other operations.
+
+Workflow code can run for extended periods and, if interrupted, resume exactly where it left off.
+Critically, workflow code generally _cannot_ include any kind of I/O, over the network, disk, etc.
+Step code faces no restrictions on I/O or external interactions, but if a step fails part-way through it is restarted from the beginning.
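+
+The replay idea can be illustrated with a toy model in plain Python. This is only a sketch and not how DBOS is implemented: the `checkpoints` dict stands in for the database, and `run_step`, `workflow`, and the step names are hypothetical names invented for the illustration.
+
+```python
+# Toy model of replay-based recovery (NOT DBOS's actual implementation).
+checkpoints: dict[int, str] = {}  # stands in for the database
+calls: list[str] = []  # records which steps really executed
+
+
+def run_step(index: int, name: str, fail: bool = False) -> str:
+    """Run a step once; on replay, return its checkpointed output instead."""
+    if index in checkpoints:
+        return checkpoints[index]
+    if fail:
+        raise RuntimeError(f'{name} crashed')
+    calls.append(name)
+    checkpoints[index] = f'{name}-result'
+    return checkpoints[index]
+
+
+def workflow(crash_on_second: bool) -> list[str]:
+    """Deterministic orchestration: the same steps in the same order on every run."""
+    first = run_step(0, 'fetch')
+    second = run_step(1, 'summarize', fail=crash_on_second)
+    return [first, second]
+
+
+try:
+    workflow(crash_on_second=True)  # first attempt fails inside step 1
+except RuntimeError:
+    pass
+
+# Recovery: step 0 is served from its checkpoint, step 1 starts from the beginning.
+result = workflow(crash_on_second=False)
+```
+
+After recovery, `calls` contains each step name exactly once: the completed step was replayed from its saved output rather than executed again, while the failed step was restarted from scratch.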
+
+
+!!! note
+
+    If you are familiar with Celery, it may be helpful to think of DBOS steps as similar to Celery tasks, except that you wait for each task to complete and obtain its result before proceeding to the next step in the workflow.
+    However, DBOS workflows and steps offer a great deal more flexibility and functionality than Celery tasks.
+
+    See the [DBOS documentation](https://docs.dbos.dev/architecture) for more information.
+
+In the case of Pydantic AI agents, integration with DBOS means that [model requests](models/index.md), [tool calls](tools.md) that may require I/O, and [MCP server communication](mcp/client.md) all need to be offloaded to DBOS steps due to their I/O requirements, while the logic that coordinates them (i.e. the agent run) lives in the workflow. Code that handles a scheduled job or web request can then execute the workflow, which will in turn execute the steps as needed.
+
+The diagram below shows the overall architecture of an agentic application in DBOS.
+DBOS is lightweight because it runs entirely in-process as a library, so your workflows and steps remain normal functions within your application that you can call from other application code. DBOS instruments them to checkpoint their state into a database, which can itself be replicated across cloud regions.
+
+```text
+                    Clients
+            (HTTP, RPC, Kafka, etc.)
+                        |
+                        v
++------------------------------------------------------+
+|               Application Servers                    |
+|                                                      |
+|   +----------------------------------------------+   |
+|   |        Pydantic AI + DBOS Libraries          |   |
+|   |                                              |   |
+|   |  [ Workflows (Agent Run Loop) ]              |   |
+|   |  [ Steps (Tool, MCP, Model) ]                |   |
+|   |  [ Queues ]   [ Cron Jobs ]   [ Messaging ]  |   |
+|   +----------------------------------------------+   |
+|                                                      |
++------------------------------------------------------+
+                        |
+                        v
++------------------------------------------------------+
+|                      Database                        |
+|   (Stores workflow and step state, schedules tasks)  |
++------------------------------------------------------+
+```
+
+See the [DBOS documentation](https://docs.dbos.dev/architecture) for more information.
+
+## Durable Agent
+
+Any agent can be wrapped in a [`DBOSAgent`][pydantic_ai.durable_exec.dbos.DBOSAgent] to get a durable agent. The wrapper automatically runs the agent run loop as a deterministic DBOS workflow and offloads work that requires I/O (namely model requests and MCP server communication) to non-deterministic steps. For flexibility, `DBOSAgent` doesn't automatically wrap other tool functions, so you can decorate them as either DBOS workflows or steps as needed.
+
+At the time of wrapping, the agent's [model](models/index.md) and [MCP server communication](mcp/client.md) are wrapped as DBOS steps, so the workflow invokes them as steps rather than calling the original functions directly. The original agent can still be used as normal outside the DBOS workflow.
+
+Here is a simple but complete example of wrapping an agent for durable execution. The only requirement is installing the DBOS [open-source library](https://github.com/dbos-inc/dbos-transact-py):
+
+```sh
+uv add pydantic-ai[dbos]
+```
+
+or if you use pip:
+
+```sh
+pip install pydantic-ai[dbos]
+```
+
+```python {title="dbos_agent.py" test="skip"}
+from dbos import DBOS, DBOSConfig
+
+from pydantic_ai import Agent
+from pydantic_ai.durable_exec.dbos import DBOSAgent
+
+dbos_config: DBOSConfig = {
+    'name': 'pydantic_dbos_agent',
+    'system_database_url': 'sqlite:///dbostest.sqlite',  # (3)!
+}
+DBOS(config=dbos_config)
+
+agent = Agent(
+    'gpt-5',
+    instructions="You're an expert in geography.",
+    name='geography',  # (4)!
+)
+
+dbos_agent = DBOSAgent(agent)  # (1)!
+
+async def main():
+    DBOS.launch()
+    result = await dbos_agent.run('What is the capital of Mexico?')  # (2)!
+    print(result.output)
+    #> Mexico City (Ciudad de México, CDMX)
+```
+
+1. The original `Agent` cannot be used inside a deterministic DBOS workflow, but the `DBOSAgent` can. Workflow function declarations and `DBOSAgent` creation need to happen before calling `DBOS.launch()`, because DBOS requires all workflows to be registered before launch so that recovery can find them.
+2. [`DBOSAgent.run()`][pydantic_ai.durable_exec.dbos.DBOSAgent.run] works like [`Agent.run()`][pydantic_ai.Agent.run], but runs inside a DBOS workflow and wraps model requests, decorated tool calls, and MCP communication as DBOS steps.
+3. This assumes DBOS is using SQLite. To deploy your agent to production, we recommend using a Postgres server.
+4. The agent's `name` is used to uniquely identify its workflows.
+
+_(This example is complete, it can be run "as is" — you'll need to add `asyncio.run(main())` to run `main`)_
+
+Because DBOS workflows need to be defined before calling `DBOS.launch()` and the `DBOSAgent` instance automatically registers `run` and `run_sync` as workflows, the `DBOSAgent` instance needs to be created before calling `DBOS.launch()` as well.
+
+For more information on how to use DBOS in Python applications, see their [Python SDK guide](https://docs.dbos.dev/python/programming-guide).
+
+## DBOS Integration Considerations
+
+There are a few considerations specific to agents and toolsets when using DBOS for durable execution. These are important to understand to ensure that your agents and toolsets work correctly with DBOS's workflow and step model.
+
+### Agent and Toolset Requirements
+
+To ensure that DBOS knows what code to run when a workflow fails or is interrupted and then restarted, each agent instance needs a unique name.
+
+Other than that, any agent and toolset will just work!
+
+### Agent Run Context and Dependencies
+
+Because DBOS checkpoints workflow and step execution into a database, workflow inputs and outputs, as well as step outputs, need to be serializable (JSON Pickleable). You may also want to keep the inputs and outputs small (usually less than 2 MB).
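+
+One way to catch problems early is to check candidate values before passing them into a durable run. The sketch below assumes that round-tripping through Python's standard `pickle` module is a reasonable proxy for what can be checkpointed; the serializer DBOS actually uses may differ. The helper `is_checkpoint_friendly` and the constant `MAX_BYTES` are hypothetical names, and the limit simply encodes the 2 MB guideline above.
+
+```python
+import pickle
+import threading
+
+MAX_BYTES = 2 * 1024 * 1024  # the ~2 MB guideline, not a hard DBOS limit
+
+
+def is_checkpoint_friendly(value: object) -> bool:
+    """Return True if the value can be pickled and fits the size guideline."""
+    try:
+        return len(pickle.dumps(value)) <= MAX_BYTES
+    except (pickle.PicklingError, TypeError, AttributeError):
+        # Raised for objects that cannot be pickled, such as locks or sockets.
+        return False
+
+
+plain_deps = {'user_id': 1, 'locale': 'en'}  # plain data pickles fine
+lock_deps = {'lock': threading.Lock()}  # a lock cannot be pickled
+
+ok = is_checkpoint_friendly(plain_deps)
+bad = is_checkpoint_friendly(lock_deps)
+```
+
+Values that hold live resources, such as locks, open connections, or file handles, typically fail this kind of check, so it is usually simpler to pass identifiers or configuration and create the resource inside a step.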
+
+### Streaming
+
+Because DBOS steps cannot stream output directly to the step call site, [`Agent.run_stream()`][pydantic_ai.Agent.run_stream] is not supported.
+
+Instead, you can implement streaming by setting an [`event_stream_handler`][pydantic_ai.agent.EventStreamHandler] on the `Agent` or `DBOSAgent` instance and using [`DBOSAgent.run()`][pydantic_ai.durable_exec.dbos.DBOSAgent.run].
+The event stream handler function will receive the agent [run context][pydantic_ai.tools.RunContext] and an async iterable of events from the model's streaming response and the agent's execution of tools. For examples, see the [streaming docs](agents.md#streaming-all-events).
+
+
+## Step Configuration
+
+TBD
+
+## Step Retries
+
+TBD
diff --git a/mkdocs.yml b/mkdocs.yml
@@ -45,6 +45,7 @@ nav:
       - common-tools.md
       - retries.md
       - temporal.md
+      - dbos.md
       - MCP:
           - mcp/index.md
           - mcp/client.md