diff --git a/src/content/docs/agents/index.mdx b/src/content/docs/agents/index.mdx
index c396697639ca9f..0744eac7e42616 100644
--- a/src/content/docs/agents/index.mdx
+++ b/src/content/docs/agents/index.mdx
@@ -24,6 +24,7 @@ import {
Build AI-powered agents that can autonomously perform tasks, persist state, browse the web, and communicate back to users in real-time over any channel.
+- **Serverless inference that scales up _and_ down:** run AI directly on Cloudflare, without pre-provisioning VMs for peak load or worrying about utilization. Call the latest open-source models on [Workers AI](/workers-ai/), and pay only for what you use.
- **Non I/O bound pricing:** don't pay for long-running processes when your code is not executing. Cloudflare Workers is designed to scale down and [only charge you for CPU time](https://blog.cloudflare.com/workers-pricing-scale-to-zero/), as opposed to wall-clock time.
- **Designed for durable execution:** [Durable Objects](/durable-objects/) and [Workflows](/workflows) are built for a programming model that enables guaranteed execution for async tasks like long-running deep thinking LLM calls, human-in-the-loop, or unreliable API calls.
- **Scalable, and reliable, without compromising on performance:** by running on Cloudflare's network, agents can execute tasks close to the user without introducing latency for real-time experiences.
diff --git a/src/content/docs/workers-ai/index.mdx b/src/content/docs/workers-ai/index.mdx
index fe961c81434031..982ec8fd9060f2 100644
--- a/src/content/docs/workers-ai/index.mdx
+++ b/src/content/docs/workers-ai/index.mdx
@@ -20,9 +20,12 @@ Run machine learning models, powered by serverless GPUs, on Cloudflare's global