In May, we shipped the kind of upgrades that help you move your AI agents into production fast and stay in control — whether you're scaling, securing AI behavior, or bringing new models to your apps.
We launched deep integrations with agent frameworks like PydanticAI and the OpenAI Agents SDK, added enterprise-grade controls to Claude Code, made it simpler to call remote MCP servers, and much more!
Here’s everything new this month:
| Area | Key Updates |
| :-- | :-- |
| **AI agent infrastructure** | • Integrations with PydanticAI, OpenAI Agents SDK, and Strands Agents<br/>• Remote MCP server support via Responses API<br/>• Arize Phoenix tracing integration |
| **AI tools** | • Integrations with Claude Code, Cline, Roo Code, and Goose |
| **Platform** | • Deep integration into Azure AI ecosystem<br/>• Multi-label support for prompt versions<br/>• Full support for `GET`, `PUT`, and `DELETE` HTTP methods<br/>• OTel analytics export |
| **New models & providers** | • Claude 4 now live<br/>• Grok 3 and Grok 3 Mini on Azure<br/>• Lepton AI, Nscale now integrated<br/>• PDF support for Claude via Anthropic and Bedrock<br/>• WorkersAI supports image generation<br/>• Tool calling enabled for Mistral and OpenRouter<br/>• MIME type support for Vertex and Google |
| **Guardrails** | • Prompt Security guardrails for injection detection and sensitive data protection<br/>• JWT validator input guardrail<br/>• PANW Prisma AIRS plugin for real-time prompt/response risk blocking<br/>• Model whitelist guardrail for org/environment/request-level control |
---
## AI Agent Infrastructure
AI agent frameworks are helping teams prototype faster, but taking agents to production requires real infrastructure. Portkey integrates with leading frameworks to bring interoperability, observability, reliability, and cost management to your agent workflows.
**PydanticAI**

Portkey now integrates PydanticAI, a Python framework that brings FastAPI-like ergonomics to building AI agents. With Portkey, you can:

- Build modular, testable agents with a clean developer experience.
- Route all agent calls through Portkey for observability and debugging.
- Add retries, fallbacks, guardrails, and cost tracking without extra infra.
See how it's done [here](https://portkey.ai/docs/integrations/agents/pydantic-ai#pydantic-ai)
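If you want a feel for the wiring, here's a minimal sketch that routes PydanticAI's model calls through Portkey's OpenAI-compatible gateway. The model name and virtual key are placeholders, and the exact PydanticAI constructor arguments vary by version, so treat the linked guide as the source of truth.

```python
# Minimal sketch (not the canonical snippet); parameter names may differ by pydantic-ai version.
from openai import AsyncOpenAI
from portkey_ai import PORTKEY_GATEWAY_URL, createHeaders
from pydantic_ai import Agent
from pydantic_ai.models.openai import OpenAIModel

# Point an OpenAI client at Portkey's gateway; Portkey then adds logging,
# retries, fallbacks, guardrails, and cost tracking to every agent call.
portkey_client = AsyncOpenAI(
    base_url=PORTKEY_GATEWAY_URL,
    api_key="not-used",  # auth travels in the Portkey headers below
    default_headers=createHeaders(
        api_key="YOUR_PORTKEY_API_KEY",
        virtual_key="YOUR_OPENAI_VIRTUAL_KEY",  # illustrative virtual key slug
    ),
)

# Newer pydantic-ai releases may expect OpenAIProvider(openai_client=...) instead.
agent = Agent(
    OpenAIModel("gpt-4o-mini", openai_client=portkey_client),
    system_prompt="Answer in one sentence.",
)

result = agent.run_sync("What does routing through a gateway add to this agent?")
print(result.output)  # `.data` on older pydantic-ai versions
```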
**OpenAI Agents SDK**

Portkey integrates with the OpenAI Agents SDK to help teams ship production-grade agents with built-in planning, memory, and tool use. You can now:

- Monitor and debug each step of the agent’s reasoning and tool use.
- Automatically track usage and cost for each agent call.
- Apply guardrails to both agent input and output.
- Scale agent-based workflows across environments with versioned control
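To make the setup concrete, here's a minimal sketch that points the Agents SDK's default OpenAI client at Portkey's gateway. The virtual key and model are placeholders; follow the Portkey docs for the exact configuration.

```python
# Minimal sketch; the virtual key and model below are placeholders.
from agents import Agent, Runner, set_default_openai_client, set_tracing_disabled
from openai import AsyncOpenAI
from portkey_ai import PORTKEY_GATEWAY_URL, createHeaders

# Every Agents SDK call now flows through Portkey, picking up logs,
# guardrails, and per-request cost tracking along the way.
set_default_openai_client(
    AsyncOpenAI(
        base_url=PORTKEY_GATEWAY_URL,
        api_key="not-used",  # auth is carried in the Portkey headers
        default_headers=createHeaders(
            api_key="YOUR_PORTKEY_API_KEY",
            virtual_key="YOUR_OPENAI_VIRTUAL_KEY",
        ),
    )
)
set_tracing_disabled(True)  # optional: skip OpenAI trace upload if no platform key is set

triage_agent = Agent(
    name="Support triage",
    instructions="Classify the ticket as low, medium, or high urgency.",
    model="gpt-4o-mini",
)

result = Runner.run_sync(triage_agent, "Checkout is failing for every user.")
print(result.final_output)
```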
**Strands Agents**

Strands Agents is a lightweight agent framework built by AWS to simplify agent development.

Portkey now integrates seamlessly with Strands Agents to make them production-ready. With this integration, you get:

- Full observability into agent steps, tool calls, and interactions
- Built-in reliability through fallbacks, retries, and load balancing
- Cost tracking and spend optimization

See how it's done [here](https://portkey.ai/docs/integrations/agents/strands).
**Tracing Integrations: Arize AI**

For teams consolidating observability into Arize, you can now view Portkey’s logs directly in Arize Phoenix to get unified trace views across your LLM workflows.

## Remote MCP servers

Portkey now supports calling remote MCP servers, maintained by developers and organizations across the internet, that expose their tools to MCP clients via the Responses API.
Read more about the integration [here](https://portkey.ai/docs/product/ai-gateway/remote-mcp).
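As a rough sketch of what a call can look like (the server URL and label are illustrative, and we're assuming the Portkey SDK mirrors the OpenAI Responses API surface here; the linked docs have the canonical setup):

```python
# Sketch: attach a remote MCP server to a Responses API call routed through Portkey.
# The MCP server URL/label are illustrative; verify the exact options in the docs.
from portkey_ai import Portkey

portkey = Portkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_OPENAI_VIRTUAL_KEY",  # provider credentials live in the virtual key
)

response = portkey.responses.create(
    model="gpt-4.1",
    tools=[
        {
            "type": "mcp",
            "server_label": "deepwiki",                    # any label you choose
            "server_url": "https://mcp.deepwiki.com/mcp",  # a publicly hosted MCP server
            "require_approval": "never",
        }
    ],
    input="Which transport protocols does the MCP spec support?",
)

print(response.output_text)
```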
## Azure AI ecosystem

More than half of Fortune 500 companies use Azure OpenAI. But building GenAI apps in the enterprise is still messy: cost attribution, routing logic, usage tracking, model evaluation... all scattered.

With Portkey’s deep integration into the Azure AI ecosystem (OpenAI, Foundry, APIM, Marketplace), teams can now build, scale, and govern GenAI apps without leaving their existing cloud setup.

Our customers are vouching for it!

<Card horizontal title="Working with Azure? Read more here." href="https://portkey.ai/for/azure">
</Card>
## Portkey for AI Tools

<CardGroup cols={2}>
  <Card
    title="Claude Code"
    icon="terminal"
    href="/docs/integrations/libraries/claude-code"
  >
    Bring enterprise-grade visibility, governance, and access control to Claude Code.
  </Card>

  <Card
    title="Cline"
    icon="code"
    href="/docs/integrations/libraries/cline"
  >
    Supercharge your AI-powered terminal with cost tracking, access controls, and observability.
  </Card>

  <Card
    title="Roo Code"
    icon="rocket"
    href="/docs/integrations/libraries/roo-code"
  >
    Add security, compliance, and real-time analytics to your code assistant workflows.
  </Card>

  <Card
    title="Goose"
    icon="feather"
    href="/docs/integrations/libraries/goose"
  >
    Add essential enterprise controls to Goose's powerful autonomous coding capabilities.
  </Card>
</CardGroup>
## Multimodal embeddings

Portkey now supports embedding APIs from Vertex AI for text, image, and video—across multiple languages.

This unlocks the ability to:

- Build multimodal search and retrieval
- Power multimodal RAG pipelines
- Track, route, and optimize embedding usage at scale
Read more about the implementation [here](https://portkey.ai/docs/integrations/llms/vertex-ai/embeddings)
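For a sense of the call shape, here's a minimal text-embedding sketch through a Vertex AI virtual key. The model id and keys are placeholders, and image/video inputs follow the request format described in the linked docs.

```python
# Sketch: embeddings via Vertex AI through Portkey. The model id and virtual key
# are placeholders; image and video inputs follow the format in the linked docs.
from portkey_ai import Portkey

portkey = Portkey(
    api_key="YOUR_PORTKEY_API_KEY",
    virtual_key="YOUR_VERTEX_AI_VIRTUAL_KEY",
)

embedding = portkey.embeddings.create(
    model="text-multilingual-embedding-002",  # assumed Vertex embedding model id
    input="Portkey routes embedding traffic with full observability.",
)

print(len(embedding.data[0].embedding))  # vector dimensionality
```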
## Platform
**Multi-label support for prompts**
You can now assign multiple labels to a single prompt version, making it easy to promote a version across environments like staging and production.
**Gateway to any API**
Portkey now supports `GET`, `PUT`, and `DELETE` HTTP methods in addition to `POST`, allowing you to route requests to any external or self-hosted provider endpoint. This means you can connect to custom APIs directly through Portkey with full observability for every call.
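As a rough illustration of the idea (the header names, provider slug, and path mapping below are assumptions based on Portkey's custom-host routing, not a verified snippet; check the gateway docs for the exact setup):

```python
# Rough sketch: forward a GET request to your own API through Portkey's gateway so
# the call is logged like any other. Header names and path mapping are assumptions.
import requests

resp = requests.get(
    "https://api.portkey.ai/v1/inventory/items",  # path assumed to be forwarded to the custom host
    headers={
        "x-portkey-api-key": "YOUR_PORTKEY_API_KEY",
        "x-portkey-provider": "openai",                           # assumed: a provider slug is still set
        "x-portkey-custom-host": "https://internal.example.com",  # your own or self-hosted API
        "Authorization": "Bearer YOUR_UPSTREAM_API_KEY",          # forwarded to the upstream service
    },
    timeout=30,
)
print(resp.status_code, resp.json())
```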
**OTel analytics export**

You can now export Portkey analytics to any OpenTelemetry (OTel)-compatible collector, bringing LLM usage data into your existing observability stack.

**Improvements**

- Ping messages are removed from streamed responses.
- Metadata columns in logs can now be resized.
## New models & providers

- **Claude 4** is now live for advanced reasoning and coding.
- **Grok 3 & Grok 3 Mini** are available on Azure.
- **Lepton AI** is now integrated.
- **Nscale** models can now be accessed through Portkey.

**Updates**

- **PDF support for Claude** via Anthropic and Bedrock.
- **Gemini 2.5 Thinking Mode** is now supported in the Prompt Playground.
- **Extended thinking** is available for Claude 3.7 and Claude 4.
- Image generation is now supported on WorkersAI.
- **Tool calling** is now live for Mistral and OpenRouter.
- **MIME type** support for Vertex AI and Google.
## Guardrails
- **Prompt Security guardrails**: Integrate with Prompt Security to detect prompt injection and prevent sensitive data exposure in both prompts and responses.
- **JWT validator guardrail**: Added as an input guardrail to validate incoming JWT tokens before requests are sent to the LLM.
- **PANW Prisma AIRS Plugin**: Portkey now integrates with Palo Alto Networks' AIRS (AI Runtime Security) to enforce guardrails that block risky prompts or model responses based on real-time security analysis.
- **Model whitelist guardrail**: Restrict or deny specific models at the org, environment, or request level using a flexible whitelist/blacklist guardrail.
0 commit comments