You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
Stability AI models deployed via standard deployment implement the Foundry Models API on the route `/image/generations`.
@@ -358,15 +367,6 @@ For examples of how to use Stability AI models, see the following examples:
358
367
-[Use Requests library with Stable Diffusion 3.5 Large for image to image requests](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/stabilityai/Image_to_Image.ipynb)
359
368
-[Example of a fully encoded image generation response](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/stabilityai/Sample_image_generation_response.txt)
360
369
361
-
### xAI
362
-
363
-
**Grok** is a family of models designed...
364
-
365
-
| Model | Type | Tier | Capabilities |
366
-
| ------ | ---- | --- | ------------ |
367
-
|[Grok]()|| Global standard | - **Input:** text (0 tokens) <br /> - **Output:** text (0 tokens) <br /> - **Languages:** <br /> - **Tool calling:** <br /> - **Response formats:**|
368
-
|[grok-3-mini]()|| Global standard | - **Input:** text (0 tokens) <br /> - **Output:** text (0 tokens) <br /> - **Languages:** <br /> - **Tool calling:** <br /> - **Response formats:**|
tsuzumi-7b | [Microsoft Managed Countries/Regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | East US 2 <br> South Central US <br> East US <br> West US 3 <br> West US <br> North Central US | East US 2 <br> East US <br> North Central US <br> South Central US <br> West US <br> West US 3 |
116
116
117
-
### xAI
118
-
119
-
| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine tuning |
120
-
|---------|---------|---------|---------|
121
-
grok-3 | [Microsoft Managed Countries/Regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | | Not available |
122
-
grok-3-mini | [Microsoft Managed Countries/Regions](/partner-center/marketplace/tax-details-marketplace#microsoft-managed-countriesregions) | | Not available |
Copy file name to clipboardExpand all lines: articles/ai-foundry/model-inference/concepts/models.md
+3-4Lines changed: 3 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -208,13 +208,12 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
208
208
209
209
### xAI
210
210
211
-
**Grok** is a family of models designed...
211
+
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
212
212
213
213
| Model | Type | Tier | Capabilities |
214
214
| ------ | ---- | --- | ------------ |
215
-
|[Grok]()|| Global standard | - **Input:** text (0 tokens) <br /> - **Output:** text (0 tokens) <br /> - **Languages:** <br /> - **Tool calling:** <br /> - **Response formats:**|
216
-
|[grok-3-mini]()|| Global standard | - **Input:** text (0 tokens) <br /> - **Output:** text (0 tokens) <br /> - **Languages:** <br /> - **Tool calling:** <br /> - **Response formats:**|
217
-
215
+
|[grok-3](https://ai.azure.com/explore/models/grok-3/version/1/registry/azureml-xai)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |
216
+
|[grok-3-mini](https://ai.azure.com/explore/models/grok-3-mini/version/1/registry/azureml-xai)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |
Copy file name to clipboardExpand all lines: articles/ai-services/agents/concepts/agent-catalog.md
+4-7Lines changed: 4 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ description: This article provides instructions on how to use the AI agent catal
5
5
manager: nitinme
6
6
ms.service: azure-ai-agent-service
7
7
ms.topic: how-to
8
-
ms.date: 04/29/2025
8
+
ms.date: 05/18/2025
9
9
ms.author: aahi
10
10
author: aahill
11
11
---
@@ -59,25 +59,22 @@ A full list of agent samples in the catalog can be found on the Azure AI Foundry
59
59
|[AI Red Teaming Agent](https://github.com/microsoft/agent-catalog/tree/main/semantic-kernel-blueprints/ai-red-teaming-agent)| Facilitates the development of a copilot to accelerate your AI red teaming process: through multi-agent system that automates the generation, transformation, and execution of harmful prompts against target generative AI models or applications for AI red teaming purposes. Useful for streamlining safety testing workflows, surfacing guardrail bypasses, and guiding risk mitigation planning. | Microsoft | Multi-agent | Semantic Kernel | Advanced | N/A |
60
60
|[Saifr Communication Compliance Agent](https://aka.ms/saifr-communication-agent)| The Saifr Communication Compliance Agent identifies potentially noncompliant text and generates a more compliant, fair, and balanced version, helping end users better adhere to relevant regulatory guidelines | Saifr from Fidelity Labs | Single-agent | Agent AI Agent Service | Intermediate | OpenAPI Specified Tool |
61
61
|[Auquan Due Diligence Risk Analyst](https://aka.ms/due-diligence-risk-analyst-agent)| Helps create agents that assess risks across financial, operational, regulatory, and ESG domains | Auquan | Single-agent | Agent AI Agent Service | Intermediate | OpenAPI Specified Tool |
62
-
|[Healthcare Multi-agent Orchestrator](https://aka.ms/healthcare-multi-agent)| Facilitates the development and testing of modular specialized agents that coordinate across diverse data types and tools like M365 and Teams to assist multi-disciplinary healthcare workflows—such as cancer care. | Microsoft | Multi-agent | Semantic Kernel | Advanced ||
62
+
|[Healthcare Agent Orchestrator](https://aka.ms/healthcare-multi-agent)| Facilitates the development and testing of modular specialized agents that coordinate across diverse data types and tools like M365 and Teams to assist multi-disciplinary healthcare workflows—such as cancer care. | Microsoft | Multi-agent | Semantic Kernel | Advanced ||
63
63
|[ResearchFlow Agent](https://aka.ms/research-flow)| Helps create agents that execute complex, multi-step research workflows and solve open-ended tasks | Microsoft | Multi-agent | Agent AI Agent Service | Advanced ||
64
64
|[Magentic-One Agent](https://aka.ms/magnetic-one)| A generalist, autonomous multi-agent system that performs deep research and problem-solving by orchestrating web search, code generation, and code execution agents. Helpful for tackling open-ended analytical or technical tasks. | Microsoft | Multi-agent | Agent AI Agent Service | Advanced ||
65
65
|[SightMachine Filler Optimization Agent](https://aka.ms/sight-machine-filler-optimization-agent)| The SightMachine Filler Optimization Agent supports building agents that analyze manufacturing data to reduce bottlenecks and improve throughput via predictive insights | SightMachine | Single-agent | Agent AI Agent Service | Intermediate | Azure Functions |
66
66
|[Marquee Insights AI News Agent](https://aka.ms/ai-news-agent)| Enables creating an agent that retrieves and summarize news focused on Microsoft, healthcare, and legal sectors | Marquee Insights | Single-agent | Agent AI Agent Service | Intermediate ||
67
-
|[MiHCM HR Assist Agent](https://aka.ms/ hr-agent) | Supports agent development for HR scenarios by enabling employees to navigate HR-related records like leave balances, HR requests and work activities using MiHCM's HR APIs | MiHCM | Single-agent | Agent AI Agent Service | Intermediate | OpenAPI Specified Tool |
68
-
|[Claim Concierge](https://aka.ms/claim-concierge)| Helps create agents for multi-lingual claim navigation | Microsoft | Multi-agent | Agent AI Agent Service | Beginner | Connected Agents, File Search, Grounding with Bing, Code Interpreter |
67
+
|[MiHCM HR Assist Agent](https://aka.ms/hr-agent)| Supports agent development for HR scenarios by enabling employees to navigate HR-related records like leave balances, HR requests and work activities using MiHCM's HR APIs | MiHCM | Single-agent | Agent AI Agent Service | Intermediate | OpenAPI Specified Tool |
69
68
|[Portfolio Navigator](https://aka.ms/trusty-link)| Supports agent creation for exploring financial topics from Morningstar data and Grounding with Bing | Microsoft | Single-agent | Agent AI Agent Service | Beginner | Morningstar, Grounding with Bing |
70
69
|[Travel Planner](https://aka.ms/travel-planner)| Enables agent creation for travel scenarios | Microsoft | Single-agent | Agent AI Agent Service | Beginner | File Search, Code Interpreter, Tripadvisor, OpenAPI Specified Tool |
71
70
|[Home Loan Guide](https://aka.ms/home-loan-guide)| Enables agent creation to provide users with helpful information about mortgage applications at a fictitious company, Contoso Bank. | Microsoft | Single-agent | Agent AI Agent Service | Beginner | Connected Agents, File Search, Code Interpreter, Grounding with Bing |
72
71
|[Sales Analyst Agent](https://aka.ms/sales-analyst)| Supports building agents that analyze sales data | Microsoft | Single-agent | Agent AI Agent Service | Beginner | File Search, Code Interpreter |
73
72
|[Customer Service Agent](https://aka.ms/customer-service)| Helps create a multi-agent system that manages full-cycle support resolution —from authentication to escalation to resolution | Microsoft | Multi-agent | Agent AI Agent Service | Advanced ||
74
73
|[Warranty Claim Processing Agent](https://aka.ms/warranty-claim-processing)| Facilitates the development of agents for processing warranty claims | Microsoft | Single-agent | Semantic Kernel | Intermediate | OpenAPI Specified Tool |
75
74
|[Voice Live Agent](https://aka.ms/voice-live-agent)| Enables agent development for real-time, voice-based interactions using Azure AI Voice Live API. | Microsoft | Single-agent | Agent AI Agent Service | Intermediate ||
76
-
|[Meeting Prep Agent](https://aka.ms/meeting-prep-agent)| Helps build an agent that helps with meetings by researching attendees and generating contextual summaries | Microsoft | Single-agent | Agent AI Agent Service | Intermediate | Grounding with Bing, Azure Logic Apps |
77
-
|[CommsPilot](https://aka.ms/comms-pilot)| Enables agent creation for personalized outbound sales emails and outreach logging | Microsoft | Single-agent | Agent AI Agent Service | Intermediate | File Search, Grounding with Bing, Azure Logic Apps |
78
75
|[Text Translation Agent](https://aka.ms/translation-agent)| Helps create agents that handle multilingual text processing, including dynamic language detection and bidirectional translation using Azure AI Translator service | Microsoft | Single-agent | Agent AI Agent Service | Beginner | OpenAPI Specified Tool |
79
76
|[Video Translation Agent](https://aka.ms/video-translation-agent)| Supports building agents for multilingual video localization with translation, subtitles, and speech generation | Microsoft | Single-agent | Semantic Kernel | Beginner | N/A |
80
77
|[Intent Routing Agent](https://aka.ms/intent-routing)| Helps create agents that detect user intent and provide exact answering. Perfect for deterministically intent routing and exact question answering with human controls. | Microsoft | Single-agent | Agent AI Agent Service | Beginner | OpenAPI Specified Tool |
81
78
|[Exact Question Answering Agent](https://aka.ms/exact-question-answering)| Supports building agents that answer predefined, high-value questions to ensure consistent and accurate responses. | Microsoft | Single-agent | Agent AI Agent Service | Beginner | OpenAPI Specified Tool |
Copy file name to clipboardExpand all lines: articles/search/retrieval-augmented-generation-overview.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -58,7 +58,7 @@ A high-level summary of the pattern looks like this:
58
58
+ Send it to Azure AI Search to find relevant information.
59
59
+ Return the top ranked search results to an LLM.
60
60
+ Use the natural language understanding and reasoning capabilities of the LLM to generate a response to the initial prompt.
61
-
+ Optionally, use agentic RAG where an agent evaluates an answer and finds a better one if the original answer is incomplete or low quality.
61
+
+ Optionally, use agentic retrieval where an agent evaluates an answer and finds a better one if the original answer is incomplete or low quality.
62
62
63
63
Azure AI Search provides inputs to the LLM prompt, but doesn't train the model. In RAG architecture, there's no extra training. The LLM is pretrained using public data, but it generates responses that are augmented by information from the retriever, in this case, Azure AI Search.
0 commit comments