
Commit 95e4727

Merge branch 'main' into release-preview-2-cu
2 parents 841d671 + 4c3181c commit 95e4727

File tree: 10 files changed (+168 −100 lines)

articles/ai-services/agents/how-to/metrics.md

Lines changed: 81 additions & 0 deletions
@@ -0,0 +1,81 @@
---
title: Monitor Azure AI Agent Service
description: Start here to learn how to use Azure Monitor to capture and analyze metrics for your Azure AI Agent Service.
ms.date: 03/20/2025
ms.custom: horz-monitor, subject-monitoring
ms.topic: conceptual
author: aahill
ms.author: aahi
ms.service: azure-ai-agent-service
---

# Monitor Azure AI Agent Service

[!INCLUDE [horz-monitor-intro](~/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-intro.md)]

Monitoring is available for agents in a [standard agent setup](../quickstart.md?pivots=programming-language-csharp#choose-basic-or-standard-agent-setup).

## Dashboards

Azure AI Agent Service provides out-of-the-box dashboards. There are two key dashboards for monitoring your resource:

- The metrics dashboard in the AI Foundry resource view
- The dashboard in the overview pane within the Azure portal

To access the monitoring dashboards, sign in to the [Azure portal](https://portal.azure.com), select **Monitoring** in the left navigation menu, and then select **Metrics**.

:::image type="content" source="../media/monitoring/dashboard.png" alt-text="Screenshot that shows out-of-box dashboards for a resource in the Azure portal." lightbox="../media/monitoring/dashboard.png" border="false":::

## Azure Monitor platform metrics

Azure Monitor provides platform metrics for most services. These metrics are:

* Individually defined for each namespace.
* Stored in the Azure Monitor time-series metrics database.
* Lightweight and capable of supporting near real-time alerting.
* Used to track the performance of a resource over time.
* Collected automatically by Azure Monitor. No configuration is required.

For a list of all metrics that you can gather for all resources in Azure Monitor, see [Supported metrics in Azure Monitor](/azure/azure-monitor/platform/metrics-supported).

Azure AI Agent Service shares metrics with a subset of Azure AI services. For a list of the metrics available for Azure AI Agent Service, see the [monitoring data reference](../reference/monitor-service.md#metrics).

## Analyze monitoring data

There are many tools for analyzing monitoring data.

### Azure Monitor tools

Azure Monitor provides the metrics explorer, a tool in the Azure portal that allows you to view and analyze metrics for Azure resources. For more information, see [Analyze metrics with Azure Monitor metrics explorer](/azure/azure-monitor/essentials/metrics-getting-started).

### Azure Monitor export tools

You can get data out of Azure Monitor into other tools by using the [REST API for metrics](/rest/api/monitor/operation-groups) to extract metric data from the Azure Monitor metrics database. The API supports filter expressions to refine the data retrieved. For more information, see [Azure Monitor REST API reference](/rest/api/monitor/filter-syntax).

To get started with the REST API for Azure Monitor, see [Azure monitoring REST API walkthrough](/azure/azure-monitor/essentials/rest-api-walkthrough).
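
For example, here's a minimal sketch of pulling agent metrics programmatically. It assumes the `azure-monitor-query` and `azure-identity` Python packages and a placeholder resource ID, and queries the `Runs` metric for the last day:

```python
from datetime import timedelta

from azure.identity import DefaultAzureCredential
from azure.monitor.query import MetricAggregationType, MetricsQueryClient

# Placeholder: the full Azure resource ID of the resource that emits your agent metrics.
resource_id = "<your-resource-id>"

client = MetricsQueryClient(DefaultAzureCredential())

# Query the total number of agent runs over the last day, in one-hour buckets.
response = client.query_resource(
    resource_id,
    metric_names=["Runs"],
    timespan=timedelta(days=1),
    granularity=timedelta(hours=1),
    aggregations=[MetricAggregationType.TOTAL],
)

for metric in response.metrics:
    for series in metric.timeseries:
        for point in series.data:
            print(f"{point.timestamp}: {point.total}")
```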

## Alerts

Azure Monitor alerts proactively notify you when specific conditions are found in your monitoring data. Alerts allow you to identify and address issues in your system before your customers notice them. For more information, see Azure Monitor alerts.

There are many sources of common alerts for Azure resources. [The Azure Monitor Baseline Alerts (AMBA)](https://aka.ms/amba) site provides a semi-automated method of implementing important platform metric alerts, dashboards, and guidelines. The site applies to a continually expanding subset of Azure services, including all services that are part of the Azure Landing Zone (ALZ).

The common alert schema standardizes the consumption of Azure Monitor alert notifications. For more information, see [Common alert schema](/azure/azure-monitor/alerts/alerts-common-schema).

[Metric alerts](/azure/azure-monitor/alerts/alerts-types#metric-alerts) evaluate resource metrics at regular intervals. Metric alerts can also apply multiple conditions and dynamic thresholds.
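
As an illustration, the following is a minimal sketch of a metric alert rule for the `Runs` metric. It assumes the `azure-mgmt-monitor` and `azure-identity` Python packages and placeholder subscription, resource group, and resource IDs:

```python
from azure.identity import DefaultAzureCredential
from azure.mgmt.monitor import MonitorManagementClient
from azure.mgmt.monitor.models import (
    MetricAlertResource,
    MetricAlertSingleResourceMultipleMetricCriteria,
    MetricCriteria,
    MetricDimension,
)

# Placeholders: substitute your own subscription, resource group, and resource IDs.
subscription_id = "<subscription-id>"
resource_group = "<resource-group>"
resource_id = "<your-resource-id>"

client = MonitorManagementClient(DefaultAzureCredential(), subscription_id)

# Alert when more than 10 failed runs occur within a 15-minute window.
rule = MetricAlertResource(
    location="global",
    description="Alert on failed agent runs.",
    severity=2,
    enabled=True,
    scopes=[resource_id],
    evaluation_frequency="PT5M",
    window_size="PT15M",
    criteria=MetricAlertSingleResourceMultipleMetricCriteria(
        all_of=[
            MetricCriteria(
                name="FailedRuns",
                metric_name="Runs",
                operator="GreaterThan",
                threshold=10,
                time_aggregation="Total",
                dimensions=[
                    MetricDimension(name="RunStatus", operator="Include", values=["failed"])
                ],
            )
        ]
    ),
    actions=[],
)

client.metric_alerts.create_or_update(resource_group, "agent-failed-runs-alert", rule)
```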

Every organization's alerting needs vary and can change over time. Generally, all alerts should be actionable and have a specific intended response if the alert occurs. If an alert doesn't require an immediate response, the condition can be captured in a report rather than an alert. Some use cases might require alerting anytime certain error conditions exist. In other cases, you might need alerts for errors that exceed a certain threshold for a designated time period.

Depending on the type of application you're developing with Azure AI Agent Service, [Azure Monitor Application Insights](/azure/azure-monitor/overview) might offer more monitoring benefits at the application layer.

### Azure AI Agent Service alert rules

You can set alerts for any metric listed in the [monitoring data reference](../reference/monitor-service.md).

[!INCLUDE [horz-monitor-advisor-recommendations](~/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-advisor-recommendations.md)]

## Related content

- See [Monitoring data reference](../reference/monitor-service.md) for a reference of the metrics and other important values created for Azure AI Agent Service.
- See [Monitoring Azure resources with Azure Monitor](/azure/azure-monitor/essentials/monitor-azure-resource) for general details on monitoring Azure resources.

articles/ai-services/agents/how-to/use-your-own-resources.md

Lines changed: 4 additions & 3 deletions
@@ -41,7 +41,7 @@ Replace the parameter value for `aiServiceAccountResourceId` with the full arm r
 
 The value returned is the `aiServiceAccountResourceId` you need to use in the template.
 
-2. In the basic agent template file, replace the following placeholders:
+3. In the basic agent template file, replace the following placeholders:
 
 ```
 aiServiceAccountResourceId:/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.CognitiveServices/accounts/{serviceName}
@@ -74,9 +74,10 @@ Use an existing AI Services / Azure OpenAI, Azure Storage account, Azure Cosmos
 ```az login```
 2. Then run the command:
 
-```az search service show --resource-group <your-resource-group> --name <your-storage-account> --query "id" --output tsv```
-
+```az storage account show --resource-group <your-resource-group> --name <your-storage-account> --query "id" --output tsv```
+
 The output is the `aiStorageAccountResourceID` you need to use in the template.
+
 3. In the standard agent template file, replace the following placeholders:
 
 ```

3 binary image files changed (175 KB, 196 KB, 54 KB)

articles/ai-services/agents/reference/monitor-service.md

Lines changed: 45 additions & 0 deletions
@@ -0,0 +1,45 @@
---
title: Monitoring data reference for Azure AI Agent Service
description: This article contains important reference material you need when you monitor Azure AI Agent Service by using Azure Monitor.
ms.date: 03/24/2025
ms.custom: horz-monitor, subject-monitoring
ms.topic: reference
author: aahill
ms.author: aahi
ms.service: azure-ai-agent-service
---

# Azure AI Agent Service monitoring data reference

[!INCLUDE [horz-monitor-ref-intro](~/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-intro.md)]

See [Monitor Azure AI Agent Service](../how-to/metrics.md) for details on the data you can collect on your agents.

## Metrics

Here are the most important metrics we think you should monitor for Azure AI Agent Service. A longer list of all available metrics appears later in this article, with more details about the metrics in this shorter list. _See the following list for the most up-to-date information. We're working on refreshing the tables in the later sections._

- [Runs](#category-agents)
- [Indexed files](#category-agents)
<!-- - Indexed files -->

## Supported metrics

This section lists all the automatically collected platform metrics for this service. These metrics are also part of the global list of [all platform metrics supported in Azure Monitor](/azure/azure-monitor/reference/supported-metrics/metrics-index#supported-metrics-per-resource-type).

[!INCLUDE [horz-monitor-ref-metrics-tableheader](~/reusable-content/ce-skilling/azure/includes/azure-monitor/horizontals/horz-monitor-ref-metrics-tableheader.md)]
[!INCLUDE [Microsoft.MachineLearningServices/workspaces](~/reusable-content/ce-skilling/azure/includes/azure-monitor/reference/metrics/microsoft-machinelearningservices-workspaces-metrics-include.md)]

## Category: Agents

|Metric |Name in REST API |Unit | Aggregation | Dimension | Time grains | DS Export |
|---------|---------|---------|---------|---------|---------|---------|
|Runs <br> The number of runs in a given timeframe. | `Runs` | Count | Total (sum), Average, Minimum, Maximum, Count | `ResourceId`, `ProjectId`, `AgentId`, `StreamType`, `Region`, `StatusCode (successful, clienterrors, server errors)`, `RunStatus (started, completed, failed, cancelled, expired)` | PT1M | Yes |
|Indexed files <br> Number of files indexed for file search. | `IndexedFiles` | Count | Count, Average, Minimum, Maximum | `ResourceId`, `ProjectId`, `VectorStoreId`, `StreamType`, `Region`, `Status`, `ErrorCode` | PT1M | Yes |
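
To make the dimensions concrete, here's a minimal sketch, assuming the `azure-monitor-query` and `azure-identity` Python packages and a placeholder resource ID, that splits the `Runs` metric by the `RunStatus` dimension:

```python
from datetime import timedelta

from azure.identity import DefaultAzureCredential
from azure.monitor.query import MetricAggregationType, MetricsQueryClient

# Placeholder: the full Azure resource ID of the resource that emits your agent metrics.
resource_id = "<your-resource-id>"

client = MetricsQueryClient(DefaultAzureCredential())

# Split the Runs metric by the RunStatus dimension (started, completed, failed, cancelled, expired).
response = client.query_resource(
    resource_id,
    metric_names=["Runs"],
    timespan=timedelta(hours=6),
    granularity=timedelta(minutes=30),
    aggregations=[MetricAggregationType.TOTAL],
    filter="RunStatus eq '*'",
)

for metric in response.metrics:
    for series in metric.timeseries:
        total = sum(point.total or 0 for point in series.data)
        print(f"{series.metadata_values}: {total} runs")
```
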
## Related content

- See [Monitor Azure AI Agent Service](../how-to/metrics.md) for a description of monitoring Azure AI Agent Service.
- See [Monitor Azure resources with Azure Monitor](/azure/azure-monitor/essentials/monitor-azure-resource) for details on monitoring Azure resources.

articles/ai-services/agents/toc.yml

Lines changed: 4 additions & 0 deletions
@@ -58,6 +58,8 @@ items:
   href: ../openai/how-to/content-filters.md?context=/azure/ai-services/agents/context/context
 - name: Use virtual networks
   href: how-to/virtual-networks.md
+- name: Service monitoring
+  href: how-to/metrics.md
 - name: Use the Visual Studio Code extension
   href: ../../ai-foundry/how-to/develop/vs-code-agents.md?context=/azure/ai-services/agents/context/context
 - name: Responsible AI
@@ -84,6 +86,8 @@ items:
   href: https://github.com/openai/openai-dotnet/blob/main/README.md
 - name: Python
   href: https://github.com/openai/openai-python/blob/main/README.md
+- name: Data monitoring reference
+  href: reference/monitor-service.md
 - name: Resources
   items:
   - name: Support and help options

articles/ai-services/agents/whats-new.md

Lines changed: 11 additions & 1 deletion
@@ -7,7 +7,7 @@ author: aahill
 ms.author: aahi
 ms.service: azure-ai-agent-service
 ms.topic: overview
-ms.date: 01/30/2025
+ms.date: 04/23/2025
 ms.custom: azure-ai-agents
 ---
 
@@ -16,9 +16,19 @@ ms.custom: azure-ai-agents
 This article provides a summary of the latest releases and major documentation updates for Azure AI Agent Service.
 
 ## April 2025
+
+### Azure Monitor integration
+
+You can now see metrics related to agents in Azure Monitor:
+* The number of files indexed for file search.
+* The number of runs in a given timeframe.
+
+See the [Azure Monitor](./how-to/metrics.md) and [metrics reference](./reference/monitor-service.md) articles for more information.
+
 ### BYO thread storage
 The Standard Agent Setup now supports **Bring Your Own (BYO) thread storage using an Azure Cosmos DB for NoSQL account**. This feature ensures all thread messages and conversation history are stored in your own resources. See the [Quickstart](./quickstart.md) for more information on how to deploy a Standard agent project.
 
+
 ## March 2025
 
 ### Microsoft Fabric tool

articles/ai-services/openai/how-to/responses.md

Lines changed: 21 additions & 94 deletions
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's new stateful Responses API.
 manager: nitinme
 ms.service: azure-ai-openai
 ms.topic: include
-ms.date: 04/23/2025
+ms.date: 03/21/2025
 author: mrbullwinkle
 ms.author: mbullwin
 ms.custom: references_regions
@@ -56,9 +56,9 @@ Not every model is available in the regions supported by the responses API. Chec
 > - Structured outputs
 > - tool_choice
 > - image_url pointing to an internet address
-> - The web search tool is also not supported, and isn't part of the `2025-03-01-preview` API.
+> - The web search tool is also not supported, and is not part of the `2025-03-01-preview` API.
 >
-> There's also a known issue with vision performance when using the Responses API, particularly with OCR tasks. As a temporary workaround set image detail to `high`. This article will be updated once this issue is resolved and as any additional feature support is added.
+> There is also a known issue with vision performance when using the Responses API, particularly with OCR tasks. As a temporary workaround set image detail to `high`. This article will be updated once this issue is resolved and as any additional feature support is added.
 
 
 ### Reference documentation
@@ -96,16 +96,6 @@ response = client.responses.create(
 input="This is a test."
 #truncation="auto" required when using computer-use-preview model.
 
-response_id = response.id
-response_status = response.status
-
-
-print(f"\n Response ID: {response_id}")
-print(f"\n Response Status: {response_status}\n")
-
-print(response.model_dump_json(indent=2))
-
-
 )
 ```
 
@@ -128,15 +118,6 @@ response = client.responses.create(
 input="This is a test."
 #truncation="auto" required when using computer-use-preview model.
 
-response_id = response.id
-response_status = response.status
-
-
-print(f"\n Response ID: {response_id}")
-print(f"\n Response Status: {response_status}\n")
-
-print(response.model_dump_json(indent=2))
-
 )
 ```
 
@@ -171,111 +152,57 @@ curl -X POST https://YOUR-RESOURCE-NAME.openai.azure.com/openai/responses?api-ve
 **Output:**
 
 ```json
-Response ID: resp_680915b58140819085f4c55454402f3600400b1e6ec996fc
-
-Response Status: completed
-
 {
-"id": "resp_680915b58140819085f4c55454402f3600400b1e6ec996fc",
-"created_at": 1745425845.0,
+"id": "resp_67cb32528d6881909eb2859a55e18a85",
+"created_at": 1741369938.0,
 "error": null,
 "incomplete_details": null,
 "instructions": null,
 "metadata": {},
-"model": "gpt-4o",
+"model": "gpt-4o-2024-08-06",
 "object": "response",
 "output": [
 {
-"id": "msg_680915b5c8dc8190b21a72a55830fea900400b1e6ec996fc",
+"id": "msg_67cb3252cfac8190865744873aada798",
 "content": [
 {
 "annotations": [],
-"text": "It looks like you're testing out how this works! How can I assist you today?",
+"text": "Great! How can I help you today?",
 "type": "output_text"
 }
 ],
 "role": "assistant",
-"status": "completed",
+"status": null,
 "type": "message"
 }
 ],
-"parallel_tool_calls": true,
+"output_text": "Great! How can I help you today?",
+"parallel_tool_calls": null,
 "temperature": 1.0,
-"tool_choice": "auto",
+"tool_choice": null,
 "tools": [],
 "top_p": 1.0,
 "max_output_tokens": null,
 "previous_response_id": null,
-"reasoning": {
-"effort": null,
-"generate_summary": null,
-"summary": null
-},
-"service_tier": null,
+"reasoning": null,
 "status": "completed",
-"text": {
-"format": {
-"type": "text"
-}
-},
-"truncation": "disabled",
+"text": null,
+"truncation": null,
 "usage": {
-"input_tokens": 12,
-"input_tokens_details": {
-"cached_tokens": 0
-},
-"output_tokens": 18,
+"input_tokens": 20,
+"output_tokens": 11,
 "output_tokens_details": {
 "reasoning_tokens": 0
 },
-"total_tokens": 30
+"total_tokens": 31
 },
 "user": null,
-"store": true
+"reasoning_effort": null
 }
 ```
 
 ---
 
-Unlike the chat completions API, the responses API is asynchronous. More complex requests may not be completed by the time that an initial response is returned by the API. This is similar to how the Assistants API handles [thread/run status](/azure/ai-services/openai/how-to/assistant#retrieve-thread-status).
-
-Note in the response output that the response object contains a `status` which can be monitored to determine when the response is finally complete. `status` can contain a value of `completed`, `failed`, `in_progress`, or `incomplete`.
-
-### Retrieve an individual response status
-
-In the previous Python examples we created a variable `response_id` and set it equal to the `response.id` of our `client.response.create()` call. We can then pass client.response.retrieve() to pull the current status of our response.
-
-```python
-
-retrieve_response = client.responses.retrieve(response_id)
-print(retrieve_response.status)
-```
-
-### Monitor response status
-
-Depending on the complexity of your request it isn't uncommon to have an initial response with a status of `in_progress` with message output not yet generated. In that case you can create a loop to monitor the status of the response with code. The example below is for demonstration purposes only and is intended to be run in a Jupyter notebook. This code assumes you have already run the two previous Python examples and the Azure OpenAI client as well as `retrieve_response` have already been defined:
-
-```python
-import time
-from IPython.display import clear_output
-
-start_time = time.time()
-
-status = retrieve_response.status
-
-while status not in ["completed", "failed", "incomplete"]:
-    time.sleep(5)
-    retrieve_response = client.responses.retrieve(response_id)
-    print("Elapsed time: {} minutes {} seconds".format(int((time.time() - start_time) // 60), int((time.time() - start_time) % 60)))
-    status = retrieve_response.status
-    print(f'Status: {status}')
-    clear_output(wait=True)
-
-print(f'Status: {status}')
-print("Elapsed time: {} minutes {} seconds".format(int((time.time() - start_time) // 60), int((time.time() - start_time) % 60)))
-print(retrieve_response.model_dump_json(indent=2))
-```
-
 ## Retrieve a response
 
 To retrieve a response from a previous call to the responses API.
@@ -678,7 +605,7 @@ print(response.model_dump_json(indent=2))
 
 ## Image input
 
-There's a known issue with image url based image input. Currently only base64 encoded images are supported.
+There is a known issue with image url based image input. Currently only base64 encoded images are supported.
 
 ### Image url
 
@@ -958,7 +885,7 @@ async def take_screenshot(page):
 return last_successful_screenshot
 ```
 
-This function captures the current browser state as an image and returns it as a base64-encoded string, ready to be sent to the model. We'll constantly do this in a loop after each step allowing the model to see if the command it tried to execute was successful or not, which then allows it to adjust based on the contents of the screenshot. We could let the model decide if it needs to take a screenshot, but for simplicity we'll force a screenshot to be taken for each iteration.
+This function captures the current browser state as an image and returns it as a base64-encoded string, ready to be sent to the model. We'll constantly do this in a loop after each step allowing the model to see if the command it tried to execute was successful or not, which then allows it to adjust based on the contents of the screenshot. We could let the model decide if it needs to take a screenshot, but for simplicity we will force a screenshot to be taken for each iteration.
 
 ### Model response processing
 