Merge pull request #270667 from mrbullwinkle/mrb_03_29_2024_monitoring_update

prmerger-automator[bot] · web-flow · commit 979fd0b2d7b7 · 2024-03-29T19:30:25.000Z
[Azure OpenAI] Monitoring update
diff --git a/articles/ai-services/openai/how-to/monitoring.md b/articles/ai-services/openai/how-to/monitoring.md
@@ -6,7 +6,7 @@ ms.author: mbullwin
 ms.service: azure-ai-openai
 ms.topic: how-to
 ms.custom: subject-monitoring
-ms.date: 11/14/2023
+ms.date: 03/29/2024
 ---
 
 # Monitoring Azure OpenAI Service
@@ -47,6 +47,10 @@ These are legacy metrics that are common to all Azure AI Services resources. We
 
 ### Azure OpenAI Metrics
 
+> [!NOTE]
+> The **Provisioned-managed Utilization** metric is now deprecated and is no longer recommended. This metric has been replaced by the **Provisioned-managed Utilization V2** metric.
+
+
 The following table summarizes the current subset of metrics available in Azure OpenAI.
 
 |Metric|Category|Aggregation|Description|Dimensions|
@@ -56,7 +60,7 @@ The following table summarizes the current subset of metrics available in Azure
 | `Processed FineTuned Training Hours` | Usage |Sum| Number of Training Hours Processed on an OpenAI FineTuned Model |  `ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
 | `Processed Inference Tokens` | Usage | Sum|  Number of inference tokens processed by an OpenAI model. Calculated as prompt tokens (input) + generated tokens. Applies to PayGo, PTU, and PTU-manged SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
 | `Processed Prompt Tokens` | Usage | Sum | Total number of prompt tokens (input) processed on an OpenAI model. Applies to PayGo, PTU, and PTU-managed SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
-| `Provision-managed Utilization` | Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. | `ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
+| `Provision-managed Utilization V2` | Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. | `ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
 
 ## Configure diagnostic settings