Commit 979fd0b

Merge pull request #270667 from mrbullwinkle/mrb_03_29_2024_monitoring_update
[Azure OpenAI] Monitoring update
2 parents a68fd4e + 7e67966 commit 979fd0b

1 file changed: +6 -2 lines changed

articles/ai-services/openai/how-to/monitoring.md

Lines changed: 6 additions & 2 deletions
@@ -6,7 +6,7 @@ ms.author: mbullwin
 ms.service: azure-ai-openai
 ms.topic: how-to
 ms.custom: subject-monitoring
-ms.date: 11/14/2023
+ms.date: 03/29/2024
 ---
 
 # Monitoring Azure OpenAI Service
@@ -47,6 +47,10 @@ These are legacy metrics that are common to all Azure AI Services resources. We
 
 ### Azure OpenAI Metrics
 
+> [!NOTE]
+> The **Provisioned-managed Utilization** metric is now deprecated and is no longer recommended. This metric has been replaced by the **Provisioned-managed Utilization V2** metric.
+
+
 The following table summarizes the current subset of metrics available in Azure OpenAI.
 
 |Metric|Category|Aggregation|Description|Dimensions|
@@ -56,7 +60,7 @@ The following table summarizes the current subset of metrics available in Azure
 | `Processed FineTuned Training Hours` | Usage |Sum| Number of Training Hours Processed on an OpenAI FineTuned Model | `ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
 | `Processed Inference Tokens` | Usage | Sum| Number of inference tokens processed by an OpenAI model. Calculated as prompt tokens (input) + generated tokens. Applies to PayGo, PTU, and PTU-manged SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
 | `Processed Prompt Tokens` | Usage | Sum | Total number of prompt tokens (input) processed on an OpenAI model. Applies to PayGo, PTU, and PTU-managed SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
-| `Provision-managed Utilization` | Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. | `ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
+| `Provision-managed Utilization V2` | Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. | `ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
 
 ## Configure diagnostic settings

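For reference, the utilization formula in the `Provision-managed Utilization V2` row, (PTUs consumed/PTUs deployed)*100 with throttling at or above 100%, can be sketched as plain arithmetic. This is only an illustration of the calculation as the table describes it; the function names `utilization_percent` and `is_throttled` are hypothetical and are not part of any Azure SDK or of the documented metric pipeline.

```python
def utilization_percent(ptus_consumed: float, ptus_deployed: float) -> float:
    """Return utilization as a percentage: (PTUs consumed / PTUs deployed) * 100.

    Hypothetical helper that mirrors the formula in the metric description.
    """
    if ptus_deployed <= 0:
        raise ValueError("ptus_deployed must be positive")
    return (ptus_consumed / ptus_deployed) * 100


def is_throttled(utilization: float) -> bool:
    """Per the metric description, calls are throttled (HTTP 429) when
    utilization is at or above 100%."""
    return utilization >= 100.0


if __name__ == "__main__":
    # Example values are made up for illustration.
    pct = utilization_percent(ptus_consumed=75, ptus_deployed=100)
    print(pct, is_throttled(pct))
```

The metric itself is reported with the Average aggregation, so a value observed in Azure Monitor is an average over the selected time grain rather than a single instantaneous reading like the one computed here.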