articles/ai-services/openai/how-to/monitoring.md
6 additions & 2 deletions
@@ -6,7 +6,7 @@ ms.author: mbullwin
 ms.service: azure-ai-openai
 ms.topic: how-to
 ms.custom: subject-monitoring
-ms.date: 11/14/2023
+ms.date: 03/29/2024
 ---
 
 # Monitoring Azure OpenAI Service
@@ -47,6 +47,10 @@ These are legacy metrics that are common to all Azure AI Services resources. We
 
 ### Azure OpenAI Metrics
 
+> [!NOTE]
+> The **Provisioned-managed Utilization** metric is now deprecated and is no longer recommended. This metric has been replaced by the **Provisioned-managed Utilization V2** metric.
+
+
 The following table summarizes the current subset of metrics available in Azure OpenAI.
@@ -56,7 +60,7 @@ The following table summarizes the current subset of metrics available in Azure
 |`Processed FineTuned Training Hours`| Usage |Sum| Number of Training Hours Processed on an OpenAI FineTuned Model |`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
 |`Processed Inference Tokens`| Usage | Sum| Number of inference tokens processed by an OpenAI model. Calculated as prompt tokens (input) + generated tokens. Applies to PayGo, PTU, and PTU-manged SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
 |`Processed Prompt Tokens`| Usage | Sum | Total number of prompt tokens (input) processed on an OpenAI model. Applies to PayGo, PTU, and PTU-managed SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
-|`Provision-managed Utilization`| Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. |`ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
+|`Provision-managed Utilization V2`| Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. |`ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
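
The utilization formula in the changed row, (PTUs consumed/PTUs deployed)*100, together with the throttling threshold, can be illustrated with a small sketch. This is not part of the commit and not how the service computes the metric internally. The function names and sample values are hypothetical, chosen only to show the arithmetic described in the table.

```python
def utilization_percent(ptus_consumed: float, ptus_deployed: float) -> float:
    """Return (PTUs consumed / PTUs deployed) * 100, as described in the table row.

    Hypothetical helper for illustration; the real metric is emitted by the service.
    """
    if ptus_deployed <= 0:
        raise ValueError("ptus_deployed must be positive")
    return ptus_consumed / ptus_deployed * 100


def is_throttled(utilization: float) -> bool:
    """Per the table row, calls are throttled (HTTP 429) at or above 100% utilization."""
    return utilization >= 100.0


# Sample values (assumed, not taken from the source):
half_load = utilization_percent(150, 300)   # 50.0
full_load = utilization_percent(300, 300)   # 100.0
print(half_load, is_throttled(half_load))
print(full_load, is_throttled(full_load))
```

Because the threshold is "at or above 100%", the comparison uses `>=`, so a deployment exactly at capacity is already treated as throttled in this sketch.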