You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
title: Understanding Confidence Scores in Azure AI Content Understanding
2
+
title: Understanding Confidence Scores in Azure AI Content Understanding.
3
3
titleSuffix: Azure AI services
4
-
description: Learn about confidence score use-cases, and tips to improve.
5
-
author: admaheshwari
6
-
ms.author: lajanuar
4
+
description: Best practices to interpret and improve Azure AI Content Understanding accuracy and confidence scores.
5
+
author: laujan
6
+
ms.author: admaheshwari
7
7
manager: nitinme
8
8
ms.service: azure-ai-content-understanding
9
9
ms.topic: overview
10
10
ms.date: 02/20/2025
11
-
ms.custom: ignite-2024-understanding-release
12
-
13
-
#customer intent: As a user, I want to learn more about Content Understanding confidence scores.
14
11
---
15
-
# Confidence Scores in Azure AI Content Understanding
12
+
13
+
# Interpret and improve accuracy and confidence scores
14
+
15
+
A confidence score indicates probability by measuring the degree of statistical certainty that the extracted result is detected correctly. The estimated accuracy is calculated by running a few different combinations of the training data to predict the labeled values. In this article, we share how to interpret accuracy and confidence scores and best practices for using those scores to improve accuracy and confidence results.
16
+
17
+
16
18
Understanding Confidence Scores
17
19
What are confidence scores?
18
20
Confidence scores represent the probability that the extracted result is correct. For example, a confidence score of 0.95 (95%) suggests that the prediction is likely correct 19 out of 20 times. These scores are derived from various factors, including the quality of the input document, the similarity between the training data and the document being analyzed, and the model's ability to recognize patterns and features in the document.
@@ -23,18 +25,18 @@ Confidence scores are supported for extractive fields, including text, tables fo
Thresholds for confidence scores are predefined values that determine whether a prediction is considered reliable or requires further review. These thresholds can be set across different modalities to ensure consistent and accurate results. Setting appropriate thresholds is important because it helps balance the trade-off between automation and accuracy. By setting the right thresholds, users can ensure that only high-confidence predictions are automated, while low-confidence predictions are flagged for human review. This helps improve the overall accuracy and reliability of the predictions
40
42
@@ -47,13 +49,13 @@ Human in the Loop (HITL) is a process that involves human intervention in the mo
47
49
It can improved accuracy and reliability of the predictions, reduced errors, and enhanced overall quality of the results.
48
50
49
51
How can customers access confidence score in CU?
50
-
For every field extraction, confidence score is listed as part of the field extraction output. You can also check confidence score as part of your JSON output under “confidence”
52
+
For every field extraction, confidence score is listed as part of the field extraction output. You can also check confidence score as part of your JSON output under "confidence"
51
53
52
54
Tips to improve confidence score
53
-
1.Correcting an expected output so that the model can understand the definition better. Example: Here we can see the confidence score is 12%, to improve confidence score, we can go to label data, select auto label which will give us predicted field labels. Now we can correct our definition and it will show corrected field label. Test the analyzer again for better confidence score. Here it jumped to 98%. Confidence improvement will vary as per the complexity and nature of document.
55
+
1.Correcting an expected output so that the model can understand the definition better. Example: Here we can see the confidence score is 12%, to improve confidence score, we can go to label data, select auto label which will give us predicted field labels. Now we can correct our definition and it will show corrected field label. Test the analyzer again for better confidence score. Here it jumped to 98%. Confidence improvement will vary as per the complexity and nature of document.
54
56
55
-
2.Adding more samples and label them for different variation and templates the model may expect.
56
-
3.Add documents that contains various input values for the schema you want to extract.
57
-
4.Improve the quality of your input documents.
58
-
5.Incorporate human in the loop for lower confidence results.
57
+
2.Adding more samples and label them for different variation and templates the model may expect.
58
+
3.Add documents that contains various input values for the schema you want to extract.
59
+
4.Improve the quality of your input documents.
60
+
5.Incorporate human in the loop for lower confidence results.
59
61
Note: Confidence score is only available for document modality in the preview. For other modalities it will be added soon.
0 commit comments