Merge pull request #268550 from vkurpad/main

prmerger-automator[bot] · web-flow · commit 3583c75ccc0d · 2024-03-11T15:19:21.000Z
Update to Feb2024 release docs
diff --git a/articles/ai-services/document-intelligence/concept-accuracy-confidence.md b/articles/ai-services/document-intelligence/concept-accuracy-confidence.md
@@ -18,11 +18,11 @@ ms.author: lajanuar
 
 > [!NOTE]
 >
-> * **Custom neural models do not provide accuracy scores during training**.
-> * Confidence scores for structured fields such as tables are currently unavailable.
+> * **Custom neural models** do not provide accuracy scores during training.
+> * Confidence scores for tables, table rows and table cells are available starting with the **2024-02-29-preview** API version for **custom models**.
 
 
-Custom models generate an estimated accuracy score when trained. Documents analyzed with a custom model produce a confidence score for extracted fields. In this article, learn to interpret accuracy and confidence scores and best practices for using those scores to improve accuracy and confidence results.
+Custom template models generate an estimated accuracy score when trained. Documents analyzed with a custom model produce a confidence score for extracted fields. In this article, learn to interpret accuracy and confidence scores and best practices for using those scores to improve accuracy and confidence results.
 
 ## Accuracy scores
 
@@ -38,21 +38,25 @@ The accuracy value range is a percentage between 0% (low) and 100% (high). The e
 
 > [!NOTE]
 >
-> * **Table cell confidence scores are now included with the 2024-02-29-preview API version**.
+> * **Table, row and cell confidence scores are now included with the 2024-02-29-preview API version**.
 > * Confidence scores for table cells from custom models is added to the API starting with the 2024-02-29-preview API.
 
 Document Intelligence analysis results return an estimated confidence for predicted words, key-value pairs, selection marks, regions, and signatures. Currently, not all document fields return a confidence score.
 
 Field confidence indicates an estimated probability between 0 and 1 that the prediction is correct. For example, a confidence value of 0.95 (95%) indicates that the prediction is likely correct 19 out of 20 times. For scenarios where accuracy is critical, confidence can be used to determine whether to automatically accept the prediction or flag it for human review.
 
-Confidence scores have two data points: the field level confidence score and the text extraction confidence score. In addition to the field confidence of position and span, the text extraction confidence in the ```pages``` section of the response is the model's confidence in the text extraction (OCR) process. The two confidence scores should be combined to generate one overall confidence score.
-
 **Document Intelligence Studio** </br>
 **Analyzed invoice prebuilt-invoice model**
 
 :::image type="content" source="media/accuracy-confidence/confidence-scores.png" alt-text="confidence scores from Document Intelligence Studio":::
 
-## Interpret accuracy and confidence scores
+## Interpret accuracy and confidence scores for custom models
+
+When interpreting the confidence score from a custom model, you should consider all the confidence scores returned from the model. Let's start with a list of all the confidence scores.
+1. **Document type confidence score**: The document type confidence is an indicator of closely the analyzed document resembleds documents in the training dataset. When the document type confidence is low, this is indicative of template or structural variations in the analyzed document. To improve the document type confidence, label a document with that specific variation and add it to your training dataset. Once the model is re-trained, it should be better equipped to handl that class of variations.
+2. **Field level confidence**: Each labled field extracted has an associated confidence score. This score reflects the model's confidence on the position of the value extracted. While evaluating the confidence you should also look at the underlying extraction confidence to generate a comprehensive confidence for the extracted result. Evaluate the OCR results for text extraction or selection marks depending on the field type to generate a composite confidence score for the field.
+3. **Word confidence score** Each word extracted within the document has an associated confidence score. The score represents the confidence of the transcription. The pages array contains an array of words, each word has an associated span and confidence. Spans from the custom field extracted values will match the spans of the extracted words.
+4. **Selection mark confidence score**: The pages array also contains an array of selection marks, each selection mark has a confidence score representing the confidence of the seletion mark and selection state detection. When a labeled field is a selection mark, the custom field selection confidence combined with the selection mark confidence is an accurate representation of the overall confidence that the field was extracted correctly.
 
 The following table demonstrates how to interpret both the accuracy and confidence scores to measure your custom model's performance.
 
@@ -65,7 +69,7 @@ The following table demonstrates how to interpret both the accuracy and confiden
 
 ## Table, row, and cell confidence
 
-With the addition of table, row and cell confidence with the ```2024-02-29-preview``` API, here are some common questions that should help with interpreting the scores:
+With the addition of table, row and cell confidence with the ```2024-02-29-preview``` API, here are some common questions that should help with interpreting the table, row and cell scores:
 
 **Q:** Is it possible to see a high confidence score for cells, but a low confidence score for the row?<br>
 
diff --git a/articles/ai-services/document-intelligence/concept-custom-classifier.md b/articles/ai-services/document-intelligence/concept-custom-classifier.md
@@ -66,7 +66,7 @@ With custom models, you need to maintain access to the training dataset to updat
 
 > [!IMPORTANT]
 >
-> Incremental trainiing is only supported with models trained with the same API version. If you are trying to extend a model, use the API version the original model was trained with to extend the model.
+> Incremental training is only supported with models trained with the same API version. If you are trying to extend a model, use the API version the original model was trained with to extend the model. Incremental training is only supported with API version **2024-02-29-preview** or later.
 
 Incremental training requires that you provide the original model ID as the `baseClassifierId`. See [incremental training](concept-incremental-classifier.md) to learn more about how to use incremental training.
 
diff --git a/articles/ai-services/document-intelligence/concept-custom-neural.md b/articles/ai-services/document-intelligence/concept-custom-neural.md
@@ -65,7 +65,24 @@ Neural models support documents that have the same information, but different pa
 
 *See* our [Language Support—custom models](language-support-custom.md) page for a complete list of supported languages.
 
-## Tabular fields
+## Overlapping fields
+
+With the release of API versions **2024-02-29-preview** and  later, custom neural models will support overlapping fields:
+
+To use the overlapping fields, your dataset needs to contain at least one sample with the expected overlap. To label an overlap, use **region labeling** to designate each of the spans of content (with the overlap) for each field. Labeling an overlap with field selection (highlighting a value) will fail in the studio as region labeling is the only supported labeling tool for indicating field overlaps. Overlap support includes:
+
+* Complete overlap. The same set of tokens are labeled for two different fields.
+* Partial overlap. Some tokens belong to both fields, but there are tokens that are only part of one field or the other.
+
+Overlapping fields have some limits:
+
+* Any token or word can only be labeled as two fields.
+* overlapping fields in a table can't span table rows.
+* Overlapping fields can only be recognized if at least one sample in the dataset contains overlapping labels for those fields.
+
+To use overlapping fields, label your dataset with the overlaps and train the model with the API version ```2024-02-29-preview``` or later.
+
+## Tabular fields adds table, row and cell confidence
 
 With the release of API versions **2022-06-30-preview** and  later, custom neural models will support tabular fields (tables):
 
@@ -92,23 +109,6 @@ Tabular fields provide **table, row and cell confidence** starting with the ```2
 
 See  [confidence and accuracy scores](concept-accuracy-confidence.md) to learn more about table, row, and cell confidence.
 
-## Overlapping fields
-
-With the release of API versions **2024-02-29-preview** and  later, custom neural models will support overlapping fields:
-
-To use the overlapping fields, your dataset needs to contain at least one sample with the expected overlap. To label an overlap, use region labeling to designate each of the spans of content (with the overlap) for each field. Overlap support includes:
-
-* Complete overlap. The same set of tokens are labeled for two different fields.
-* Partial overlap. Some tokens belong to both fields, but there are tokens that are only part of one field or the other.
-
-Overlapping fields have some limits:
-
-* Any token or word can only be labeled as two fields.
-* overlapping fields in a table can't span table rows.
-* Overlapping fields can only be recognized if at least one sample in the dataset contains overlapping labels for those fields.
-
-To use overlapping fields, label your dataset with the overlaps and train the model with the API version ```2024-02-29-preview``` or later.
-
 
 ## Supported regions
 
diff --git a/articles/ai-services/document-intelligence/concept-custom.md b/articles/ai-services/document-intelligence/concept-custom.md
@@ -51,7 +51,7 @@ To create a custom extraction model, label a dataset of documents with the value
 
 > [!IMPORTANT]
 >
-> Starting with version 3.1—2024-02-29-preview API, custom neural models now support overlapping fields and table, row and cell level confidence.
+> Starting with version 4.0 — 2024-02-29-preview API, custom neural models now support **overlapping fields** and **table, row and cell level confidence**.
 >
 
 The custom neural (custom document) model uses deep learning models and  base model trained on a large collection of documents. This model is then fine-tuned or adapted to your data when you train the model with a labeled dataset. Custom neural models support structured, semi-structured, and unstructured documents to extract fields. Custom neural models currently support English-language documents. When you're choosing between the two model types, start with a neural model to determine if it meets your functional needs. See [neural models](concept-custom-neural.md) to learn more about custom document models.
@@ -219,10 +219,10 @@ For a detailed walkthrough to create your first custom extraction model, *see* [
 
 This table compares the supported data extraction areas:
 
-|Model| Form fields | Selection marks | Structured fields (Tables) | Signature | Region labeling |
-|--|:--:|:--:|:--:|:--:|:--:|
-|Custom template| ✔ | ✔ | ✔ | ✔ | ✔ |
-|Custom neural| ✔| ✔ | ✔ | **n/a** | ***** |
+|Model| Form fields | Selection marks | Structured fields (Tables) | Signature | Region labeling | Overlapping fields |
+|--|:--:|:--:|:--:|:--:|:--:|:--:|
+|Custom template| ✔ | ✔ | ✔ | ✔ | ✔ | **n/a** |
+|Custom neural| ✔| ✔ | ✔ | **n/a** | * | ✔ (2024-02-29-preview) |
 
 **Table symbols**:<br>
 ✔—Supported<br>
@@ -268,27 +268,6 @@ The following table describes the features available with the associated tools a
 
 *See* our [Language Support—custom models](language-support-custom.md) page for a complete list of supported languages.
 
-### Try signature detection
-
-* **Custom model v4.0, v3.1 and v3.0 APIs** supports signature detection for custom forms. When you train custom models, you can specify certain fields as signatures. When a document is analyzed with your custom model, it indicates whether a signature was detected or not.
-* [Document Intelligence v3.1 migration guide](v3-1-migration-guide.md): This guide shows you how to use the v3.0 version in your applications and workflows.
-* [REST API](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-2023-07-31&preserve-view=true&tabs=HTTP): This API shows you more about the v3.0 version and new capabilities.
-
-1. Build your training dataset.
-
-1. Go to [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio). Under **Custom models**, select **Custom form**.
-
-    :::image type="content" source="media/label-tool/select-custom-form.png" alt-text="Screenshot that shows selecting the Document Intelligence Studio Custom form page.":::
-
-1. Follow the workflow to create a new project:
-
-   * Follow the **Custom model** input requirements.
-
-   * Label your documents. For signature fields, use **Region** labeling for better accuracy.
-
-      :::image type="content" source="media/label-tool/signature-label-region-too.png" alt-text="Screenshot that shows the Label signature field.":::
-
-After your training set is labeled, you can train your custom model and use it to analyze documents. The signature fields specify whether a signature was detected or not.
 
 ## Next steps
 
diff --git a/articles/ai-services/document-intelligence/concept-model-overview.md b/articles/ai-services/document-intelligence/concept-model-overview.md
@@ -45,7 +45,7 @@ ms.author: lajanuar
 
 The following table shows the available models for each current preview and stable API:
 
-|**Model Type**| **Model**|&bullet; [2024-02-29-preview](/rest/api/aiservices/document-models/build-model?view=rest-aiservices-2024-02-29-preview&preserve-view=true&branch=docintelligence&tabs=HTTP) <br>&bullet [2023-10-31-preview](/rest/api/aiservices/operation-groups?view=rest-aiservices-2024-02-29-preview&preserve-view=true)|[2023-07-31 (GA)](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-2023-07-31&preserve-view=true&tabs=HTTP)|[2022-08-31 (GA)](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2022-08-31/operations/AnalyzeDocument)|[v2.1 (GA)](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1/operations/AnalyzeBusinessCardAsync)|
+|**Model Type**| **Model**|&bullet; [2024-02-29-preview](/rest/api/aiservices/document-models/build-model?view=rest-aiservices-2024-02-29-preview&preserve-view=true&branch=docintelligence&tabs=HTTP) <br> &bullet [2023-10-31-preview](/rest/api/aiservices/operation-groups?view=rest-aiservices-2024-02-29-preview&preserve-view=true)|[2023-07-31 (GA)](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-2023-07-31&preserve-view=true&tabs=HTTP)|[2022-08-31 (GA)](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2022-08-31/operations/AnalyzeDocument)|[v2.1 (GA)](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1/operations/AnalyzeBusinessCardAsync)|
 |----------------|-----------|---|--|---|---|
 |Document analysis models|[Read](concept-read.md)                                  | ✔️| ✔️| ✔️| n/a|
 |Document analysis models|[Layout](concept-layout.md)                              | ✔️| ✔️| ✔️| ✔️|
@@ -61,11 +61,14 @@ The following table shows the available models for each current preview and stab
 |Prebuilt models|[US 1098-T Tax](concept-tax-document.md)                 | ✔️| ✔️| n/a| n/a|
 |Prebuilt models|[US 1099 Tax](concept-tax-document.md)                 | ✔️| n/a| n/a| n/a|
 |Prebuilt models|[US W2 Tax](concept-tax-document.md)                     | ✔️| ✔️| ✔️| n/a|
-|Prebuilt models|[Add-on capabilities](concept-add-on-capabilities.md)    | ✔️| ✔️| n/a| n/a|
+|Prebuilt models|[US Mortgage 1003 URLA](concept-mortgage-documents.md)    | ✔️| n/a| n/a| n/a|
+|Prebuilt models|[US Mortgage 1008 ](concept-mortgage-documents.md)       | ✔️| n/a| n/a| n/a|
+|Prebuilt models|[US Mortgage closing disclosure](concept-mortgage-documents.md)   | ✔️| n/a| n/a| n/a|
 |Custom models|[Custom classifier](concept-custom-classifier.md)        | ✔️| ✔️| n/a| n/a|
 |Custom models|[Custom neural](concept-custom-neural.md)                | ✔️| ✔️| ✔️| n/a|
 |Custom models|[Custom template](concept-custom-template.md)            | ✔️| ✔️| ✔️| ✔️|
 |Custom models|[Custom composed](concept-composed-models.md)            | ✔️| ✔️| ✔️| ✔️|
+|All models|[Add-on capabilities](concept-add-on-capabilities.md)    | ✔️| ✔️| n/a| n/a|
 
 |**Add-on Capability**| **Add-On/Free**|&bullet; [2024-02-29-preview](/rest/api/aiservices/document-models/build-model?view=rest-aiservices-2024-02-29-preview&preserve-view=true&branch=docintelligence&tabs=HTTP) <br>&bullet [2023-10-31-preview](/rest/api/aiservices/operation-groups?view=rest-aiservices-2024-02-29-preview&preserve-view=true|[`2023-07-31` (GA)](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-2023-07-31&preserve-view=true&tabs=HTTP)|[`2022-08-31` (GA)](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2022-08-31/operations/AnalyzeDocument)|[v2.1 (GA)](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1/operations/AnalyzeBusinessCardAsync)|
 |----------------|-----------|---|--|---|---|
diff --git a/articles/ai-services/document-intelligence/concept-mortgage-documents.md b/articles/ai-services/document-intelligence/concept-mortgage-documents.md
@@ -1,7 +1,7 @@
 ---
-title: Document Intelligence US mortgage document
+title: Document Intelligence US mortgage documents
 titleSuffix: Azure AI services
-description: Use Document Intelligence mortgage model to analyze and extract key fields from mortgage documents.
+description: Use Document Intelligence prebuilt models to analyze and extract key fields from mortgage documents.
 author: laujan
 manager: nitinme
 ms.service: azure-ai-document-intelligence
@@ -17,21 +17,18 @@ monikerRange: '>=doc-intel-4.0.0'
 <!-- markdownlint-disable MD049 -->
 <!-- markdownlint-disable MD001 -->
 
-# Document Intelligence mortgage documents model
+# Document Intelligence mortgage document models
 
 **This content applies to:** ![checkmark](media/yes-icon.png) **v4.0 (preview)** ![checkmark](media/yes-icon.png)
 
-The Document Intelligence Mortgage model uses powerful Optical Character Recognition (OCR) capabilities to analyze and extract key fields from mortgage documents. Mortgage documents can be of various formats and quality including. The API analyzes document text from mortgage documents and returns a structured JSON data representation. The model currently supports English-language document formats.
+The Document Intelligence Mortgage models use powerful Optical Character Recognition (OCR) capabilities and deep learning models to analyze and extract key fields from mortgage documents. Mortgage documents can be of various formats and quality. The API analyzes mortgage documents and returns a structured JSON data representation. The models currently support English-language documents only.
 
 **Supported document types:**
 
 * 1003 End-User License Agreement (EULA)
 * Form 1008
 * Mortgage closing disclosure
 
-## Automated mortgage documents processing
-
-Automated mortgage  card processing is the process of extracting key  fields from bank cards. Historically, bank card analysis process is achieved manually and, hence, very time consuming. Accurate extraction of key data from bank cards s is typically the first and one of the most critical steps in the contract automation process.
 
 ## Development options
 
@@ -48,7 +45,7 @@ Document Intelligence v4.0 (2024-02-29-preview) supports the following tools, ap
 
 [!INCLUDE [input requirements](./includes/input-requirements.md)]
 
-## Try mortgage document data extraction
+## Try mortgage documents data extraction
 
 To see how data extraction works for the mortgage documents service, you need the following resources:
 
diff --git a/articles/ai-services/document-intelligence/whats-new.md b/articles/ai-services/document-intelligence/whats-new.md

Original file line number	Diff line number	Diff line change
`@@ -66,7 +66,7 @@ With custom models, you need to maintain access to the training dataset to updat`
`66`	`66`
`67`	`67`	`> [!IMPORTANT]`
`68`	`68`	`>`
`69`		`-> Incremental trainiing is only supported with models trained with the same API version. If you are trying to extend a model, use the API version the original model was trained with to extend the model.`
	`69`	`+> Incremental training is only supported with models trained with the same API version. If you are trying to extend a model, use the API version the original model was trained with to extend the model. Incremental training is only supported with API version 2024-02-29-preview or later.`
`70`	`70`
`71`	`71`	Incremental training requires that you provide the original model ID as the `baseClassifierId`. See [incremental training](concept-incremental-classifier.md) to learn more about how to use incremental training.
`72`	`72`