Skip to content

Commit 4a29bac

Browse files
committed
117280-public-pr-edit-for-sign-off
1 parent 44353f6 commit 4a29bac

File tree

2 files changed

+44
-14
lines changed

2 files changed

+44
-14
lines changed

articles/ai-services/document-intelligence/concept-add-on-capabilities.md

Lines changed: 36 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.service: azure-ai-document-intelligence
88
ms.custom:
99
- ignite-2023
1010
ms.topic: conceptual
11-
ms.date: 11/15/2023
11+
ms.date: 11/21/2023
1212
ms.author: lajanuar
1313
monikerRange: '>=doc-intel-3.1.0'
1414
---
@@ -36,23 +36,27 @@ monikerRange: '>=doc-intel-3.1.0'
3636

3737
Document Intelligence supports more sophisticated and modular analysis capabilities. Use the add-on features to extend the results to include more features extracted from your documents. Some add-on features incur an extra cost. These optional features can be enabled and disabled depending on the scenario of the document extraction. The following add-on capabilities are available for `2023-07-31 (GA)` and later releases:
3838

39-
* [`ocr.highResolution`](#high-resolution-extraction)
39+
* [`ocrHighResolution`](#high-resolution-extraction)
4040

41-
* [`ocr.formula`](#formula-extraction)
41+
* [`formulas`](#formula-extraction)
4242

43-
* [`ocr.font`](#font-property-extraction)
43+
* [`styleFont`](#font-property-extraction)
44+
45+
* [`barcodes`](#barcode-property-extraction)
46+
47+
* [`languages`](#language-detection)
4448

45-
* [`ocr.barcode`](#barcode-property-extraction)
4649
:::moniker-end
4750

4851
:::moniker range="doc-intel-4.0.0"
4952

5053
> [!NOTE]
5154
>
52-
> Add-on capabilities are available within all models except for the [Read model](concept-read.md).
55+
> Not all add-on capabilities are supported by all models. For more information, *see* [model data extraction](concept-model-overview.md#model-data-extraction).
5356
5457
The following add-on capability is available for `2023-10-31-preview` and later releases:
5558

59+
* [`keyValuePairs`](#key-value-pairs)
5660
* [`queryFields`](#query-fields)
5761

5862
> [!NOTE]
@@ -159,8 +163,33 @@ The `ocr.barcode` capability extracts all identified barcodes in the `barcodes`
159163
| `ITF` |:::image type="content" source="media/barcodes/interleaved-two-five.png" alt-text="Screenshot of the interleaved-two-of-five barcode (ITF).":::|
160164
| `Data Matrix` |:::image type="content" source="media/barcodes/datamatrix.gif" alt-text="Screenshot of the Data Matrix.":::|
161165

166+
## Language detection
167+
168+
It predicts the detected primary language for each text line along with the `confidence` in the `languages` collection under `analyzeResult`.
169+
170+
```json
171+
"languages": [
172+
{
173+
"spans": [
174+
{
175+
"offset": 0,
176+
"length": 131
177+
}
178+
],
179+
"locale": "en",
180+
"confidence": 0.7
181+
},
182+
]
183+
```
184+
162185
:::moniker range="doc-intel-4.0.0"
163186

187+
## Key-value Pairs
188+
189+
Key-value pairs are specific spans within the document that identify a label or key and its associated response or value. In a structured form, these pairs could be the label and the value the user entered for that field. In an unstructured document, they could be the date a contract was executed on based on the text in a paragraph. The AI model is trained to extract identifiable keys and values based on a wide variety of document types, formats, and structures.
190+
191+
Keys can also exist in isolation when the model detects that a key exists, with no associated value or when processing optional fields. For example, a middle name field can be left blank on a form in some instances. Key-value pairs are spans of text contained in the document. For documents where the same value is described in different ways, for example, customer/user, the associated key is either customer or user (based on context).
192+
164193
## Query Fields
165194

166195
* Document Intelligence now supports query field extractions. With query field extraction, you can add fields to the extraction process using a query request without the need for added training.
@@ -185,7 +214,7 @@ For query field extraction, specify the fields you want to extract and Document
185214

186215
:::image type="content" source="media/studio/query-fields.png" alt-text="Screenshot of the query fields button in Document Intelligence Studio.":::
187216

188-
* You can pass a list of field labels like `Party1`, `Party2`, `TermsOfUse`, `PaymentTerms`, `PaymentDate`, and `TermEndDate`" as part of the analyze document request.
217+
* You can pass a list of field labels like `Party1`, `Party2`, `TermsOfUse`, `PaymentTerms`, `PaymentDate`, and `TermEndDate`" as part of the `analyze document` request.
189218

190219
:::image type="content" source="media/studio/query-field-select.png" alt-text="Screenshot of query fields selection window in Document Intelligence Studio.":::
191220

articles/ai-services/document-intelligence/concept-model-overview.md

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -88,12 +88,15 @@ The following table shows the available models for each current preview and stab
8888
| [Custom classification model](#custom-classifier)| The **Custom classification model** can classify each page in an input file to identify the document(s) within and can also identify multiple documents or multiple instances of a single document within an input file.
8989
| [Composed models](#composed-models) | Combine several custom models into a single model to automate processing of diverse document types with a single composed model.
9090

91-
For all models, except Business card model, Document Intelligence now supports add-on capabilities to allow for more sophisticated analysis. These optional capabilities can be enabled and disabled depending on the scenario of the document extraction. There are four add-on capabilities available for the `2023-07-31` (GA) and later API version:
91+
For all models, except Business card model, Document Intelligence now supports add-on capabilities to allow for more sophisticated analysis. These optional capabilities can be enabled and disabled depending on the scenario of the document extraction. There are seven add-on capabilities available for the `2023-07-31` (GA) and later API version:
9292

93-
* [`ocr.highResolution`](concept-add-on-capabilities.md#high-resolution-extraction)
94-
* [`ocr.formula`](concept-add-on-capabilities.md#formula-extraction)
95-
* [`ocr.font`](concept-add-on-capabilities.md#font-property-extraction)
96-
* [`ocr.barcode`](concept-add-on-capabilities.md#barcode-property-extraction)
93+
* [`ocrHighResolution`](concept-add-on-capabilities.md#high-resolution-extraction)
94+
* [`formulas`](concept-add-on-capabilities.md#formula-extraction)
95+
* [`styleFont`](concept-add-on-capabilities.md#font-property-extraction)
96+
* [`barcodes`](concept-add-on-capabilities.md#barcode-property-extraction)
97+
* [`languages`](concept-add-on-capabilities.md#language-detection)
98+
* [`keyValuePairs`](concept-add-on-capabilities.md#key-value-pairs) (2023-10-31-preview)
99+
* [`queryFields`](concept-add-on-capabilities.md#query-fields) (2023-31-preview)
97100

98101
## Analysis features
99102

@@ -146,7 +149,6 @@ The Layout analysis model analyzes and extracts text, tables, selection marks, a
146149
>
147150
> [Learn more: layout model](concept-layout.md)
148151
149-
150152
### Health insurance card
151153

152154
:::image type="icon" source="media/studio/health-insurance-logo.png":::
@@ -174,7 +176,6 @@ The US tax document models analyze and extract key fields and line items from a
174176
|US Tax 1098-T|Extract qualified tuition details.|**prebuilt-tax.us.1098T**|
175177
|US Tax 1099|Extract Information from 1099 forms.|**prebuilt-tax.us.1099(variations)**|
176178

177-
178179
***Sample W-2 document processed using [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=tax.us.w2)***:
179180

180181
:::image type="content" source="./media/studio/w-2.png" alt-text="Screenshot of a sample W-2.":::

0 commit comments

Comments
 (0)