Skip to content

Commit 93a661c

Browse files
authored
Merge pull request #259298 from MicrosoftDocs/main
11/22 11:00 AM IST Publish
2 parents 6770483 + ed4481f commit 93a661c

File tree

479 files changed

+3204
-830
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

479 files changed

+3204
-830
lines changed

articles/ai-services/document-intelligence/concept-add-on-capabilities.md

Lines changed: 36 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.service: azure-ai-document-intelligence
88
ms.custom:
99
- ignite-2023
1010
ms.topic: conceptual
11-
ms.date: 11/15/2023
11+
ms.date: 11/21/2023
1212
ms.author: lajanuar
1313
monikerRange: '>=doc-intel-3.1.0'
1414
---
@@ -36,23 +36,27 @@ monikerRange: '>=doc-intel-3.1.0'
3636

3737
Document Intelligence supports more sophisticated and modular analysis capabilities. Use the add-on features to extend the results to include more features extracted from your documents. Some add-on features incur an extra cost. These optional features can be enabled and disabled depending on the scenario of the document extraction. The following add-on capabilities are available for `2023-07-31 (GA)` and later releases:
3838

39-
* [`ocr.highResolution`](#high-resolution-extraction)
39+
* [`ocrHighResolution`](#high-resolution-extraction)
4040

41-
* [`ocr.formula`](#formula-extraction)
41+
* [`formulas`](#formula-extraction)
4242

43-
* [`ocr.font`](#font-property-extraction)
43+
* [`styleFont`](#font-property-extraction)
44+
45+
* [`barcodes`](#barcode-property-extraction)
46+
47+
* [`languages`](#language-detection)
4448

45-
* [`ocr.barcode`](#barcode-property-extraction)
4649
:::moniker-end
4750

4851
:::moniker range="doc-intel-4.0.0"
4952

5053
> [!NOTE]
5154
>
52-
> Add-on capabilities are available within all models except for the [Read model](concept-read.md).
55+
> Not all add-on capabilities are supported by all models. For more information, *see* [model data extraction](concept-model-overview.md#model-data-extraction).
5356
5457
The following add-on capability is available for `2023-10-31-preview` and later releases:
5558

59+
* [`keyValuePairs`](#key-value-pairs)
5660
* [`queryFields`](#query-fields)
5761

5862
> [!NOTE]
@@ -159,8 +163,33 @@ The `ocr.barcode` capability extracts all identified barcodes in the `barcodes`
159163
| `ITF` |:::image type="content" source="media/barcodes/interleaved-two-five.png" alt-text="Screenshot of the interleaved-two-of-five barcode (ITF).":::|
160164
| `Data Matrix` |:::image type="content" source="media/barcodes/datamatrix.gif" alt-text="Screenshot of the Data Matrix.":::|
161165

166+
## Language detection
167+
168+
It predicts the detected primary language for each text line along with the `confidence` in the `languages` collection under `analyzeResult`.
169+
170+
```json
171+
"languages": [
172+
{
173+
"spans": [
174+
{
175+
"offset": 0,
176+
"length": 131
177+
}
178+
],
179+
"locale": "en",
180+
"confidence": 0.7
181+
},
182+
]
183+
```
184+
162185
:::moniker range="doc-intel-4.0.0"
163186

187+
## Key-value Pairs
188+
189+
Key-value pairs are specific spans within the document that identify a label or key and its associated response or value. In a structured form, these pairs could be the label and the value the user entered for that field. In an unstructured document, they could be the date a contract was executed on based on the text in a paragraph. The AI model is trained to extract identifiable keys and values based on a wide variety of document types, formats, and structures.
190+
191+
Keys can also exist in isolation when the model detects that a key exists, with no associated value or when processing optional fields. For example, a middle name field can be left blank on a form in some instances. Key-value pairs are spans of text contained in the document. For documents where the same value is described in different ways, for example, customer/user, the associated key is either customer or user (based on context).
192+
164193
## Query Fields
165194

166195
* Document Intelligence now supports query field extractions. With query field extraction, you can add fields to the extraction process using a query request without the need for added training.
@@ -185,7 +214,7 @@ For query field extraction, specify the fields you want to extract and Document
185214

186215
:::image type="content" source="media/studio/query-fields.png" alt-text="Screenshot of the query fields button in Document Intelligence Studio.":::
187216

188-
* You can pass a list of field labels like `Party1`, `Party2`, `TermsOfUse`, `PaymentTerms`, `PaymentDate`, and `TermEndDate`" as part of the analyze document request.
217+
* You can pass a list of field labels like `Party1`, `Party2`, `TermsOfUse`, `PaymentTerms`, `PaymentDate`, and `TermEndDate`" as part of the `analyze document` request.
189218

190219
:::image type="content" source="media/studio/query-field-select.png" alt-text="Screenshot of query fields selection window in Document Intelligence Studio.":::
191220

articles/ai-services/document-intelligence/concept-model-overview.md

Lines changed: 8 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -88,12 +88,15 @@ The following table shows the available models for each current preview and stab
8888
| [Custom classification model](#custom-classifier)| The **Custom classification model** can classify each page in an input file to identify the document(s) within and can also identify multiple documents or multiple instances of a single document within an input file.
8989
| [Composed models](#composed-models) | Combine several custom models into a single model to automate processing of diverse document types with a single composed model.
9090

91-
For all models, except Business card model, Document Intelligence now supports add-on capabilities to allow for more sophisticated analysis. These optional capabilities can be enabled and disabled depending on the scenario of the document extraction. There are four add-on capabilities available for the `2023-07-31` (GA) and later API version:
91+
For all models, except Business card model, Document Intelligence now supports add-on capabilities to allow for more sophisticated analysis. These optional capabilities can be enabled and disabled depending on the scenario of the document extraction. There are seven add-on capabilities available for the `2023-07-31` (GA) and later API version:
9292

93-
* [`ocr.highResolution`](concept-add-on-capabilities.md#high-resolution-extraction)
94-
* [`ocr.formula`](concept-add-on-capabilities.md#formula-extraction)
95-
* [`ocr.font`](concept-add-on-capabilities.md#font-property-extraction)
96-
* [`ocr.barcode`](concept-add-on-capabilities.md#barcode-property-extraction)
93+
* [`ocrHighResolution`](concept-add-on-capabilities.md#high-resolution-extraction)
94+
* [`formulas`](concept-add-on-capabilities.md#formula-extraction)
95+
* [`styleFont`](concept-add-on-capabilities.md#font-property-extraction)
96+
* [`barcodes`](concept-add-on-capabilities.md#barcode-property-extraction)
97+
* [`languages`](concept-add-on-capabilities.md#language-detection)
98+
* [`keyValuePairs`](concept-add-on-capabilities.md#key-value-pairs) (2023-10-31-preview)
99+
* [`queryFields`](concept-add-on-capabilities.md#query-fields) (2023-31-preview)
97100

98101
## Analysis features
99102

@@ -146,7 +149,6 @@ The Layout analysis model analyzes and extracts text, tables, selection marks, a
146149
>
147150
> [Learn more: layout model](concept-layout.md)
148151
149-
150152
### Health insurance card
151153

152154
:::image type="icon" source="media/studio/health-insurance-logo.png":::
@@ -174,7 +176,6 @@ The US tax document models analyze and extract key fields and line items from a
174176
|US Tax 1098-T|Extract qualified tuition details.|**prebuilt-tax.us.1098T**|
175177
|US Tax 1099|Extract Information from 1099 forms.|**prebuilt-tax.us.1099(variations)**|
176178

177-
178179
***Sample W-2 document processed using [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=tax.us.w2)***:
179180

180181
:::image type="content" source="./media/studio/w-2.png" alt-text="Screenshot of a sample W-2.":::

articles/ai-services/language-service/summarization/how-to/document-summarization.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -87,8 +87,6 @@ You can also use the `sortby` parameter to specify in what order the extracted s
8787

8888
### Try document abstractive summarization
8989

90-
<!-- [Reference documentation](https://go.microsoft.com/fwlink/?linkid=2211684) -->
91-
9290
The following example gets you started with document abstractive summarization:
9391

9492
1. Copy the command below into a text editor. The BASH example uses the `\` line continuation character. If your console or terminal uses a different line continuation character, use that character instead.

articles/ai-services/language-service/summarization/includes/quickstarts/rest-api.md

Lines changed: 0 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -9,16 +9,8 @@ ms.author: aahi
99
ms.custom: ignite-fall-2021, event-tier1-build-2022, ignite-2022
1010
---
1111

12-
# [Document summarization](#tab/document-summarization)
13-
14-
[Reference documentation](https://go.microsoft.com/fwlink/?linkid=2211684)
15-
1612
# [Conversation summarization](#tab/conversation-summarization)
1713

18-
[Reference documentation](https://go.microsoft.com/fwlink/?linkid=2195178)
19-
20-
---
21-
2214
Use this quickstart to send text summarization requests using the REST API. In the following example, you will use cURL to summarize documents or text-based customer service conversations.
2315

2416
[!INCLUDE [Use Language Studio](../use-language-studio.md)]
@@ -54,8 +46,6 @@ Choose the type of summarization you would like to perform, and select one of th
5446

5547
# [Document summarization](#tab/document-summarization)
5648

57-
[Reference documentation](https://go.microsoft.com/fwlink/?linkid=2211684)
58-
5949
|parameter |Description |
6050
|---------|---------|
6151
|`-X POST <endpoint>` | Specifies your endpoint for accessing the API. |
@@ -199,8 +189,6 @@ curl -X GET $LANGUAGE_ENDPOINT/language/analyze-text/jobs/<my-job-id>?api-versio
199189
}
200190
```
201191

202-
# [Conversation summarization](#tab/conversation-summarization)
203-
204192
## Conversation issue and resolution summarization
205193

206194
The following example will get you started with conversation issue and resolution summarization:

articles/ai-services/language-service/summarization/overview.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -147,8 +147,6 @@ As you use document summarization in your applications, see the following refere
147147
|JavaScript | [JavaScript documentation](/javascript/api/overview/azure/ai-text-analytics-readme?view=azure-node-preview&preserve-view=true) | [JavaScript samples](https://github.com/Azure/azure-sdk-for-js/tree/main/sdk/textanalytics/ai-text-analytics/samples/v5) |
148148
|Python | [Python documentation](/python/api/overview/azure/ai-textanalytics-readme?view=azure-python-preview&preserve-view=true) | [Python samples](https://github.com/Azure/azure-sdk-for-python/tree/main/sdk/textanalytics/azure-ai-textanalytics/samples) |
149149

150-
<!-- |REST API | [REST API documentation](https://go.microsoft.com/fwlink/?linkid=2211684) | | -->
151-
152150
## Responsible AI
153151

154152
An AI system includes not only the technology, but also the people who will use it, the people who will be affected by it, and the environment in which it’s deployed. Read the [transparency note for summarization](/legal/cognitive-services/language-service/transparency-note-extractive-summarization?context=/azure/ai-services/language-service/context/context) to learn about responsible AI use and deployment in your systems. You can also see the following articles for more information:

0 commit comments

Comments
 (0)