Commit a13a546

committed
edit for pub
1 parent 1ba63fe commit a13a546

5 files changed: +93 -91 lines changed

articles/ai-services/document-intelligence/concept-layout.md

Lines changed: 12 additions & 10 deletions
Original file line number | Diff line number | Diff line change
@@ -8,7 +8,7 @@ ms.service: azure-ai-document-intelligence
88
ms.custom:
99
- ignite-2023
1010
ms.topic: conceptual
11-
ms.date: 02/09/2024
11+
ms.date: 02/21/2024
1212
ms.author: lajanuar
1313
---
1414

@@ -184,15 +184,16 @@ Document Intelligence v2.1 supports the following tools, applications, and libra
184184
The layout model extracts text, selection marks, tables, paragraphs, and paragraph types (`roles`) from your documents.
185185

186186
> [!NOTE]
187-
> Microsoft Word and HTML file are supported since `2023-10-31-preview`. Compared with PDF and images, below features are not supported:
188-
> - There are no angle, width/height and unit with each page object.
189-
> - For each object detected, there is no bounding polygon or bounding region.
190-
> - Page range (`pages`) is not supported as a parameter.
191-
> - No `lines` object.
187+
> Version `2023-10-31-preview` and later support Microsoft Word and HTML files. The following features are not supported:
188+
>
189+
> * There are no angle, width/height and unit with each page object.
190+
> * For each object detected, there is no bounding polygon or bounding region.
191+
> * Page range (`pages`) is not supported as a parameter.
192+
> * No `lines` object.
192193
193194
### Pages
194195

195-
The pages collection is a list of pages within the document. For each page, it is represented with the sequential number of the page within the document, the orientation angle, which could indicate if the page has been rotated, the width and height (dimentions in pixels) of the page. The page units in the model output are computed as shown:
196+
The pages collection is a list of pages within the document. Each page is represented sequentially within the document and includes the orientation angle indicating if the page is rotated and the width and height (dimensions in pixels). The page units in the model output are computed as shown:
196197

197198
**File format** | **Computed page unit** | **Total pages** |
198199
| --- | --- | --- |
@@ -271,11 +272,11 @@ The new machine-learning based page object detection extracts logical roles like
271272

272273
```
273274

274-
### Text, lines and words
275+
### Text, lines, and words
275276

276277
The document layout model in Document Intelligence extracts print and handwritten style text as `lines` and `words`. The `styles` collection includes any handwritten style for lines if detected along with the spans pointing to the associated text. This feature applies to [supported handwritten languages](language-support.md).
277278

278-
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence version 2023-10-31-preview the Layout model extracts all embedded text as is. Texts are extrated as words and paragraphs. Embedded images are not supported.
279+
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence version 2023-10-31-preview the Layout model extracts all embedded text as is. Texts are extrated as words and paragraphs. Embedded images aren't supported.
279280

280281
```json
281282
"words": [
@@ -312,7 +313,8 @@ The response includes classifying whether each text line is of handwriting style
312313
]
313314
}
314315
```
315-
If you have turned on [font/style addon capability](concept-add-on-capabilities.md#font-property-extraction), you will also get the font/style result as part of the `styles` object.
316+
317+
If you enable the [font/style addon capability](concept-add-on-capabilities.md#font-property-extraction), you also get the font/style result as part of the `styles` object.
316318

317319
### Selection marks
318320

articles/ai-services/document-intelligence/concept-model-overview.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.service: azure-ai-document-intelligence
88
ms.custom:
99
- ignite-2023
1010
ms.topic: conceptual
11-
ms.date: 02/06/2024
11+
ms.date: 02/21/2024
1212
ms.author: lajanuar
1313
---
1414

@@ -38,7 +38,7 @@ ms.author: lajanuar
3838
::: moniker-end
3939

4040
::: moniker range=">=doc-intel-2.1.0"
41-
Azure AI Document Intelligence supports a wide variety of models that enable you to add intelligent document processing to your apps and flows. You can use a prebuilt domain-specific model or train a custom model tailored to your specific business need and use cases. Document Intelligence can be used with the REST API or Python, C#, Java, and JavaScript SDKs.
41+
Azure AI Document Intelligence supports a wide variety of models that enable you to add intelligent document processing to your apps and flows. You can use a prebuilt domain-specific model or train a custom model tailored to your specific business need and use cases. Document Intelligence can be used with the REST API or Python, C#, Java, and JavaScript client libraries.
4242
::: moniker-end
4343

4444
## Model overview
@@ -88,7 +88,7 @@ Add-On* - Query fields are priced differently than the other add-on features. Se
8888
| [Read OCR](#read-ocr) | Extract print and handwritten text including words, locations, and detected languages.|
8989
| [Layout analysis](#layout-analysis) | Extract text and document layout elements like tables, selection marks, titles, section headings, and more.|
9090
|**Prebuilt models**||
91-
| [Health insurance card](#health-insurance-card) | Automate healthcare processes by extracting insurer, member, prescription, group number and other key information from US health insurance cards.|
91+
| [Health insurance card](#health-insurance-card) | Automate healthcare processes by extracting insurer, member, prescription, group number, and other key information from US health insurance cards.|
9292
| [US Tax document models](#us-tax-documents) | Process US tax forms to extract employee, employer, wage, and other information. |
9393
| [Contract](#contract) | Extract agreement and party details.|
9494
| [Invoice](#invoice) | Automate invoices. |
@@ -97,8 +97,8 @@ Add-On* - Query fields are priced differently than the other add-on features. Se
9797
| [Business card](#business-card) | Scan business cards to extract key fields and data into your applications. |
9898
|**Custom models**||
9999
| [Custom model (overview)](#custom-models) | Extract data from forms and documents specific to your business. Custom models are trained for your distinct data and use cases. |
100-
| [Custom extraction models](#custom-extraction)| &#9679; **Custom template models** use layout cues to extract values from documents and are suitable to extract fields from highly structured documents with defined visual templates.</br>&#9679; **Custom neural models** are trained on various document types to extract fields from structured, semi-structured and unstructured documents.|
101-
| [Custom classification model](#custom-classifier)| The **Custom classification model** can classify each page in an input file to identify the document(s) within and can also identify multiple documents or multiple instances of a single document within an input file.
100+
| [Custom extraction models](#custom-extraction)| &#9679; **Custom template models** use layout cues to extract values from documents and are suitable to extract fields from highly structured documents with defined visual templates.</br>&#9679; **Custom neural models** are trained on various document types to extract fields from structured, semi-structured, and unstructured documents.|
101+
| [Custom classification model](#custom-classifier)| The **Custom classification model** can classify each page in an input file to identify the documents within and can also identify multiple documents or multiple instances of a single document within an input file.
102102
| [Composed models](#composed-models) | Combine several custom models into a single model to automate processing of diverse document types with a single composed model.
103103

104104
For all models, except Business card model, Document Intelligence now supports add-on capabilities to allow for more sophisticated analysis. These optional capabilities can be enabled and disabled depending on the scenario of the document extraction. There are seven add-on capabilities available for the `2023-07-31` (GA) and later API version:
@@ -109,7 +109,7 @@ For all models, except Business card model, Document Intelligence now supports a
109109
* [`barcodes`](concept-add-on-capabilities.md#barcode-property-extraction)
110110
* [`languages`](concept-add-on-capabilities.md#language-detection)
111111
* [`keyValuePairs`](concept-add-on-capabilities.md#key-value-pairs) (2023-10-31-preview)
112-
* [`queryFields`](concept-add-on-capabilities.md#query-fields) (2023-10-31-preview). `Not available with the US.Tax models`
112+
* [`queryFields`](concept-add-on-capabilities.md#query-fields) (2023-10-31-preview) `Not available with the US.Tax models`
113113

114114
## Analysis features
115115

@@ -167,7 +167,7 @@ The US tax document models analyze and extract key fields and line items from a
167167
|US Tax 1098|Extract mortgage interest details.|**prebuilt-tax.us.1098**|
168168
|US Tax 1098-E|Extract student loan interest details.|**prebuilt-tax.us.1098E**|
169169
|US Tax 1098-T|Extract qualified tuition details.|**prebuilt-tax.us.1098T**|
170-
|US Tax 1099|Extract Information from 1099 forms.|**prebuilt-tax.us.1099(variations)**|
170+
|US Tax 1099|Extract wage information details.|**prebuilt-tax.us.1099(variations)**|
171171

172172
***Sample W-2 document processed using [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=tax.us.w2)***:
173173

@@ -193,7 +193,7 @@ The US tax document models analyze and extract key fields and line items from a
193193

194194
:::image type="icon" source="media/studio/invoice.png":::
195195

196-
The invoice model automates processing of invoices to extracts customer name, billing address, due date, and amount due, line items and other key data. Currently, the model supports English, Spanish, German, French, Italian, Portuguese, and Dutch invoices.
196+
The invoice model automates processing of invoices to extracts customer name, billing address, due date, and amount due, line items, and other key data. Currently, the model supports English, Spanish, German, French, Italian, Portuguese, and Dutch invoices.
197197

198198
***Sample invoice processed using [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio/prebuilt?formType=invoice)***:
199199

@@ -263,7 +263,7 @@ Custom extraction model can be one of two types, **custom template** or **custom
263263

264264
:::image type="icon" source="media/studio/custom-classifier.png":::
265265

266-
The custom classification model enables you to identify the document type prior to invoking the extraction model. The classification model is available starting with the `2023-07-31 (GA)` API. Training a custom classification model requires at least two distinct classes and a minimum of five samples per class.
266+
The custom classification model enables you to identify the document type before invoking the extraction model. The classification model is available starting with the `2023-07-31 (GA)` API. Training a custom classification model requires at least two distinct classes and a minimum of five samples per class.
267267

268268
> [!div class="nextstepaction"]
269269
> [Learn more: custom classification model](concept-custom-classifier.md)
@@ -399,7 +399,7 @@ The business card model analyzes and extracts key information from business card
399399
400400
#### Composed custom model
401401

402-
A composed model is created by taking a collection of custom models and assigning them to a single model built from your form types. You can assign multiple custom models to a composed model called with a single model ID. you can assign up to 100 trained custom models to a single composed model.
402+
A composed model is created by taking a collection of custom models and assigning them to a single model built from your form types. You can assign multiple custom models to a composed model called with a single model ID. You can assign up to 100 trained custom models to a single composed model.
403403

404404
***Composed model dialog window using the [Sample Labeling tool](https://formrecognizer.appliedai.azure.com/studio/customform/projects)***:
405405

@@ -436,15 +436,15 @@ A composed model is created by taking a collection of custom models and assignin
436436

437437
::: moniker range=">=doc-intel-3.0.0"
438438

439-
* Try processing your own forms and documents with the [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio)
439+
* Try processing your own forms and documents with the [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio).
440440

441441
* Complete a [Document Intelligence quickstart](quickstarts/get-started-sdks-rest-api.md?view=doc-intel-3.0.0&preserve-view=true) and get started creating a document processing app in the development language of your choice.
442442

443443
::: moniker-end
444444

445445
::: moniker range="doc-intel-2.1.0"
446446

447-
* Try processing your own forms and documents with the [Document Intelligence Sample Labeling tool](https://fott-2-1.azurewebsites.net/)
447+
* Try processing your own forms and documents with the [Document Intelligence Sample Labeling tool](https://fott-2-1.azurewebsites.net/).
448448

449449
* Complete a [Document Intelligence quickstart](quickstarts/get-started-sdks-rest-api.md?view=doc-intel-2.1.0&preserve-view=true) and get started creating a document processing app in the development language of your choice.
450450

articles/ai-services/document-intelligence/concept-read.md

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -109,23 +109,24 @@ See our [Language Support—document analysis models](language-support-ocr.md) p
109109

110110
> [!NOTE]
111111
> Microsoft Word and HTML file are supported in v3.1 and later versions. Compared with PDF and images, below features are not supported:
112-
> - There are no angle, width/height and unit with each page object.
113-
> - For each object detected, there is no bounding polygon or bounding region.
114-
> - Page range (`pages`) is not supported as a parameter.
115-
> - No `lines` object.
112+
>
113+
> * There are no angle, width/height and unit with each page object.
114+
> * For each object detected, there is no bounding polygon or bounding region.
115+
> * Page range (`pages`) is not supported as a parameter.
116+
> * No `lines` object.
116117
117118
### Pages
118119

119-
The pages collection is a list of pages within the document. For each page, it is represented with the sequential number of the page within the document, the orientation angle, which could indicate if the page has been rotated, the width and height (dimentions in pixels) of the page. The page units in the model output are computed as shown:
120+
The pages collection is a list of pages within the document. Each page is represented sequentially within the document and includes the orientation angle indicating if the page is rotated and the width and height (dimensions in pixels). The page units in the model output are computed as shown:
120121

121-
**File format** | **Computed page unit** | **Total pages** |
122+
|**File format** | **Computed page unit** | **Total pages** |
122123
| --- | --- | --- |
123124
|Images (JPEG/JPG, PNG, BMP, HEIF) | Each image = 1 page unit | Total images |
124125
|PDF | Each page in the PDF = 1 page unit | Total pages in the PDF |
125126
|TIFF | Each image in the TIFF = 1 page unit | Total images in the PDF |
126127
|Word (DOCX) | Up to 3,000 characters = 1 page unit, embedded or linked images not supported | Total pages of up to 3,000 characters each |
127-
|Excel (XLSX) | Each worksheet = 1 page unit, embedded or linked images not supported | Total worksheets |
128-
|PowerPoint (PPTX) | Each slide = 1 page unit, embedded or linked images not supported | Total slides |
128+
|Excel (XLSX) | Each worksheet = 1 page unit, embedded or linked images not supported | Total worksheets |
129+
|PowerPoint (PPTX) | Each slide = 1 page unit, embedded or linked images not supported | Total slides |
129130
|HTML | Up to 3,000 characters = 1 page unit, embedded or linked images not supported | Total pages of up to 3,000 characters each |
130131

131132
```json
@@ -165,7 +166,7 @@ The Read OCR model in Document Intelligence extracts all identified blocks of te
165166

166167
The Read OCR model extracts print and handwritten style text as `lines` and `words`. The model outputs bounding `polygon` coordinates and `confidence` for the extracted words. The `styles` collection includes any handwritten style for lines if detected along with the spans pointing to the associated text. This feature applies to [supported handwritten languages](language-support.md).
167168

168-
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence Read model v3.1 and later versions extracts all embedded text as is. Texts are extrated as words and paragraphs. Embedded images are not supported.
169+
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence Read model v3.1 and later versions extracts all embedded text as is. Texts are extrated as words and paragraphs. Embedded images aren't supported.
169170

170171

171172
```json
@@ -204,7 +205,7 @@ The response includes classifying whether each text line is of handwriting style
204205
}
205206
```
206207

207-
If you have turned on [font/style addon capability](concept-add-on-capabilities.md#font-property-extraction), you will also get the font/style result as part of the `styles` object.
208+
If you enabled the [font/style addon capability](concept-add-on-capabilities.md#font-property-extraction), you also get the font/style result as part of the `styles` object.
208209

209210
## Next steps
210211

articles/ai-services/document-intelligence/overview.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.service: azure-ai-document-intelligence
88
ms.custom:
99
- ignite-2023
1010
ms.topic: overview
11-
ms.date: 02/06/2024
11+
ms.date: 02/21/2024
1212
ms.author: lajanuar
1313
monikerRange: '<=doc-intel-4.0.0'
1414
---
@@ -217,14 +217,14 @@ Prebuilt models enable you to add intelligent document processing to your apps a
217217
:::row:::
218218
:::column:::
219219
* **Classification model**</br>
220-
✔️ Custom classifiers identify document types prior to invoking an extraction model.
220+
✔️ Custom classifiers identify document types before invoking an extraction model.
221221
:::column-end:::
222222
:::row-end:::
223223

224224
:::row:::
225225
:::column span="":::
226226
:::image type="icon" source="media/overview/icon-custom-classifier.png" link="#custom-classification-model":::</br>
227-
[**Custom classifier**](#custom-classification-model) | Identify designated document types (classes) </br>prior to invoking an extraction model.
227+
[**Custom classifier**](#custom-classification-model) | Identify designated document types (classes) </br>before invoking an extraction model.
228228
:::column-end:::
229229
:::row-end:::
230230

@@ -266,7 +266,7 @@ Document Intelligence supports optional features that can be enabled and disable
266266

267267
✓ - Enabled</br>
268268
O - Optional</br>
269-
\* - Premium features incur extra costs
269+
\* - Premium features incur extra costs.
270270

271271
## Models and development options
272272

@@ -416,7 +416,7 @@ You can use Document Intelligence to automate document processing in application
416416

417417
| Model ID |Description|Development options |
418418
|----------|--------------|-----------------|
419-
|**prebuilt-tax.us.1099(Variations)**|Extract information from 1099 form variations.|&#9679; [**Document Intelligence Studio**](https://formrecognizer.appliedai.azure.com/studio)</br>&#9679; [**REST API**](https://westus.dev.cognitive.microsoft.com/docs/services?pattern=intelligence)|
419+
|**prebuilt-tax.us.1099(Variations)**|Extract information from 1099-form variations.|&#9679; [**Document Intelligence Studio**](https://formrecognizer.appliedai.azure.com/studio)</br>&#9679; [**REST API**](https://westus.dev.cognitive.microsoft.com/docs/services?pattern=intelligence)|
420420

421421
> [!div class="nextstepaction"]
422422
> [Return to model types](#prebuilt-models)
@@ -550,17 +550,17 @@ Use the links in the table to learn more about each model and browse the API ref
550550

551551
::: moniker range=">=doc-intel-3.0.0"
552552

553-
* [Choose a Document Intelligence model](choose-model-feature.md)
553+
* [Choose a Document Intelligence model](choose-model-feature.md).
554554

555-
* Try processing your own forms and documents with the [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio)
555+
* Try processing your own forms and documents with the [Document Intelligence Studio](https://formrecognizer.appliedai.azure.com/studio).
556556

557557
* Complete a [Document Intelligence quickstart](quickstarts/get-started-sdks-rest-api.md?view=doc-intel-3.0.0&preserve-view=true) and get started creating a document processing app in the development language of your choice.
558558

559559
::: moniker-end
560560

561561
::: moniker range="doc-intel-2.1.0"
562562

563-
* Try processing your own forms and documents with the [Document Intelligence Sample Labeling tool](https://fott-2-1.azurewebsites.net/)
563+
* Try processing your own forms and documents with the [Document Intelligence Sample Labeling tool](https://fott-2-1.azurewebsites.net/).
564564

565565
* Complete a [Document Intelligence quickstart](quickstarts/get-started-sdks-rest-api.md?view=doc-intel-2.1.0&preserve-view=true) and get started creating a document processing app in the development language of your choice.
566566

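Reviewer's note on the page-unit table in the `concept-read.md` diff above: the rule it states can be sketched in a few lines. This is only an illustration of the table as written. The function name `estimate_page_units` and its parameters are hypothetical and are not part of any Document Intelligence client library, and the behavior for an empty document is an assumption, since the table does not cover it.

```python
import math

# From the table: "Up to 3,000 characters = 1 page unit" for Word (DOCX) and HTML.
CHARS_PER_PAGE_UNIT = 3000

# Formats where each item (image, PDF page, TIFF image, worksheet, slide)
# counts as one page unit, per the table.
COUNTED_FORMATS = {"image", "pdf", "tiff", "xlsx", "pptx"}
CHARACTER_FORMATS = {"docx", "html"}


def estimate_page_units(file_format: str, *, items: int = 0, characters: int = 0) -> int:
    """Hypothetical helper that mirrors the page-unit table.

    items: number of images, PDF pages, TIFF images, worksheets, or slides.
    characters: number of text characters, used only for DOCX and HTML.
    """
    fmt = file_format.lower()
    if fmt in CHARACTER_FORMATS:
        # Every block of up to 3,000 characters is one page unit.
        # Assumption: an empty document yields 0; the table does not say.
        return math.ceil(characters / CHARS_PER_PAGE_UNIT)
    if fmt in COUNTED_FORMATS:
        return items
    raise ValueError(f"unsupported format: {file_format!r}")
```

For example, `estimate_page_units("docx", characters=7500)` returns 3, because 7,500 characters span three blocks of up to 3,000 characters each, while `estimate_page_units("pptx", items=12)` returns 12.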