Commit c2f8c02

committed
update
1 parent 38f10ac commit c2f8c02

File tree

4 files changed

+73
-61
lines changed

articles/applied-ai-services/form-recognizer/concept-id-document.md

Lines changed: 13 additions & 5 deletions
Original file line number | Diff line number | Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: conceptual
10-
ms.date: 11/10/2022
10+
ms.date: 11/11/2022
1111
ms.author: lajanuar
1212
recommendations: false
1313
ms.custom: references.regions
@@ -24,13 +24,21 @@ ms.custom: references.regions
2424
[!INCLUDE [applies to v2.1](includes/applies-to-v2-1.md)]
2525
::: moniker-end
2626

27-
Azure Form Recognizer Identity document (ID) model combines Optical Character Recognition (OCR) with deep learning models to analyze and extract key information from identity documents: US Drivers Licenses (all 50 states and District of Columbia), international passport biographical pages, US state IDs, social security cards, and permanent resident cards and more. The API analyzes identity documents, extracts key information, and returns a structured JSON data representation.
27+
::: moniker range="form-recog-3.0.0"
2828

29-
## Identity document processing
29+
Azure Form Recognizer Identity document (ID) model combines Optical Character Recognition (OCR) with deep learning models to analyze and extract key information from identity documents such as US Drivers Licenses (all 50 states and District of Columbia), international passport biographical pages, US state IDs, social security cards, and permanent resident cards and more. The API analyzes identity documents, extracts key information, and returns a structured JSON data representation.
30+
31+
::: moniker-end
32+
33+
::: moniker range="form-recog-2.1.0"
34+
35+
Azure Form Recognizer can analyze and extract information from government-issued identification documents (IDs) using its prebuilt IDs model. It combines our powerful [Optical Character Recognition (OCR)](../computer-vision/overview-ocr.md) capabilities with ID recognition capabilities to extract key information from Worldwide Passports and U.S. Driver's Licenses (all 50 states and D.C.). The IDs API extracts key information from these identity documents, such as first name, last name, date of birth, document number, and more. This API is available in the Form Recognizer v2.1 as a cloud service.
3036

31-
Identity document processing involves extracting data from identity documents either manually or by using OCR based techniques. Examples of identity documents include passports, driver licenses, resident cards, and national identity cards like the US social security card.
37+
::: moniker-end
38+
39+
## Identity document processing
3240

33-
ID document processing an important step in any business process that requires some proof of identity. Examples include customer verification in banks and other financial institutions, mortgage applications, medical visits, claim processing, hospitality industry, and more. Individuals provide some proof of their identity via driver licenses, passports, and other similar documents so that the business can efficiently verify them before providing services and benefits.
41+
Identity document processing involves extracting data from identity documents either manually or by using OCR-based technology. ID document processing is an important step in any business process that requires some proof of identity. Examples include customer verification in banks and other financial institutions, mortgage applications, medical visits, claim processing, the hospitality industry, and more. Individuals provide proof of their identity via driver's licenses, passports, and other similar documents so that the business can efficiently verify them before providing services and benefits.
3442

3543
::: moniker range="form-recog-3.0.0"
3644

articles/applied-ai-services/form-recognizer/concept-layout.md

Lines changed: 56 additions & 52 deletions
@@ -14,8 +14,6 @@ recommendations: false
1414

1515
# Azure Form Recognizer layout model
1616

17-
The Form Recognizer Layout is an advanced machine-learning based document layout analysis model available in the Form Recognizer cloud API. In the version v2.1, the document layout model extracted text lines, words, tables, and selection marks.
18-
1917
::: moniker range="form-recog-3.0.0"
2018
[!INCLUDE [applies to v3.0](includes/applies-to-v3-0.md)]
2119
::: moniker-end
@@ -24,9 +22,14 @@ The Form Recognizer Layout is an advanced machine-learning based document layout
2422
[!INCLUDE [applies to v2.1](includes/applies-to-v2-1.md)]
2523
::: moniker-end
2624

25+
Form Recognizer layout model is an advanced machine-learning based document analysis API available in the Form Recognizer cloud. It enables you to take documents in a variety of formats and return structured data representations of the documents. It combines an enhanced version of our powerful [Optical Character Recognition (OCR)](../../cognitive-services/Bing-Autosuggest/computer-vision/overview-ocr.md) capabilities with deep learning models to extract text, tables, selection marks, and document structure.
26+
2727
## Document layout analysis
2828

29-
Document structure layout analysis is the process of analyzing a document to extract regions of interest and their inter-relationships. The goal is to extract text and structural elements from the page for building better semantic understanding models. For all extracted text, there are two types of roles that text plays in a document layout. Text, tables, and selection marks are examples of geometric roles. Titles, headings, and footers are examples of logical roles. For example, a reading system requires differentiating text regions from non-textual ones along with their reading order.
29+
Document structure layout analysis is the process of analyzing a document to extract regions of interest and their inter-relationships. The goal is to extract text and structural elements from the page to build better semantic understanding models. There are two types of roles that text plays in a document layout:
30+
31+
* **Geometric roles**: Text, tables, and selection marks are examples of geometric roles.
32+
* **Logical roles**: Titles, headings, and footers are examples of logical roles.
3033

3134
The following illustration shows the typical components in an image of a sample page.
3235

@@ -38,64 +41,37 @@ The following illustration shows the typical components in an image of a sample
3841

3942
:::image type="content" source="media/studio/form-recognizer-studio-layout-newspaper.png" alt-text="Screenshot of sample newspaper page processed using Form Recognizer studio":::
4043

44+
## Development options
45+
46+
The following tools are supported by Form Recognizer v3.0:
47+
48+
| Feature | Resources | Model ID |
49+
|----------|------------|------------|
50+
|**Layout model**| <ul><li>[**Form Recognizer Studio**](https://formrecognizer.appliedai.azure.com)</li><li>[**REST API**](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2022-08-31/operations/AnalyzeDocument)</li><li>[**C# SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li><li>[**Python SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li><li>[**Java SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li><li>[**JavaScript SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li></ul>|**prebuilt-layout**|
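
As a rough illustration of how the `prebuilt-layout` model ID from the table fits into a REST call, the sketch below only assembles the analyze URL; it sends nothing. The path layout `formrecognizer/documentModels/{modelId}:analyze` and the `api-version` query parameter are assumptions based on the 2022-08-31 REST API linked in the table, and `build_analyze_url` and the example endpoint are hypothetical names, not part of any SDK.

```python
from urllib.parse import urlencode

MODEL_ID = "prebuilt-layout"   # model ID from the table above
API_VERSION = "2022-08-31"     # version named in the REST API link above


def build_analyze_url(endpoint: str) -> str:
    """Assemble the analyze URL for the layout model (assumed path layout)."""
    base = endpoint.rstrip("/")  # tolerate a trailing slash on the endpoint
    query = urlencode({"api-version": API_VERSION})
    return f"{base}/formrecognizer/documentModels/{MODEL_ID}:analyze?{query}"


# Hypothetical resource endpoint, for illustration only.
print(build_analyze_url("https://example.cognitiveservices.azure.com/"))
```

A real request would also need the resource key and a document payload; check the REST API reference for those details.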
51+
4152
::: moniker-end
4253

4354
::: moniker range="form-recog-2.1.0"
4455

4556
**Sample document processed with [Form Recognizer sample labeling tool layout model](https://fott-2-1.azurewebsites.net/layout-analyze)**:
4657

4758
:::image type="content" source="media/layout-tool-example.jpg" alt-text="Screenshot of a document processed with the layout model.":::
48-
::: moniker-end
4959

50-
## Supported document types
60+
::: moniker-end
5161

52-
| **Model** | **Images** | **PDF** | **TIFF** |
53-
| --- | --- | --- | --- |
54-
| Layout ||||
62+
## Input requirements
5563

5664
::: moniker range="form-recog-3.0.0"
5765

58-
### Data extraction
59-
60-
**Starting with v3.0 GA**, it extracts paragraphs and more structure information like titles, section headings, page header, page footer, page number, and footnote from the document page. These structural elements are examples of logical roles described in the previous section. This capability is supported for PDF documents and images (JPG, PNG, BMP, TIFF).
61-
62-
| **Model** | **Text** | **Selection Marks** | **Tables** | **Paragraphs** | **Logical roles** |
63-
| --- | --- | --- | --- | --- | --- |
64-
| Layout ||||||
65-
66-
**Supported logical roles for paragraphs**:
67-
The paragraph roles are best used with unstructured documents. Paragraph roles help analyze the structure of the extracted content for better semantic search and analysis.
68-
69-
* title
70-
* sectionHeading
71-
* footnote
72-
* pageHeader
73-
* pageFooter
74-
* pageNumber
75-
76-
## Development options
77-
78-
The following tools are supported by Form Recognizer v3.0:
79-
80-
| Feature | Resources | Model ID |
81-
|----------|------------|------------|
82-
|**Layout model**| <ul><li>[**Form Recognizer Studio**](https://formrecognizer.appliedai.azure.com)</li><li>[**REST API**](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2022-08-31/operations/AnalyzeDocument)</li><li>[**C# SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li><li>[**Python SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li><li>[**Java SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li><li>[**JavaScript SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)</li></ul>|**prebuilt-layout**|
66+
[!INCLUDE [input requirements](./includes/input-requirements.md)]
8367

8468
::: moniker-end
8569

8670
::: moniker range="form-recog-2.1.0"
8771

88-
### Data extraction
89-
90-
| **Model** | **Text** | **Tables** | Selection marks|
91-
| --- | --- | --- | --- |
92-
| Layout ||||
93-
94-
The following tools are supported by Form Recognizer v2.1:
95-
96-
| Feature | Resources |
97-
|----------|-------------------------|
98-
|**Layout API**| <ul><li>[**Form Recognizer labeling tool**](https://fott-2-1.azurewebsites.net/layout-analyze)</li><li>[**REST API**](/azure/applied-ai-services/form-recognizer/how-to-guides/use-sdk-rest-api?view=form-recog-2.1.0&preserve-view=true&tabs=windows&pivots=programming-language-rest-api#analyze-layout)</li><li>[**Client-library SDK**](/azure/applied-ai-services/form-recognizer/how-to-guides/v2-1-sdk-rest-api)</li><li>[**Form Recognizer Docker container**](containers/form-recognizer-container-install-run.md?branch=main&tabs=layout#run-the-container-with-the-docker-compose-up-command)</li></ul>|
72+
* Supported file formats: JPEG, PNG, PDF, and TIFF
73+
* For PDF and TIFF, up to 2000 pages are processed. For free tier subscribers, only the first two pages are processed.
74+
* The file size must be less than 50 MB and dimensions at least 50 x 50 pixels and at most 10,000 x 10,000 pixels.
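
The limits above can be checked on the client before a document is submitted. The following is a minimal sketch, not part of the service or its SDKs: `check_input` is a hypothetical helper, the extension set is an assumed mapping of the listed formats to common file suffixes, and "50 MB" is read as 50 × 1024 × 1024 bytes. The caller supplies the size and pixel dimensions, so no image library is needed.

```python
from pathlib import Path

# Assumed file suffixes for the formats listed above (JPEG, PNG, PDF, TIFF).
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".pdf", ".tif", ".tiff"}
MAX_FILE_BYTES = 50 * 1024 * 1024  # assumption: "50 MB" read as 50 MiB
MIN_SIDE_PX = 50
MAX_SIDE_PX = 10_000


def check_input(filename, size_bytes, width_px, height_px):
    """Return a list of problems; an empty list means the file passes."""
    problems = []
    if Path(filename).suffix.lower() not in SUPPORTED_EXTENSIONS:
        problems.append("unsupported file format")
    if size_bytes >= MAX_FILE_BYTES:  # the size must be strictly less than 50 MB
        problems.append("file is 50 MB or larger")
    if min(width_px, height_px) < MIN_SIDE_PX:
        problems.append("image is smaller than 50 x 50 pixels")
    if max(width_px, height_px) > MAX_SIDE_PX:
        problems.append("image is larger than 10,000 x 10,000 pixels")
    return problems
```

The page-count limits (2000 pages, or two pages on the free tier) are not checked here because counting pages requires parsing the PDF or TIFF itself.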
9975

10076
::: moniker-end
10177

@@ -163,25 +139,53 @@ See how data, including text, tables, table headers, selection marks, and struct
163139

164140
::: moniker-end
165141

166-
## Input requirements
142+
## Supported document types
143+
144+
| **Model** | **Images** | **PDF** | **TIFF** |
145+
| --- | --- | --- | --- |
146+
| Layout ||||
147+
148+
## Supported languages and locales
149+
150+
*See* [Language Support](language-support.md) for a complete list of supported handwritten and printed languages.
167151

168152
::: moniker range="form-recog-3.0.0"
169153

170-
[!INCLUDE [input requirements](./includes/input-requirements.md)]
154+
### Data extraction
155+
156+
**Starting with v3.0 GA**, it extracts paragraphs and more structure information like titles, section headings, page header, page footer, page number, and footnote from the document page. These structural elements are examples of logical roles described in the previous section. This capability is supported for PDF documents and images (JPG, PNG, BMP, TIFF).
157+
158+
| **Model** | **Text** | **Selection Marks** | **Tables** | **Paragraphs** | **Logical roles** |
159+
| --- | --- | --- | --- | --- | --- |
160+
| Layout ||||||
161+
162+
**Supported logical roles for paragraphs**:
163+
The paragraph roles are best used with unstructured documents. Paragraph roles help analyze the structure of the extracted content for better semantic search and analysis.
164+
165+
* title
166+
* sectionHeading
167+
* footnote
168+
* pageHeader
169+
* pageFooter
170+
* pageNumber
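
One way to use these roles downstream is to bucket the extracted paragraphs by role. The sketch below assumes each paragraph is a dictionary with a `content` string and an optional `role` string; that shape, the `group_by_role` helper, the `body` bucket name, and the sample data are all illustrative assumptions, so verify the field names against an actual analyze result.

```python
from collections import defaultdict

# The six paragraph roles listed above.
KNOWN_ROLES = {
    "title", "sectionHeading", "footnote",
    "pageHeader", "pageFooter", "pageNumber",
}


def group_by_role(paragraphs):
    """Map each role to its paragraph texts; paragraphs without a known role go under 'body'."""
    grouped = defaultdict(list)
    for paragraph in paragraphs:
        role = paragraph.get("role")
        key = role if role in KNOWN_ROLES else "body"
        grouped[key].append(paragraph["content"])
    return dict(grouped)


# Hypothetical sample data, not real service output.
sample = [
    {"role": "title", "content": "Quarterly report"},
    {"content": "Revenue grew in the third quarter."},
    {"role": "pageNumber", "content": "1"},
]
grouped = group_by_role(sample)
```

Separating headers, footers, and page numbers from body text this way keeps them out of a search index built from the main content.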
171171

172172
::: moniker-end
173173

174174
::: moniker range="form-recog-2.1.0"
175175

176-
* Supported file formats: JPEG, PNG, PDF, and TIFF
177-
* For PDF and TIFF, up to 2000 pages are processed. For free tier subscribers, only the first two pages are processed.
178-
* The file size must be less than 50 MB and dimensions at least 50 x 50 pixels and at most 10,000 x 10,000 pixels.
176+
### Data extraction
179177

180-
::: moniker-end
178+
| **Model** | **Text** | **Tables** | Selection marks|
179+
| --- | --- | --- | --- |
180+
| Layout ||||
181181

182-
## Supported languages and locales
182+
The following tools are supported by Form Recognizer v2.1:
183183

184-
*See* [Language Support](language-support.md) for a complete list of supported handwritten and printed languages.
184+
| Feature | Resources |
185+
|----------|-------------------------|
186+
|**Layout API**| <ul><li>[**Form Recognizer labeling tool**](https://fott-2-1.azurewebsites.net/layout-analyze)</li><li>[**REST API**](/azure/applied-ai-services/form-recognizer/how-to-guides/use-sdk-rest-api?view=form-recog-2.1.0&preserve-view=true&tabs=windows&pivots=programming-language-rest-api#analyze-layout)</li><li>[**Client-library SDK**](/azure/applied-ai-services/form-recognizer/how-to-guides/v2-1-sdk-rest-api)</li><li>[**Form Recognizer Docker container**](containers/form-recognizer-container-install-run.md?branch=main&tabs=layout#run-the-container-with-the-docker-compose-up-command)</li></ul>|
187+
188+
::: moniker-end
185189

186190
::: moniker range="form-recog-3.0.0"
187191

articles/applied-ai-services/form-recognizer/includes/applies-to-v2-1.md

Lines changed: 2 additions & 2 deletions
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: include
10-
ms.date: 10/10/2022
10+
ms.date: 11/11/2022
1111
---
1212

13-
**This article applies to:** ![Form Recognizer v2.1 checkmark](../media/yes-icon.png) **Form Recognizer v2.1**. **Later versions:** [Form Recognizer v3.0](?view=form-recog-3.0.0&preserve-view=true)
13+
**This article applies to:** ![Form Recognizer v2.1 checkmark](../media/yes-icon.png) **Form Recognizer v2.1**. **Later version:** [Form Recognizer v3.0](?view=form-recog-3.0.0&preserve-view=true)

articles/applied-ai-services/form-recognizer/includes/applies-to-v3-0.md

Lines changed: 2 additions & 2 deletions
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: include
10-
ms.date: 10/10/2022
10+
ms.date: 11/11/2022
1111
---
1212

13-
**This article applies to:** ![Form Recognizer v3.0 checkmark](../media/yes-icon.png) **Form Recognizer v3.0**. **Earlier versions:** [Form Recognizer v2.1](?view=form-recog-2.1.0&preserve-view=true)
13+
**This article applies to:** ![Form Recognizer v3.0 checkmark](../media/yes-icon.png) **Form Recognizer v3.0**. **Earlier version:** [Form Recognizer v2.1](?view=form-recog-2.1.0&preserve-view=true)
