You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/concept-layout.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
7
7
ms.service: applied-ai-services
8
8
ms.subservice: forms-recognizer
9
9
ms.topic: conceptual
10
-
ms.date: 03/03/2023
10
+
ms.date: 03/15/2023
11
11
ms.author: lajanuar
12
12
recommendations: false
13
13
---
@@ -173,7 +173,7 @@ The paragraph roles are best used with unstructured documents. Paragraph roles
173
173
174
174
::: moniker range="form-recog-2.1.0"
175
175
176
-
### Data extraction
176
+
### Data extraction support
177
177
178
178
|**Model**|**Text**|**Tables**| Selection marks|
179
179
| --- | --- | --- | --- |
@@ -367,7 +367,7 @@ The Layout model extracts annotations in documents, such as checks and crosses.
367
367
}
368
368
```
369
369
370
-
### Extracting barcodes from documents
370
+
### Barcode extraction
371
371
372
372
The Layout model extracts all identified barcodes in the `barcodes` collection as a top level object under `content`. Inside the `content`, detected barcodes are represented as `:barcode:`. Each entry in this collection represents a barcode and includes the barcode type as `kind` and the embedded barcode content as `value` along with its `polygon` coordinates. Initially, barcodes appear at the end of each page.
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/concept-read.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
7
7
ms.service: applied-ai-services
8
8
ms.subservice: forms-recognizer
9
9
ms.topic: conceptual
10
-
ms.date: 03/02/2023
10
+
ms.date: 03/15/2023
11
11
ms.author: lajanuar
12
12
monikerRange: 'form-recog-3.0.0'
13
13
recommendations: false
@@ -108,7 +108,7 @@ The page units in the model output are computed as shown:
108
108
|PowerPoint | Each slide = 1 page unit, Each embedded image = 1 page unit | Total slides + Total images
109
109
|HTML | Up to 3,000 characters = 1 page unit, embedded or linked images not supported | Total pages of up to 3,000 characters each |
110
110
111
-
### Extracting barcodes from documents
111
+
### Barcode extraction
112
112
113
113
The Read OCR model extracts all identified barcodes in the `barcodes` collection as a top level object under `content`. Inside the `content`, detected barcodes are represented as `:barcode:`. Each entry in this collection represents a barcode and includes the barcode type as `kind` and the embedded barcode content as `value` along with its `polygon` coordinates. Initially, barcodes appear at the end of each page. Here, the `confidence` is hard-coded for the public preview (`2023-02-28`) release.
114
114
@@ -173,7 +173,7 @@ The Read OCR model in Form Recognizer adds [language detection](language-support
173
173
]
174
174
```
175
175
176
-
### Extracting pages from documents
176
+
### Extract pages from documents
177
177
178
178
The page units in the model output are computed as shown:
Copy file name to clipboardExpand all lines: articles/applied-ai-services/form-recognizer/whats-new.md
+23-24Lines changed: 23 additions & 24 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ manager: nitinme
7
7
ms.service: applied-ai-services
8
8
ms.subservice: forms-recognizer
9
9
ms.topic: conceptual
10
-
ms.date: 03/03/2023
10
+
ms.date: 03/15/2023
11
11
ms.author: lajanuar
12
12
monikerRange: '>=form-recog-2.1.0'
13
13
recommendations: false
@@ -30,32 +30,31 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
30
30
## March 2023
31
31
32
32
> [!IMPORTANT]
33
-
> Document classification, Query fields, and Add-on capabilities are currently only available in the following regions:
33
+
> [**`2023-02-28-preview`**](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2023-02-28-preview/operations/AnalyzeDocument) capabilities are currently only available in the following regions:
34
34
>
35
35
> * West Europe
36
36
> * West US2
37
37
> * East US
38
38
39
-
***Document classification** is now a new capability within Form Recognizer starting with the ```2023-02-28-preview``` API. Try out the document classification capability in the [Studio](https://formrecognizer-dogfood.appliedai.azure.com/studio/) or the REST API.
40
-
***Query fields** added to the General Document model uses Open AI model to extract specific fields from documents. See the [general document](concept-general-document.md) model to learn more or try the feature in the [Studio](https://formrecognizer-dogfood.appliedai.azure.com/studio/). Query fields are currently only active for resources in the East US region.
41
-
***Additions to the Read and Layout APIs**
42
-
***Barcodes** are now supported with the ```2023-02-28-preview``` API.
43
-
***Fonts** are now recognized with the ```2023-02-28-preview``` API.
44
-
***Formulas** are now recognized with the ```2023-02-28-preview``` API.
45
-
***Common name** normalizing key variation to a common name makes the General Document model more useful in processing forms with variations in key names. Learn more about the common name feature in the [General Document model](concept-general-document.md).
46
-
***Custom extraction model updates**
47
-
* Custom neural models now support added languages for training and analysis. Train neural models for Dutch, French, German, Italian and Spanish.
48
-
* Custom template models now have an improved signature detection capability.
49
-
***Service Updates**
50
-
* Support for high resolution documents
51
-
***Studio updates**
39
+
*[**Custom classifier model**](concept-custom-classifier.md) is a new capability within Form Recognizer starting with the ```2023-02-28-preview``` API. Try the document classification capability using the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio/document-classifier/projects) or the [REST API](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2023-02-28-preview/operations/GetClassifyDocumentResult).
40
+
*[**Query fields**](concept-query-fields.md) capabilities, added to the General Document model, use Azure OpenAI models to extract specific fields from documents. Try the **General documents with query fields** feature using the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio). Query fields are currently only active for resources in the `East US` region.
41
+
*[**Read**](concept-read.md#barcode-extraction) and [**Layout**](concept-layout.md#barcode-extraction) models support **barcode** extraction with the ```2023-02-28-preview``` API.
*[**Font extraction**](concept-add-on-capabilities.md#font-property-extraction) is now recognized with the ```2023-02-28-preview``` API.
44
+
*[**Formula extraction**](concept-add-on-capabilities.md#formula-extraction) is now recognized with the ```2023-02-28-preview``` API.
45
+
*[**High resolution extraction**](concept-add-on-capabilities.md#high-resolution-extraction) is now recognized with the ```2023-02-28-preview``` API.
46
+
*[**Common name key normalization**](concept-general-document.md#key-normalization-common-name) capabilities are added to the General Document model to improve processing forms with variations in key names.
47
+
*[**Custom extraction model updates**](concept-custom.md)
48
+
*[**Custom neural model**](concept-custom-neural.md) now supports added languages for training and analysis. Train neural models for Dutch, French, German, Italian and Spanish.
49
+
*[**Custom template model**](concept-custom-template.md) now has an improved signature detection capability.
* In addition to support for all the new features like classification and query fields, the Studio now enables project sharing for custom model projects.
53
-
***Receipt model updates**
52
+
*[**Receipt model updates**](concept-receipt.md)
54
53
* Receipt model has added support for thermal receipts.
55
54
* Receipt model now has added language support for 18 languages and three language dialects (English, French, Portuguese).
56
55
* Receipt model now supports `TaxDetails` extraction.
57
-
***Layout model** now has improved table recognition.
58
-
***Read model** now has added improvement for single-digit character recognition.
56
+
*[**Layout model**](concept-layout.md) now has improved table recognition.
57
+
*[**Read model**](concept-read.md) now has added improvement for single-digit character recognition.
59
58
60
59
---
61
60
@@ -152,7 +151,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
152
151
153
152
***Navigation**. You can select labels to target labeled words within a document.
154
153
155
-
***Auto table labeling**. After you select the table icon within a document, you can opt to auto-label the extracted table in the labeling view.
154
+
***Auto table labeling**. After you select the table icon within a document, you can opt to autolabel the extracted table in the labeling view.
156
155
157
156
***Label subtypes and second-level subtypes** The Studio now supports subtypes for table columns, table rows, and second-level subtypes for types such as dates and numbers.
158
157
@@ -480,7 +479,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
480
479
*[**Custom neural model**](concept-custom-neural.md) or custom document model is a new custom model to extract text and selection marks from structured forms, semi-strutured and **unstructured documents**.
481
480
*[**W-2 prebuilt model**](concept-w2.md) is a new prebuilt model to extract fields from W-2 forms for tax reporting and income verification scenarios.
482
481
*[**Read**](concept-read.md) API extracts printed text lines, words, text locations, detected languages, and handwritten text, if detected.
483
-
*[**General document**](concept-general-document.md)pre-trained model is now updated to support selection marks in addition to API text, tables, structure, key-value pairs, and named entities from forms and documents.
482
+
*[**General document**](concept-general-document.md)pretrained model is now updated to support selection marks in addition to API text, tables, structure, key-value pairs, and named entities from forms and documents.
484
483
*[**Invoice API**](language-support.md#invoice-model) Invoice prebuilt model expands support to Spanish invoices.
485
484
*[**Form Recognizer Studio**](https://formrecognizer.appliedai.azure.com) adds new demos for Read, W2, Hotel receipt samples, and support for training the new custom neural models.
486
485
*[**Language Expansion**](language-support.md) Form Recognizer Read, Layout, and Custom Form add support for 42 new languages including Arabic, Hindi, and other languages using Arabic and Devanagari scripts to expand the coverage to 164 languages. Handwritten language support expands to Japanese and Korean.
@@ -544,7 +543,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
544
543
545
544
***Form Recognizer v3.0 preview release version 4.0.0-beta.1 (2021-10-07)introduces several new features and capabilities:**
546
545
547
-
*[**General document**](concept-general-document.md) model is a new API that uses a pre-trained model to extract text, tables, structure, key-value pairs, and named entities from forms and documents.
546
+
*[**General document**](concept-general-document.md) model is a new API that uses a pretrained model to extract text, tables, structure, key-value pairs, and named entities from forms and documents.
548
547
*[**Hotel receipt**](concept-receipt.md) model added to prebuilt receipt processing.
549
548
*[**Expanded fields for ID document**](concept-id-document.md) the ID model supports endorsements, restrictions, and vehicle classification extraction from US driver's licenses.
550
549
*[**Signature field**](concept-custom.md) is a new field type in custom forms to detect the presence of a signature in a form field.
@@ -762,7 +761,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
762
761
763
762
* Split **FormField** type into several different interfaces. This update shouldn't cause any API compatibility issues except in certain edge cases (undefined valueType).
764
763
765
-
* Migrated to the **2.1-preview.3** Form Recognizer service endpoint for all REST API calls.
764
+
* Migrated to the **`2.1-preview.3`** Form Recognizer service endpoint for all REST API calls.
766
765
767
766
### [**Python**](#tab/python)
768
767
@@ -816,7 +815,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
816
815
817
816
---
818
817
819
-
***SDK preview updates for API version 2.1-preview.3 introduces feature updates and enhancements.**
818
+
***SDK preview updates for API version `2.1-preview.3` introduces feature updates and enhancements.**
820
819
821
820
---
822
821
@@ -885,7 +884,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
885
884
886
885
***Form Recognizer v2.1-preview.1 has been released and includes the following features:
887
886
888
-
***REST API reference is available** - View the [`v2.1-preview.1 reference`](https://westcentralus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1-preview-1/operations/AnalyzeBusinessCardAsync)
887
+
***REST API reference is available** - View the [`v2.1-preview.1 reference`](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1/operations/AnalyzeBusinessCardAsync)
889
888
***New languages supported In addition to English**, the following [languages](language-support.md) are now supported: for `Layout` and `Train Custom Model`: English (`en`), Chinese (Simplified) (`zh-Hans`), Dutch (`nl`), French (`fr`), German (`de`), Italian (`it`), Portuguese (`pt`) and Spanish (`es`).
890
889
***Checkbox / Selection Mark detection** – Form Recognizer supports detection and extraction of selection marks such as check boxes and radio buttons. Selection Marks are extracted in `Layout` and you can now also label and train in `Train Custom Model` - _Train with Labels_ to extract key-value pairs for selection marks.
891
890
***Model Compose** - allows multiple models to be composed and called with a single model ID. When you submit a document to be analyzed with a composed model ID, a classification step is first performed to route it to the correct custom model. Model Compose is available for `Train Custom Model` - _Train with labels_.
0 commit comments