Skip to content

Commit ab730ae

Browse files
Merge pull request #230888 from laujan/69653-second-pass-feb-preview
update what's new?
2 parents 29cb0a0 + 93f0054 commit ab730ae

File tree

4 files changed

+31
-32
lines changed

4 files changed

+31
-32
lines changed

articles/applied-ai-services/form-recognizer/concept-general-document.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: conceptual
10-
ms.date: 11/14/2022
10+
ms.date: 03/15/2023
1111
ms.author: lajanuar
1212
monikerRange: 'form-recog-3.0.0'
1313
recommendations: false
@@ -69,7 +69,7 @@ You need the following resources:
6969

7070
:::image type="content" source="media/containers/keys-and-endpoint.png" alt-text="Screenshot: keys and endpoint location in the Azure portal.":::
7171

72-
#### Form Recognizer Studio
72+
#### Form Recognizer Studio
7373

7474
> [!NOTE]
7575
> Form Recognizer studio and the general document model are available with the v3.0 API.

articles/applied-ai-services/form-recognizer/concept-layout.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: conceptual
10-
ms.date: 03/03/2023
10+
ms.date: 03/15/2023
1111
ms.author: lajanuar
1212
recommendations: false
1313
---
@@ -173,7 +173,7 @@ The paragraph roles are best used with unstructured documents. Paragraph roles
173173

174174
::: moniker range="form-recog-2.1.0"
175175

176-
### Data extraction
176+
### Data extraction support
177177

178178
| **Model** | **Text** | **Tables** | Selection marks|
179179
| --- | --- | --- | --- |
@@ -367,7 +367,7 @@ The Layout model extracts annotations in documents, such as checks and crosses.
367367
}
368368
```
369369

370-
### Extracting barcodes from documents
370+
### Barcode extraction
371371

372372
The Layout model extracts all identified barcodes in the `barcodes` collection as a top level object under `content`. Inside the `content`, detected barcodes are represented as `:barcode:`. Each entry in this collection represents a barcode and includes the barcode type as `kind` and the embedded barcode content as `value` along with its `polygon` coordinates. Initially, barcodes appear at the end of each page.
373373

articles/applied-ai-services/form-recognizer/concept-read.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: conceptual
10-
ms.date: 03/02/2023
10+
ms.date: 03/15/2023
1111
ms.author: lajanuar
1212
monikerRange: 'form-recog-3.0.0'
1313
recommendations: false
@@ -108,7 +108,7 @@ The page units in the model output are computed as shown:
108108
|PowerPoint | Each slide = 1 page unit, Each embedded image = 1 page unit | Total slides + Total images
109109
|HTML | Up to 3,000 characters = 1 page unit, embedded or linked images not supported | Total pages of up to 3,000 characters each |
110110

111-
### Extracting barcodes from documents
111+
### Barcode extraction
112112

113113
The Read OCR model extracts all identified barcodes in the `barcodes` collection as a top level object under `content`. Inside the `content`, detected barcodes are represented as `:barcode:`. Each entry in this collection represents a barcode and includes the barcode type as `kind` and the embedded barcode content as `value` along with its `polygon` coordinates. Initially, barcodes appear at the end of each page. Here, the `confidence` is hard-coded for the public preview (`2023-02-28`) release.
114114

@@ -173,7 +173,7 @@ The Read OCR model in Form Recognizer adds [language detection](language-support
173173
]
174174
```
175175

176-
### Extracting pages from documents
176+
### Extract pages from documents
177177

178178
The page units in the model output are computed as shown:
179179

articles/applied-ai-services/form-recognizer/whats-new.md

Lines changed: 23 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: conceptual
10-
ms.date: 03/03/2023
10+
ms.date: 03/15/2023
1111
ms.author: lajanuar
1212
monikerRange: '>=form-recog-2.1.0'
1313
recommendations: false
@@ -30,32 +30,31 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
3030
## March 2023
3131

3232
> [!IMPORTANT]
33-
> Document classification, Query fields, and Add-on capabilities are currently only available in the following regions:
33+
> [**`2023-02-28-preview`**](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2023-02-28-preview/operations/AnalyzeDocument) capabilities are currently only available in the following regions:
3434
>
3535
> * West Europe
3636
> * West US2
3737
> * East US
3838
39-
* **Document classification** is now a new capability within Form Recognizer starting with the ```2023-02-28-preview``` API. Try out the document classification capability in the [Studio](https://formrecognizer-dogfood.appliedai.azure.com/studio/) or the REST API.
40-
* **Query fields** added to the General Document model uses Open AI model to extract specific fields from documents. See the [general document](concept-general-document.md) model to learn more or try the feature in the [Studio](https://formrecognizer-dogfood.appliedai.azure.com/studio/). Query fields are currently only active for resources in the East US region.
41-
* **Additions to the Read and Layout APIs**
42-
* **Barcodes** are now supported with the ```2023-02-28-preview``` API.
43-
* **Fonts** are now recognized with the ```2023-02-28-preview``` API.
44-
* **Formulas** are now recognized with the ```2023-02-28-preview``` API.
45-
* **Common name** normalizing key variation to a common name makes the General Document model more useful in processing forms with variations in key names. Learn more about the common name feature in the [General Document model](concept-general-document.md).
46-
* **Custom extraction model updates**
47-
* Custom neural models now support added languages for training and analysis. Train neural models for Dutch, French, German, Italian and Spanish.
48-
* Custom template models now have an improved signature detection capability.
49-
* **Service Updates**
50-
* Support for high resolution documents
51-
* **Studio updates**
39+
* [**Custom classifier model**](concept-custom-classifier.md) is a new capability within Form Recognizer starting with the ```2023-02-28-preview``` API. Try the document classification capability using the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio/document-classifier/projects) or the [REST API](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2023-02-28-preview/operations/GetClassifyDocumentResult).
40+
* [**Query fields**](concept-query-fields.md) capabilities, added to the General Document model, use Azure OpenAI models to extract specific fields from documents. Try the **General documents with query fields** feature using the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio). Query fields are currently only active for resources in the `East US` region.
41+
* [**Read**](concept-read.md#barcode-extraction) and [**Layout**](concept-layout.md#barcode-extraction) models support **barcode** extraction with the ```2023-02-28-preview``` API.
42+
* [**Add-on capabilities**](concept-add-on-capabilities.md)
43+
* [**Font extraction**](concept-add-on-capabilities.md#font-property-extraction) is now recognized with the ```2023-02-28-preview``` API.
44+
* [**Formula extraction**](concept-add-on-capabilities.md#formula-extraction) is now recognized with the ```2023-02-28-preview``` API.
45+
* [**High resolution extraction**](concept-add-on-capabilities.md#high-resolution-extraction) is now recognized with the ```2023-02-28-preview``` API.
46+
* [**Common name key normalization**](concept-general-document.md#key-normalization-common-name) capabilities are added to the General Document model to improve processing forms with variations in key names.
47+
* [**Custom extraction model updates**](concept-custom.md)
48+
* [**Custom neural model**](concept-custom-neural.md) now supports added languages for training and analysis. Train neural models for Dutch, French, German, Italian and Spanish.
49+
* [**Custom template model**](concept-custom-template.md) now has an improved signature detection capability.
50+
* [**Form Recognizer Studio**](https://formrecognizer.appliedai.azure.com/studio) updates
5251
* In addition to support for all the new features like classification and query fields, the Studio now enables project sharing for custom model projects.
53-
* **Receipt model updates**
52+
* [**Receipt model updates**](concept-receipt.md)
5453
* Receipt model has added support for thermal receipts.
5554
* Receipt model now has added language support for 18 languages and three language dialects (English, French, Portuguese).
5655
* Receipt model now supports `TaxDetails` extraction.
57-
* **Layout model** now has improved table recognition.
58-
* **Read model** now has added improvement for single-digit character recognition.
56+
* [**Layout model**](concept-layout.md) now has improved table recognition.
57+
* [**Read model**](concept-read.md) now has added improvement for single-digit character recognition.
5958

6059
---
6160

@@ -152,7 +151,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
152151

153152
* **Navigation**. You can select labels to target labeled words within a document.
154153

155-
* **Auto table labeling**. After you select the table icon within a document, you can opt to auto-label the extracted table in the labeling view.
154+
* **Auto table labeling**. After you select the table icon within a document, you can opt to autolabel the extracted table in the labeling view.
156155

157156
* **Label subtypes and second-level subtypes** The Studio now supports subtypes for table columns, table rows, and second-level subtypes for types such as dates and numbers.
158157

@@ -480,7 +479,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
480479
* [**Custom neural model**](concept-custom-neural.md) or custom document model is a new custom model to extract text and selection marks from structured forms, semi-strutured and **unstructured documents**.
481480
* [**W-2 prebuilt model**](concept-w2.md) is a new prebuilt model to extract fields from W-2 forms for tax reporting and income verification scenarios.
482481
* [**Read**](concept-read.md) API extracts printed text lines, words, text locations, detected languages, and handwritten text, if detected.
483-
* [**General document**](concept-general-document.md) pre-trained model is now updated to support selection marks in addition to API text, tables, structure, key-value pairs, and named entities from forms and documents.
482+
* [**General document**](concept-general-document.md) pretrained model is now updated to support selection marks in addition to API text, tables, structure, key-value pairs, and named entities from forms and documents.
484483
* [**Invoice API**](language-support.md#invoice-model) Invoice prebuilt model expands support to Spanish invoices.
485484
* [**Form Recognizer Studio**](https://formrecognizer.appliedai.azure.com) adds new demos for Read, W2, Hotel receipt samples, and support for training the new custom neural models.
486485
* [**Language Expansion**](language-support.md) Form Recognizer Read, Layout, and Custom Form add support for 42 new languages including Arabic, Hindi, and other languages using Arabic and Devanagari scripts to expand the coverage to 164 languages. Handwritten language support expands to Japanese and Korean.
@@ -544,7 +543,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
544543

545544
* **Form Recognizer v3.0 preview release version 4.0.0-beta.1 (2021-10-07)introduces several new features and capabilities:**
546545

547-
* [**General document**](concept-general-document.md) model is a new API that uses a pre-trained model to extract text, tables, structure, key-value pairs, and named entities from forms and documents.
546+
* [**General document**](concept-general-document.md) model is a new API that uses a pretrained model to extract text, tables, structure, key-value pairs, and named entities from forms and documents.
548547
* [**Hotel receipt**](concept-receipt.md) model added to prebuilt receipt processing.
549548
* [**Expanded fields for ID document**](concept-id-document.md) the ID model supports endorsements, restrictions, and vehicle classification extraction from US driver's licenses.
550549
* [**Signature field**](concept-custom.md) is a new field type in custom forms to detect the presence of a signature in a form field.
@@ -762,7 +761,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
762761

763762
* Split **FormField** type into several different interfaces. This update shouldn't cause any API compatibility issues except in certain edge cases (undefined valueType).
764763

765-
* Migrated to the **2.1-preview.3** Form Recognizer service endpoint for all REST API calls.
764+
* Migrated to the **`2.1-preview.3`** Form Recognizer service endpoint for all REST API calls.
766765

767766
### [**Python**](#tab/python)
768767

@@ -816,7 +815,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
816815

817816
---
818817

819-
* **SDK preview updates for API version 2.1-preview.3 introduces feature updates and enhancements.**
818+
* **SDK preview updates for API version `2.1-preview.3` introduces feature updates and enhancements.**
820819

821820
---
822821

@@ -885,7 +884,7 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
885884

886885
* **Form Recognizer v2.1-preview.1 has been released and includes the following features:
887886

888-
* **REST API reference is available** - View the [`v2.1-preview.1 reference`](https://westcentralus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1-preview-1/operations/AnalyzeBusinessCardAsync)
887+
* **REST API reference is available** - View the [`v2.1-preview.1 reference`](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-v2-1/operations/AnalyzeBusinessCardAsync)
889888
* **New languages supported In addition to English**, the following [languages](language-support.md) are now supported: for `Layout` and `Train Custom Model`: English (`en`), Chinese (Simplified) (`zh-Hans`), Dutch (`nl`), French (`fr`), German (`de`), Italian (`it`), Portuguese (`pt`) and Spanish (`es`).
890889
* **Checkbox / Selection Mark detection** – Form Recognizer supports detection and extraction of selection marks such as check boxes and radio buttons. Selection Marks are extracted in `Layout` and you can now also label and train in `Train Custom Model` - _Train with Labels_ to extract key-value pairs for selection marks.
891890
* **Model Compose** - allows multiple models to be composed and called with a single model ID. When you submit a document to be analyzed with a composed model ID, a classification step is first performed to route it to the correct custom model. Model Compose is available for `Train Custom Model` - _Train with labels_.

0 commit comments

Comments
 (0)