Skip to content

Commit 393e91f

Browse files
Merge pull request #229797 from laujan/query-fields
add concept-query-fields
2 parents 45a1fda + 929544b commit 393e91f

File tree

7 files changed

+70
-24
lines changed

7 files changed

+70
-24
lines changed
Lines changed: 49 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,49 @@
1+
---
2+
title: Query field extraction - Form Recognizer
3+
titleSuffix: Azure Applied AI Services
4+
description: Use Form Recognizer to extract query field data.
5+
author: laujan
6+
manager: nitinme
7+
ms.service: applied-ai-services
8+
ms.subservice: forms-recognizer
9+
ms.topic: conceptual
10+
ms.date: 03/07/2023
11+
ms.author: netahw
12+
monikerRange: 'form-recog-3.0.0'
13+
recommendations: false
14+
---
15+
<!-- markdownlint-disable MD033 -->
16+
17+
# Azure Form Recognizer query field extraction
18+
19+
**This article applies to:** ![Form Recognizer v3.0 checkmark](media/yes-icon.png) **Form Recognizer v3.0**.
20+
21+
> [!IMPORTANT]
22+
>
23+
> * The Form Recognizer Studio query fields extraction feature is currently in gated preview. Features, approaches and processes may change, prior to General Availability (GA), based on user feedback.
24+
> * Complete and submit the [**Form Recognizer private preview request form**](https://aka.ms/form-recognizer/preview/survey) to request access.
25+
26+
Form Recognizer now supports query field extractions using Azure OpenAI capabilities. With query field extraction, you can add fields to the extraction process using a query request without the need for added training.
27+
28+
> [!NOTE]
29+
>
30+
> Form Recognizer Studio query field extraction is currently available with the general document model for the `2023-02-28-preview` release.
31+
32+
## Select query fields
33+
34+
For query field extraction, specify the fields you want to extract and Form Recognizer analyzes the document accordingly. Here's an example:
35+
36+
* If you're processing a contract in the Form Recognizer Studio, you can pass a list of field labels like `Party1`, `Party2`, `TermsOfUse`, `PaymentTerms`, `PaymentDate`, and `TermEndDate`" as part of the analyze document request.
37+
38+
:::image type="content" source="media/studio/query-field-select.png" alt-text="Screenshot of query fields selection window in Form Recognizer Studio.":::
39+
40+
* Form Recognizer utilizes the capabilities of both [**Azure OpenAI Service**](../../cognitive-services/openai/overview.md) and extraction models to analyze and extract the field data and return the values in a structured JSON output.
41+
42+
* In addition to the query fields, the response includes text, tables, selection marks, general document key-value pairs, and other relevant data.
43+
44+
:::image type="content" source="media/studio/query-field-analyze.png" alt-text="Screenshot of query field analysis in Form Recognizer Studio.":::
45+
46+
## Next steps
47+
48+
> [!div class="nextstepaction"]
49+
> [Try the Form Recognizer Studio quickstart](./quickstarts/try-form-recognizer-studio.md)

articles/applied-ai-services/form-recognizer/concept-w2.md

Lines changed: 7 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -21,23 +21,23 @@ The Form Recognizer W-2 model, combines Optical Character Recognition (OCR) with
2121

2222
## Automated W-2 form processing
2323

24-
Form W-2, also known as the Wage and Tax Statement, is sent by an employer to each employee and the Internal Revenue Service (IRS) at the end of the year. A W-2 form reports employees' annual wages and the amount of taxes withheld from their paychecks. The IRS also uses W-2 forms to track individuals' tax obligations. The Social Security Administration (SSA) uses the information on this and other forms to compute the Social Security benefits for all workers.
24+
An employer sends form W-2, also known as the Wage and Tax Statement, to each employee and the Internal Revenue Service (IRS) at the end of the year. A W-2 form reports employees' annual wages and the amount of taxes withheld from their paychecks. The IRS also uses W-2 forms to track individuals' tax obligations. The Social Security Administration (SSA) uses the information on this and other forms to compute the Social Security benefits for all workers.
2525

2626
***Sample W-2 tax form processed using Form Recognizer Studio***
2727

2828
:::image type="content" source="media/studio/w-2.png" alt-text="Screenshot of sample W-2 processed in the Form Recognizer Studio.":::
2929

3030
## Development options
3131

32-
The prebuilt W-2 model is supported by Form Recognizer v3.0 with the following tools:
32+
Form Recognizer v3.0 supports the following tools:
3333

3434
| Feature | Resources | Model ID |
3535
|----------|-------------|-----------|
3636
|**W-2 model**|<ul><li> [**Form Recognizer Studio**](https://formrecognizer.appliedai.azure.com)</li><li>[**REST API**](https://westus.dev.cognitive.microsoft.com/docs/services/form-recognizer-api-2022-08-31/operations/AnalyzeDocument)</li><li>[**C# SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)</li><li>[**Python SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)</li><li>[**Java SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)</li><li>[**JavaScript SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)</li></ul>|**prebuilt-tax.us.w2**|
3737

3838
### Try W-2 data extraction
3939

40-
Try extracting data from W-2 forms using the Form Recognizer Studio. You'll need the following resources:
40+
Try extracting data from W-2 forms using the Form Recognizer Studio. You need the following resources:
4141

4242
* An Azure subscription—you can [create one for free](https://azure.microsoft.com/free/cognitive-services/)
4343

@@ -92,7 +92,7 @@ Try extracting data from W-2 forms using the Form Recognizer Studio. You'll need
9292
| MedicareTaxWithheld | 6 | Number | Medicare tax with held | 1111 |
9393
| SocialSecurityTips | 7 | Number | Social security tips | 1111 |
9494
| AllocatedTips | 8 | Number | Allocated tips | 1111 |
95-
| VerificationCode | 9 | String | Verification Code on Form W-2 | A123-B456-C789-DXYZ |
95+
| Verification&#8203;Code | 9 | String | Verification Code on Form W-2 | A123-B456-C789-DXYZ |
9696
| DependentCareBenefits | 10 | Number | Dependent care benefits | 1111 |
9797
| NonqualifiedPlans | 11 | Number | The non-qualified plan, a type of retirement savings plan that is employer-sponsored and tax-deferred | 1111 |
9898
| AdditionalInfo | | Array of objects | An array of LetterCode and Amount | |
@@ -123,11 +123,6 @@ Try extracting data from W-2 forms using the Form Recognizer Studio. You'll need
123123

124124
## Next steps
125125

126-
* Complete a Form Recognizer quickstart:
127-
> [!div class="checklist"]
128-
>
129-
> * [**REST API**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true)
130-
> * [**C# SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)
131-
> * [**Python SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)
132-
> * [**Java SDK**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)
133-
> * [**JavaScript**](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true#prebuilt-model)</li></ul>
126+
* Try processing your own forms and documents with the [Form Recognizer Studio](https://formrecognizer.appliedai.azure.com/studio)
127+
128+
* Complete a [Form Recognizer quickstart](quickstarts/get-started-sdks-rest-api.md?view=form-recog-3.0.0&preserve-view=true) and get started creating a document processing app in the development language of your choice.

articles/applied-ai-services/form-recognizer/faq.yml

Lines changed: 9 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ metadata:
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: faq
10-
ms.date: 10/20/2022
10+
ms.date: 03/09/2023
1111
ms.author: lajanuar
1212
monikerRange: '>=form-recog-2.1.0'
1313
recommendations: false
@@ -39,7 +39,7 @@ sections:
3939
Learn more about [use case considerations](/legal/cognitive-services/form-recognizer/fr-transparency-note?context=/azure/applied-ai-services/form-recognizer/context/context#considerations-when-choosing-other-use-cases).
4040
4141
- question: |
42-
What languages are supported by Form Recognizer?
42+
What languages does Form Recognizer support?
4343
answer: |
4444
4545
Form Recognizer's deep-learning-based universal models support many languages that can extract multi-lingual text from your images and documents, including text lines with mixed languages.
@@ -100,7 +100,7 @@ sections:
100100
- question: |
101101
How can I improve accuracy scores?
102102
answer: |
103-
The accuracy of a model is influenced by variances in the visual structure of your documents.
103+
Variances in the visual structure of your documents can influence the accuracy of a model.
104104
105105
- Ensure that all variations of a document are included in the training dataset. Variations include different formats, for example, digital versus scanned PDFs.
106106
@@ -220,12 +220,12 @@ sections:
220220
221221
- The parameter `pages`(supported in both v2.1 and v3.0 REST API) enables you to specify pages for multi-page PDF and TIFF documents. Accepted input includes the following ranges:
222222
223-
- Single pages (for example,'1, 2' -> pages 1 and 2 will be processed).- Finite (for example '2-5' -> pages 2 to 5 will be processed)
224-
- Open-ended ranges (for example '5-' -> all the pages from page 5 will be processed & for example, '-10' -> pages 1 to 10 will be processed).
223+
- Single pages (for example,'1, 2' -> pages 1 and 2 are processed).- Finite (for example '2-5' -> pages 2 to 5 are processed)
224+
- Open-ended ranges (for example '5-' -> all the pages from page 5 are processed & for example, '-10' -> pages 1 to 10 are processed).
225225
226-
- These parameters can be mixed together and ranges are allowed to overlap (for example, '-5, 1, 3, 5-10' - pages 1 to 10 will be processed).
226+
- These parameters can be mixed together and ranges are allowed to overlap (for example, '-5, 1, 3, 5-10' - pages 1 to 10 are processed).
227227
228-
- The service accepts the request if it can process at least one page of the document. For example, using '5-100' on a five page document is a valid input where page 5 will be processed.
228+
- The service accepts the request if it can process at least one page of the document. For example, using '5-100' on a five page document is a valid input where page 5 is processed.
229229
230230
- If no page range is provided, the entire document is processed.
231231
@@ -286,7 +286,7 @@ sections:
286286
Learn more about Form Recognizer [service quotas and limits](service-limits.md)
287287
288288
- question: |
289-
How long will it take to analyze a document?
289+
How long does it take to analyze a document?
290290
answer: |
291291
Form Recognizer is a multi-tenanted service where latency for similar documents is comparable but not always identical. The time to analyze a document depends on the size (for example, number of pages) and associated content on each page.
292292
@@ -409,7 +409,7 @@ sections:
409409
- question: |
410410
If my storage account is behind a VNet or firewall, how do I give Form Recognizer access to my storage account data?
411411
answer: |
412-
If you have an Azure storage account protected by a Virtual Network (VNet) or firewall, Form Recognizer can’t directly access your storage account. However, Private Azure storage account access and authentication are supported by [managed identities for Azure resources](../../active-directory/managed-identities-azure-resources/overview.md). Once a managed identity is enabled, the Form Recognizer service can access your storage account using an assigned managed identity credential.
412+
If you have an Azure storage account protected by a Virtual Network (VNet) or firewall, Form Recognizer can’t directly access your storage account. However, Private Azure storage account access and authentication support [managed identities for Azure resources](../../active-directory/managed-identities-azure-resources/overview.md). Once a managed identity is enabled, the Form Recognizer service can access your storage account using an assigned managed identity credential.
413413
414414
If you intend to analyze your private storage account data with FOTT, the tool must be deployed behind the VNet or firewall.
415415

articles/applied-ai-services/form-recognizer/language-support.md

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,8 +7,7 @@ manager: nitinme
77
ms.service: applied-ai-services
88
ms.subservice: forms-recognizer
99
ms.topic: reference
10-
ms.date: 01/06/2023
11-
ms.author: lajanuar
10+
ms.date: 03/09/2023
1211
---
1312

1413
# Language support for Form Recognizer
@@ -26,7 +25,7 @@ This article covers the supported languages for text and field **extraction (by
2625

2726
## Read, layout, and custom form (template) model
2827

29-
The following lists include the currently GA languages in the most recent v3.0 version. These languages are supported by Read, Layout, and Custom form (template) model features.
28+
The following lists include the currently GA languages in the most recent v3.0 version for Read, Layout, and Custom template (form) models.
3029

3130
> [!NOTE]
3231
> **Language code optional**
607 KB
Loading
324 KB
Loading

articles/applied-ai-services/form-recognizer/toc.yml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -154,6 +154,9 @@ items:
154154
- name: Layout model
155155
displayName: tables, selection marks, structure, paragraph roles, text, headers, page numbers
156156
href: concept-layout.md
157+
- name: 🆕 Query field extraction (preview)
158+
displayName: queries, fields, OpenAI, chat
159+
href: concept-query-fields.md
157160
- name: 🆕 Health insurance card model (preview)
158161
displayName: health, proof, hospital
159162
href: concept-insurance-card.md

0 commit comments

Comments
 (0)