Commit dc6baf1

Merge pull request #227210 from laujan/63472-update-language-support-receipt-invoice
update language support, receipt, invoice
2 parents ce780db + d21b54a commit dc6baf1

4 files changed, 40 additions and 36 deletions

articles/applied-ai-services/form-recognizer/concept-invoice.md

Lines changed: 15 additions & 16 deletions
@@ -7,7 +7,7 @@ manager: nitinme
 ms.service: applied-ai-services
 ms.subservice: forms-recognizer
 ms.topic: conceptual
-ms.date: 01/25/2023
+ms.date: 02/13/2023
 ms.author: lajanuar
 recommendations: false
 ---
@@ -92,7 +92,7 @@ The following tools are supported by Form Recognizer v2.1:
 
 ## Try invoice data extraction
 
-See how data, including customer information, vendor details, and line items, is extracted from invoices. You'll need the following resources:
+See how data, including customer information, vendor details, and line items, is extracted from invoices. You need the following resources:
 
 * An Azure subscription—you can [create one for free](https://azure.microsoft.com/free/cognitive-services/)
 
@@ -146,7 +146,7 @@ See how data, including customer information, vendor details, and line items, is
 
 :::image type="content" source="media/fott-select-form-type.png" alt-text="Screenshot showing the select-form-type dropdown menu.":::
 
-1. Select **Run analysis**. The Form Recognizer Sample Labeling tool will call the Analyze Prebuilt API and analyze the document.
+1. Select **Run analysis**. The Form Recognizer Sample Labeling tool calls the Analyze Prebuilt API and analyze the document.
 
 1. View the results - see the key-value pairs extracted, line items, highlighted text extracted and tables detected.
 
@@ -166,13 +166,13 @@ See how data, including customer information, vendor details, and line items, is
 
 | Supported languages | Details |
 |:----------------------|:---------|
-| • English | United States (us), Australia (-au), Canada (-ca), Great Britain (-gb), India (-in)|
-| • Spanish |Spain (es)|
-| • German | Germany (de)|
-| • French | France (fr) |
-| • Italian | Italy (it)|
-| • Portuguese | Portugal (-pt), Brazil (-br)|
-| • Dutch | Netherlands (de)|
+| • English (en) | United States (us), Australia (-au), Canada (-ca), Great Britain (-gb), India (-in)|
+| • Spanish (es) |Spain (es)|
+| • German (de) | Germany (de)|
+| • French (fr) | France (fr) |
+| • Italian (it) | Italy (it)|
+| • Portuguese (pt) | Portugal (pt), Brazil (br)|
+| • Dutch (de) | Netherlands (de)|
 
 ## Field extraction
 
@@ -196,7 +196,7 @@ See how data, including customer information, vendor details, and line items, is
 | ShippingAddress | String | Explicit shipping address for the customer | |
 | ShippingAddressRecipient | String | Name associated with the ShippingAddress | |
 | PaymentTerm | String | The terms of payment for the invoice | |
-| SubTotal | Number | Subtotal field identified on this invoice | Integer |
+|Sub​Total| Number | Subtotal field identified on this invoice | Integer |
 | TotalTax | Number | Total tax field identified on this invoice | Integer |
 | InvoiceTotal | Number (USD) | Total new charges associated with this invoice | Integer |
 | AmountDue | Number (USD) | Total Amount Due to the vendor | Integer |
@@ -207,7 +207,6 @@ See how data, including customer information, vendor details, and line items, is
 | ServiceStartDate | Date | First date for the service period (for example, a utility bill service period) | yyyy-mm-dd |
 | ServiceEndDate | Date | End date for the service period (for example, a utility bill service period) | yyyy-mm-dd|
 | PreviousUnpaidBalance | Number | Explicit previously unpaid balance | Integer |
-| CurrencyCode | String | The Currency Code associated with an extracted amount | |
 | PaymentOptions | Array | An array that holds Payment Option details such as `IBAN`and `SWIFT` | |
 | TotalDiscount | Number | The total discount applied to an invoice | Integer |
 | TaxItems (en-IN only) | Array | AN array that holds added tax information such as `CGST`, `IGST`, and `SGST`. This line item is currently only available for the en-in locale | |
@@ -235,7 +234,7 @@ The invoice key-value pairs and line items extracted are in the `documentResults
 
 The prebuilt invoice **2022-06-30** and later releases support returns key-value pairs at no extra cost. Key-value pairs are specific spans within the invoice that identify a label or key and its associated response or value. In an invoice, these pairs could be the label and the value the user entered for that field or telephone number. The AI model is trained to extract identifiable keys and values based on a wide variety of document types, formats, and structures.
 
-Keys can also exist in isolation when the model detects that a key exists, with no associated value or when processing optional fields. For example, a middle name field may be left blank on a form in some instances. key-value pairs are always spans of text contained in the document. For documents where the same value is described in different ways, for example, customer/user, the associated key will be either customer or user (based on context).
+Keys can also exist in isolation when the model detects that a key exists, with no associated value or when processing optional fields. For example, a middle name field may be left blank on a form in some instances. key-value pairs are always spans of text contained in the document. For documents where the same value is described in different ways, for example, customer/user, the associated key is either customer or user (based on context).
 
 ::: moniker-end
 
@@ -247,7 +246,7 @@ Keys can also exist in isolation when the model detects that a key exists, with
 
 ## Fields extracted
 
-The Invoice service will extract the text, tables, and 26 invoice fields. Following are the fields extracted from an invoice in the JSON output response (the following output uses this [sample invoice](media/sample-invoice.jpg)).
+The Invoice service extracts the text, tables, and 26 invoice fields. Following are the fields extracted from an invoice in the JSON output response (the following output uses this [sample invoice](media/sample-invoice.jpg)).
 
 |Name| Type | Description | Text | Value (standardized output) |
 |:-----|:----|:----|:----| :----|
@@ -266,7 +265,7 @@ The Invoice service will extract the text, tables, and 26 invoice fields. Follow
 | BillingAddressRecipient | string | Name associated with the BillingAddress | Microsoft Services | |
 | ShippingAddress | string | Explicit shipping address for the customer | 123 Ship Street, Redmond WA, 98052 | |
 | ShippingAddressRecipient | string | Name associated with the ShippingAddress | Microsoft Delivery | |
-| SubTotal | number | Subtotal field identified on this invoice | $100.00 | 100 |
+| Sub​Total | number | Subtotal field identified on this invoice | $100.00 | 100 |
 | TotalTax | number | Total tax field identified on this invoice | $10.00 | 10 |
 | InvoiceTotal | number | Total new charges associated with this invoice | $110.00 | 110 |
 | AmountDue | number | Total Amount Due to the vendor | $610.00 | 610 |
@@ -298,7 +297,7 @@ The JSON output has three parts:
 
 * `"readResults"` node contains all of the recognized text and selection marks. Text is organized by page, then by line, then by individual words.
 * `"pageResults"` node contains the tables and cells extracted with their bounding boxes, confidence, and a reference to the lines and words in "readResults".
-* `"documentResults"` node contains the invoice-specific values and line items that the model discovered. It's where you'll find all the fields from the invoice such as invoice ID, ship to, bill to, customer, total, line items and lots more.
+* `"documentResults"` node contains the invoice-specific values and line items that the model discovered. It's where to find all the fields from the invoice such as invoice ID, ship to, bill to, customer, total, line items and lots more.
 
 ## Migration guide
 

articles/applied-ai-services/form-recognizer/concept-receipt.md

Lines changed: 10 additions & 8 deletions
@@ -7,7 +7,7 @@ manager: nitinme
 ms.service: applied-ai-services
 ms.subservice: forms-recognizer
 ms.topic: conceptual
-ms.date: 01/25/2023
+ms.date: 02/13/2023
 ms.author: lajanuar
 recommendations: false
 ---
@@ -84,7 +84,7 @@ The following tools are supported by Form Recognizer v2.1:
 
 ### Try receipt data extraction
 
-See how data, including time and date of transactions, merchant information, and amount totals, is extracted from receipts. You'll need the following resources:
+See how data, including time and date of transactions, merchant information, and amount totals, is extracted from receipts. You need the following resources:
 
 * An Azure subscription—you can [create one for free](https://azure.microsoft.com/free/cognitive-services/)
 
@@ -141,7 +141,7 @@ See how data, including time and date of transactions, merchant information, and
 
 :::image type="content" source="media/fott-select-form-type.png" alt-text="Screenshot of the select-form-type dropdown menu.":::
 
-1. Select **Run analysis**. The Form Recognizer Sample Labeling tool will call the Analyze Prebuilt API and analyze the document.
+1. Select **Run analysis**. The Form Recognizer Sample Labeling tool calls the Analyze Prebuilt API and analyze the document.
 
 1. View the results - see the key-value pairs extracted, line items, highlighted text extracted and tables detected.
 
@@ -164,11 +164,13 @@ The receipt model supports all English receipts and the following locales:
 |Supported Languages| Details |
 |:-----|:----:|
 |• English| United States (-us), Australia (-au), Great Britain (-gb), India (-in), United Arab Emirates (-ae)|
-|• Dutch| Netherlands (nl)|
-|• French | France (fr) |
-|• Japanese | Japan (ja)|
-|• Portuguese| Portugal (-pt), Brazil (-br)|
-|• Spanish | Spain (es) |
+|• Dutch| Netherlands (nl-nl)|
+|• French | France (fr-fr), Canada (fr-ca) |
+|• German | Germany (de-de) |
+|• Italian | Italy (it-it) |
+|• Japanese | Japan (ja-ja)|
+|• Portuguese| Portugal (pt-pt), Brazil (pt-br)|
+|• Spanish | Spain (es-es) |
 ::: moniker-end
 
 ::: moniker range="form-recog-2.1.0"

articles/applied-ai-services/form-recognizer/language-support.md

Lines changed: 7 additions & 6 deletions
@@ -7,7 +7,7 @@ manager: nitinme
 ms.service: applied-ai-services
 ms.subservice: forms-recognizer
 ms.topic: reference
-ms.date: 01/25/2023
+ms.date: 02/13/2023
 ms.author: lajanuar
 ---
 
@@ -223,17 +223,18 @@ Language| Locale code |
 >[!NOTE]
 > It's not necessary to specify a locale. This is an optional parameter. The Form Recognizer deep-learning technology will auto-detect the language of the text in your image.
 
-Receipt supports all English receipts and the following locales:
-
 |Language| Locale code |
 |:-----|:----:|
 |English (Australia)|`en-au`|
 |English (Canada)|`en-ca`|
 |English (United Kingdom)|`en-gb`|
-|English (India|`en-in`|
+|English (India)|`en-in`|
 |English (United States)| `en-us`|
-|French | `fr` |
-| Spanish | `es` |
+|French (France) | `fr` |
+|French (Canada)| `fr-ca`|
+|German | `de`|
+|Italian| `it`|
+|Spanish | `es` |
 
 ## Business card model
 

articles/applied-ai-services/form-recognizer/whats-new.md

Lines changed: 8 additions & 6 deletions
@@ -7,7 +7,7 @@ manager: nitinme
 ms.service: applied-ai-services
 ms.subservice: forms-recognizer
 ms.topic: conceptual
-ms.date: 02/08/2023
+ms.date: 02/13/2023
 ms.author: lajanuar
 monikerRange: '>=form-recog-2.1.0'
 recommendations: false
@@ -46,11 +46,13 @@ Form Recognizer service is updated on an ongoing basis. Bookmark this page to st
 
 The **prebuilt receipt model** now has added support for the following languages:
 
-* English - United Arab Emirates (en-AE)
-* Dutch - Netherlands (nl-NL)
-* French - Canada (fr-CA)
-* Japanese - Japan (ja-JP)
-* Portuguese - Brazil (pt-BR)
+* English - United Arab Emirates (en-ae)
+* Dutch - Netherlands (nl-nl)
+* French - Canada (fr-ca)
+* German - (de-de)
+* Italian - (it-it)
+* Japanese - Japan (ja-jp)
+* Portuguese - Brazil (pt-br)
 
 * **[Prebuilt invoice model](concept-invoice.md)—additional language support and field extractions**
 
