You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/document-intelligence/language-support-custom.md
+39-36Lines changed: 39 additions & 36 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -37,17 +37,17 @@ ms.date: 11/15/2023
37
37
38
38
Azure AI Document Intelligence models provide multilingual document processing support. Our language support capabilities enable your users to communicate with your applications in natural ways and empower global outreach. Custom models are trained using your labeled datasets to extract distinct data from structured, semi-structured, and unstructured documents specific to your use cases. Standalone custom models can be combined to create composed models. The following tables list the available language and locale support by model and feature:
39
39
40
-
## [Custom classifier](#tab/custom-classifier)
41
-
42
-
***custom classifier model***
40
+
## Custom classifier
43
41
44
42
:::moniker range="doc-intel-3.1.0"
43
+
45
44
| Language—Locale code | Default |
46
45
|:----------------------|:---------|
47
46
| English (United States)—en-US| English (United States)—en-US|
48
47
:::moniker-end
49
48
50
49
:::moniker range="doc-intel-4.0.0"
50
+
51
51
|Language| Code (optional) |
52
52
|:-----|:----:|
53
53
|Afrikaans|`af`|
@@ -97,25 +97,14 @@ Azure AI Document Intelligence models provide multilingual document processing s
97
97
|Ukrainian|`uk`|
98
98
|Urdu|`ur`|
99
99
|Vietnamese|`vi`|
100
-
:::moniker-end
101
100
102
-
## [Custom neural](#tab/custom-neural)
103
-
104
-
***custom neural model***
105
-
106
-
#### Handwritten text
101
+
:::moniker-end
107
102
108
-
The following table lists the supported languages for extracting handwritten texts.
103
+
## Custom neural
109
104
110
-
|Language| Language code (optional) | Language| Language code (optional) |
111
-
|:-----|:----:|:-----|:----:|
112
-
|English|`en`|Japanese |`ja`|
113
-
|Chinese Simplified |`zh-Hans`|Korean |`ko`|
114
-
|French |`fr`|Portuguese |`pt`|
115
-
|German |`de`|Spanish |`es`|
116
-
|Italian |`it`|
105
+
:::moniker range=">=doc-intel-3.1.0"
117
106
118
-
#### Printed text
107
+
##[**Printed text**](#tab/printed)
119
108
120
109
The following table lists the supported languages for printed text.
121
110
@@ -125,8 +114,8 @@ The following table lists the supported languages for printed text.
125
114
|Albanian|`sq`|
126
115
|Arabic|`ar`|
127
116
|Bulgarian|`bg`|
128
-
|Chinese (Han (Simplified variant))|`zh-Hans`|
129
-
|Chinese (Han (Traditional variant))|`zh-Hant`|
117
+
|Chinese Simplified|`zh-Hans`|
118
+
|Chinese Traditional|`zh-Hant`|
130
119
|Croatian|`hr`|
131
120
|Czech|`cs`|
132
121
|Danish|`da`|
@@ -169,7 +158,19 @@ The following table lists the supported languages for printed text.
169
158
|Urdu|`ur`|
170
159
|Vietnamese|`vi`|
171
160
172
-
:::moniker range=">=doc-intel-3.1.0"
161
+
## [**Handwritten text**](#tab/handwritten)
162
+
163
+
The following table lists the supported languages for extracting **handwritten** texts.
164
+
165
+
|Language| Language code (optional) | Language| Language code (optional) |
166
+
|:-----|:----:|:-----|:----:|
167
+
|English|`en`|Japanese |`ja`|
168
+
|Chinese Simplified |`zh-Hans`|Korean |`ko`|
169
+
|French |`fr`|Portuguese |`pt`|
170
+
|German |`de`|Spanish |`es`|
171
+
|Italian |`it`|
172
+
173
+
---
173
174
174
175
Neural models support added languages for the `v3.1` and later APIs.
175
176
@@ -184,25 +185,14 @@ Neural models support added languages for the `v3.1` and later APIs.
184
185
185
186
:::moniker-end
186
187
187
-
## [Custom template](#tab/custom-template)
188
-
189
-
***custom template model***
188
+
## Custom template
190
189
191
-
#### Handwritten text
190
+
:::moniker range=">=doc-intel-3.0.0"
192
191
193
-
The following table lists the supported languages for extracting handwritten texts.
194
-
195
-
|Language| Language code (optional) | Language| Language code (optional) |
196
-
|:-----|:----:|:-----|:----:|
197
-
|English|`en`|Japanese |`ja`|
198
-
|Chinese Simplified |`zh-Hans`|Korean |`ko`|
199
-
|French |`fr`|Portuguese |`pt`|
200
-
|German |`de`|Spanish |`es`|
201
-
|Italian |`it`|
192
+
## [**Printed**](#tab/printed)
202
193
203
-
#### Printed text
194
+
The following table lists the supported languages for **printed**text.</br>
204
195
205
-
The following table lists the supported languages for printed text.
206
196
:::row:::
207
197
:::column span="":::
208
198
|Language| Code (optional) |
@@ -522,4 +512,17 @@ The following table lists the supported languages for printed text.
522
512
:::column-end:::
523
513
:::row-end:::
524
514
515
+
## [**Handwritten**](#tab/handwritten)
516
+
517
+
The following table lists the supported languages for extracting handwritten texts.
518
+
519
+
|Language| Language code (optional) | Language| Language code (optional) |
Copy file name to clipboardExpand all lines: articles/ai-services/document-intelligence/language-support-ocr.md
+13-12Lines changed: 13 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -53,19 +53,20 @@ Azure AI Document Intelligence models provide multilingual document processing s
53
53
54
54
::: moniker-end
55
55
56
-
## Read model
57
-
58
-
##### Model ID: **prebuilt-read**
59
-
60
56
> [!NOTE]
61
57
> **Language code optional**
62
58
>
63
59
> * Document Intelligence's deep learning based universal models extract all multi-lingual text in your documents, including text lines with mixed languages, and don't require specifying a language code.
64
-
> * Don't provide the language code as the parameter unless you are sure about the language and want to force the service to apply only the relevant model. Otherwise, the service may return incomplete and incorrect text.
60
+
>
61
+
> * Don't provide the language code as the parameter unless you are sure of the language and want to force the service to apply only the relevant model. Otherwise, the service may return incomplete and incorrect text.
65
62
>
66
63
> * Also, It's not necessary to specify a locale. This is an optional parameter. The Document Intelligence deep-learning technology will auto-detect the text language in your image.
67
64
68
-
### [Read: handwritten text](#tab/read-hand)
65
+
## Read model
66
+
67
+
##### Model ID: **prebuilt-read**
68
+
69
+
### [**Read: handwritten text**](#tab/read-hand)
69
70
70
71
:::moniker range="doc-intel-4.0.0"
71
72
@@ -107,15 +108,15 @@ The following table lists read model language support for extracting and analyzi
107
108
108
109
:::moniker-end
109
110
110
-
### [Read: printed text](#tab/read-print)
111
+
### [**Read: printed text**](#tab/read-print)
111
112
112
113
:::moniker range=">=doc-intel-3.1.0"
113
114
114
115
The following table lists read model language support for extracting and analyzing **printed** text. </br>
115
116
116
117
:::row:::
117
118
:::column span="":::
118
-
|Language| Code (optional) |
119
+
|Language| Code (optional) |
119
120
|:-----|:----:|
120
121
|Abaza|abq|
121
122
|Abkhazian|ab|
@@ -194,7 +195,7 @@ The following table lists read model language support for extracting and analyzi
194
195
|Finnish|fi|
195
196
:::column-end:::
196
197
:::column span="":::
197
-
|Language| Code (optional) |
198
+
|Language| Code (optional) |
198
199
|:-----|:----:|
199
200
|Fon|fon|
200
201
|French|fr|
@@ -622,7 +623,7 @@ The following table lists read model language support for extracting and analyzi
622
623
623
624
:::moniker-end
624
625
625
-
### [Read: language detection](#tab/read-detection)
626
+
### [**Read: language detection**](#tab/read-detection)
626
627
627
628
The [Read model API](concept-read.md) supports **language detection** for the following languages in your documents. This list can include languages not currently supported for text extraction.
628
629
@@ -768,7 +769,7 @@ The [Read model API](concept-read.md) supports **language detection** for the fo
0 commit comments