@@ -17,14 +17,15 @@ Azure AI Content Understanding provides multilingual support in multiple geograp
17
17
18
18
## Region support
19
19
20
- To use the Azure AI Content Understanding service, you must create your Azure AI Service resource in a supported region. The Content Understanding features are available in the following regions:
20
+ To use Azure AI Content Understanding, create your Azure AI Service resource in a supported region. All data at rest is stored in the selected region. For lower latency or increased capacity, you can specify the [ processing location ] ( ./concepts/analyzers-overview.md#data-processing-location ) where analysis occurs. Content Understanding is available in the following regions. When the processing location is set to ` geography ` or ` data zone ` , the corresponding locations are shown below.
21
21
22
- | Identifier | Region | Geography | Data Zone |
23
- | --- | --- | --- | --- |
24
- | ` westus ` | West US | United States | United States |
25
- | ` swedencentral ` | Sweden Central | Sweden | European Union |
26
- | ` australiaeast ` | Australia East | Australia | N/A |
22
+ | Identifier | Region | Geography | Data Zone |
23
+ | ----------------- | ---------------- | ----------------- | ------------------ |
24
+ | ` westus ` | West US | United States | United States |
25
+ | ` swedencentral ` | Sweden Central | Sweden | European Union |
26
+ | ` australiaeast ` | Australia East | Australia | N/A † |
27
27
28
+ † Australia East does not support data zone as a processing location.
28
29
29
30
## Language support
30
31
@@ -36,97 +37,89 @@ Content Understanding applies [Azure OpenAI models](../openai/overview.md) which
36
37
37
38
The following table lists the supported languages/locales for ** printed** text.
38
39
39
- | ** Language** | ** Language code** | ** Language** | ** Language code** |
40
- | :-----| :-----| :-----| :-----|
41
- | Afrikaans| ` af ` | Kurdish (Arabic)| ` ku-arab ` |
42
- | Albanian| ` sq ` | Kurdish (Latin)| ` ku ` , ` ku-latn ` |
43
- | Angika| ` anp ` | Kurukh| ` kru ` |
44
- | Arabic| ` ar ` | Kölsch| ` ksh ` |
45
- | Asturian| ` ast ` | Lakota| ` lkt ` |
46
- | Awadhi| ` awa ` | Latin| ` la ` |
47
- | Azerbaijani| ` az ` | Lithuanian| ` lt ` |
48
- | Bagheli| ` bfy ` | Lower Sorbian| ` dsb ` |
49
- | Basque| ` eu ` | Lule Sami| ` smj ` |
50
- | Belarusian (Cyrillic)| ` be ` , ` be-cyrl ` | Luxembourgish| ` lb ` |
51
- | Belarusian (Latin)| ` be-latn ` | Mahasu Pahari| ` bfz ` |
52
- | Bhojpuri| ` bho ` | Malay| ` ms ` |
53
- | Bislama| ` bi ` | Malto| ` kmj ` |
54
- | Bodo| ` brx ` | Manx| ` gv ` |
55
- | Bosnian| ` bs ` | Maori| ` mi ` |
56
- | Braj| ` bra ` | Marathi| ` mr ` |
57
- | Breton| ` br ` | Mongolian| ` mn ` |
58
- | Bulgarian| ` bg ` | Montenegrin (Cyrillic)| ` cnr-cyrl ` |
59
- | Bundeli| ` bns ` | Montenegrin (Latin)| ` cnr ` , ` cnr-latn ` |
60
- | Buriat| ` bua ` | Neapolitan| ` nap ` |
61
- | Camling| ` rab ` | Nepali| ` ne ` |
62
- | Catalan| ` ca ` | Niuean| ` niu ` |
63
- | Cebuano| ` ceb ` | Nogai| ` nog ` |
64
- | Chamorro| ` ch ` | Northern Sami| ` sme ` |
65
- | Chhattisgarhi| ` hne ` | Norwegian| ` no ` |
66
- | Chinese (Simplified)| ` zh ` , ` zh-hans ` | Occitan| ` oc ` |
67
- | Chinese (Traditional)| ` zh-hant ` | Ossetian| ` os ` |
68
- | Cornish| ` kw ` | Panjabi| ` pa ` |
69
- | Corsican| ` co ` | Persian| ` fa ` |
70
- | Crimean Tatar| ` crh ` | Polish| ` pl ` |
71
- | Croatian| ` hr ` | Portuguese| ` pt ` |
72
- | Czech| ` cs ` | Pushto| ` ps ` |
73
- | Danish| ` da ` | Romanian| ` ro ` |
74
- | Dari| ` prs ` | Romansh| ` rm ` |
75
- | Dhimal| ` dhi ` | Russian| ` ru ` |
76
- | Dogri| ` doi ` | Sadri| ` sck ` |
77
- | Dutch| ` nl ` | Samoan| ` sm ` |
78
- | English| ` en-US ` , ` en-AU ` , ` en-CA ` ,` en-GB ` , ` en-IN ` | Sanskrit| ` sa ` |
79
- | Erzya| ` myv ` | Santali| ` sat ` |
80
- | Estonian| ` et ` | Scots| ` sco ` |
81
- | Faroese| ` fo ` | Scottish Gaelic| ` gd ` |
82
- | Fijian| ` fj ` | Serbian (Latin)| ` sr ` , ` sr-latn ` |
83
- | Filipino| ` fil ` | Sirmauri| ` srx ` |
84
- | Finnish| ` fi ` | Skolt Sami| ` sms ` |
85
- | French| ` fr ` | Slovak| ` sk ` |
86
- | Friulian| ` fur ` | Slovenian| ` sl ` |
87
- | Gagauz| ` gag ` | Somali| ` so ` |
88
- | Galician| ` gl ` | Southern Sami| ` sma ` |
89
- | German| ` de ` | Spanish| ` es ` |
90
- | Gilbertese| ` gil ` | Swahili| ` sw ` |
91
- | Gondi| ` gon ` | Swedish| ` sv ` |
92
- | Gurung| ` gvr ` | Tajik| ` tg ` |
93
- | Haitian| ` ht ` | Tatar| ` tt ` |
94
- | Halbi| ` hlb ` | Tetum| ` tet ` |
95
- | Hani| ` hni ` | Thangmi| ` thf ` |
96
- | Haryanvi| ` bgc ` | Thai| ` th ` |
97
- | Hawaiian| ` haw ` | Tonga| ` to ` |
98
- | Hindi| ` hi ` | Turkish| ` tr ` |
99
- | Hmong Daw| ` mww ` | Tuvinian| ` tyv ` |
100
- | Ho| ` hoc ` | Uighur| ` ug ` |
101
- | Hungarian| ` hu ` | Upper Sorbian| ` hsb ` |
102
- | Icelandic| ` is ` | Urdu| ` ur ` |
103
- | Inari Sami| ` smn ` | Uzbek (Arabic)| ` uz-arab ` |
104
- | Indonesian| ` id ` | Uzbek (Cyrillic)| ` uz-cyrl ` |
105
- | Interlingua| ` ia ` | Uzbek (Latin)| ` uz ` , ` uz-latn ` |
106
- | Inuktitut| ` iu ` | Volapük| ` vo ` |
107
- | Irish| ` ga ` | Walser| ` wae ` |
108
- | Italian| ` it ` | Welsh| ` cy ` |
109
- | Japanese| ` ja ` | Western Frisian| ` fy ` |
110
- | Jaunsari| ` jns ` | Yucateco| ` yua ` |
111
- | Javanese| ` jv ` | Zhuang| ` za ` |
112
- | K'iche'| ` quc ` | Zulu| ` zu ` |
113
- | Kabuverdianu| ` kea ` ||
114
- | Kachin| ` kac ` ||
115
- | Kalaallisut| ` kl ` ||
116
- | Kangri| ` xnr ` ||
117
- | Kara-Kalpak (Cyrillic)| ` kaa-cyrl ` ||
118
- | Kara-Kalpak (Latin)| ` kaa ` , ` kaa-latn ` ||
119
- | Karachay-Balkar| ` krc ` ||
120
- | Kashubian| ` csb ` ||
121
- | Kazakh (Cyrillic)| ` kk-cyrl ` ||
122
- | Kazakh (Latin)| ` kk ` , ` kk-latn ` ||
123
- | Khaling| ` klr ` ||
124
- | Khasi| ` kha ` ||
125
- | Kirghiz| ` ky ` ||
126
- | Korean| ` ko ` ||
127
- | Korku| ` kfq ` ||
128
- | Koryak| ` kpy ` ||
129
- | Kosraean| ` kos ` ||
40
+ | ** Language** | ** Language code** | ** Language** | ** Language code** |
41
+ | :-----| :-----| :-----| :-----|
42
+ | Afrikaans| ` af ` | Kazakh (Latin)| ` kk, kk-latn ` |
43
+ | Albanian| ` sq ` | Khaling| ` klr ` |
44
+ | Angika| ` anp ` | Khasi| ` kha ` |
45
+ | Arabic| ` ar ` | Kirghiz| ` ky ` |
46
+ | Asturian| ` ast ` | Korean| ` ko ` |
47
+ | Awadhi| ` awa ` | Korku| ` kfq ` |
48
+ | Azerbaijani| ` az ` | Koryak| ` kpy ` |
49
+ | Bagheli| ` bfy ` | Kosraean| ` kos ` |
50
+ | Basque| ` eu ` | Kurdish (Arabic)| ` ku-arab ` |
51
+ | Belarusian (Cyrillic)| ` be, be-cyrl ` | Kurdish (Latin)| ` ku, ku-latn ` |
52
+ | Belarusian (Latin)| ` be-latn ` | Kurukh| ` kru ` |
53
+ | Bhojpuri| ` bho ` | Kölsch| ` ksh ` |
54
+ | Bislama| ` bi ` | Lakota| ` lkt ` |
55
+ | Bodo| ` brx ` | Latin| ` la ` |
56
+ | Bosnian| ` bs ` | Lithuanian| ` lt ` |
57
+ | Braj| ` bra ` | Lower Sorbian| ` dsb ` |
58
+ | Breton| ` br ` | volapük| ` smj ` |
59
+ | Bulgarian| ` bg ` | Luxembourgish| ` lb ` |
60
+ | Bundeli| ` bns ` | Mahasu Pahari| ` bfz ` |
61
+ | Buriat| ` bua ` | Malay| ` ms ` |
62
+ | Camling| ` rab ` | Malto| ` kmj ` |
63
+ | Catalan| ` ca ` | Manx| ` gv ` |
64
+ | Cebuano| ` ceb ` | Maori| ` mi ` |
65
+ | Chamorro| ` ch ` | Marathi| ` mr ` |
66
+ | Chhattisgarhi| ` hne ` | Mongolian| ` mn ` |
67
+ | Chinese (Simplified)| ` zh, zh-hans ` | Montenegrin (Cyrillic)| ` cnr-cyrl ` |
68
+ | Chinese (Traditional)| ` zh-hant ` | Montenegrin (Latin)| ` cnr, cnr-latn ` |
69
+ | Cornish| ` kw ` | Neapolitan| ` nap ` |
70
+ | Corsican| ` co ` | Nepali| ` ne ` |
71
+ | Crimean Tatar| ` crh ` | Niuean| ` niu ` |
72
+ | Croatian| ` hr ` | Nogai| ` nog ` |
73
+ | Czech| ` cs ` | Northern Sami| ` sme ` |
74
+ | Danish| ` da ` | Norwegian| ` no ` |
75
+ | Dari| ` prs ` | Occitan| ` oc ` |
76
+ | Dhimal| ` dhi ` | Ossetian| ` os ` |
77
+ | Dogri| ` doi ` | Panjabi| ` pa ` |
78
+ | Dutch| ` nl ` | Persian| ` fa ` |
79
+ | English| ` en-US, en-AU, en-CA,en-GB, en-IN ` | Polish| ` pl ` |
80
+ | Erzya| ` myv ` | Portuguese| ` pt ` |
81
+ | Estonian| ` et ` | Pushto| ` ps ` |
82
+ | Faroese| ` fo ` | Romanian| ` ro ` |
83
+ | Fijian| ` fj ` | Romansh| ` rm ` |
84
+ | Filipino| ` fil ` | Russian| ` ru ` |
85
+ | Finnish| ` fi ` | Sadri| ` sck ` |
86
+ | French| ` fr ` | Samoan| ` sm ` |
87
+ | Friulian| ` fur ` | Sanskrit| ` sa ` |
88
+ | Gagauz| ` gag ` | Santali| ` sat ` |
89
+ | Galician| ` gl ` | Scots| ` sco ` |
90
+ | German| ` de ` | Scottish Gaelic| ` gd ` |
91
+ | Gilbertese| ` gil ` | Serbian (Latin)| ` sr, sr-latn ` |
92
+ | Gondi| ` gon ` | Sirmauri| ` srx ` |
93
+ | Gurung| ` gvr ` | Skolt Sami| ` sms ` |
94
+ | Haitian| ` ht ` | Slovak| ` sk ` |
95
+ | Halbi| ` hlb ` | Slovenian| ` sl ` |
96
+ | Hani| ` hni ` | Somali| ` so ` |
97
+ | Haryanvi| ` bgc ` | Southern Sami| ` sma ` |
98
+ | Hawaiian| ` haw ` | Spanish| ` es ` |
99
+ | Hindi| ` hi ` | Swahili| ` sw ` |
100
+ | Hmong Daw| ` mww ` | Swedish| ` sv ` |
101
+ | Ho| ` hoc ` | Tajik| ` tg ` |
102
+ | Hungarian| ` hu ` | Tatar| ` tt ` |
103
+ | Icelandic| ` is ` | Tetum| ` tet ` |
104
+ | Inari Sami| ` smn ` | Thangmi| ` thf ` |
105
+ | Indonesian| ` id ` | Thai| ` th ` |
106
+ | Interlingua| ` ia ` | Tonga| ` to ` |
107
+ | Inuktitut| ` iu ` | Turkish| ` tr ` |
108
+ | Irish| ` ga ` | Tuvinian| ` tyv ` |
109
+ | Italian| ` it ` | Uighur| ` ug ` |
110
+ | Japanese| ` ja ` | Upper Sorbian| ` hsb ` |
111
+ | Jaunsari| ` jns ` | Urdu| ` ur ` |
112
+ | Javanese| ` jv ` | Uzbek (Arabic)| ` uz-arab ` |
113
+ | K'iche'| ` quc ` | Uzbek (Cyrillic)| ` uz-cyrl ` |
114
+ | Kabuverdianu| ` kea ` | Uzbek (Latin)| ` uz, uz-latn ` |
115
+ | Kachin| ` kac ` | Volapük| ` vo ` |
116
+ | Kalaallisut| ` kl ` | Walser| ` wae ` |
117
+ | Kangri| ` xnr ` | Welsh| ` cy ` |
118
+ | Kara-Kalpak (Cyrillic)| ` kaa-cyrl ` | Western Frisian| ` fy ` |
119
+ | Kara-Kalpak (Latin)| ` kaa, kaa-latn ` | Yucateco| ` yua ` |
120
+ | Karachay-Balkar| ` krc ` | Zhuang| ` za ` |
121
+ | Kashubian| ` csb ` | Zulu| ` zu ` |
122
+ | Kazakh (Cyrillic)| ` kk-cyrl ` |||
130
123
131
124
The following table lists the supported languages/locales for ** handwritten** text.
132
125
@@ -142,61 +135,129 @@ The following table lists the supported languages/locales for **handwritten** te
142
135
143
136
### Speech transcription
144
137
145
- Content Understanding supports the full set of [ Azure AI speech to text languages] ( ../speech-service/language-support.md ) . Content Understanding uses [ fast transcriptions] ( ../speech-service/speech-to-text.md#fast-transcription ) for supported languages to reduce processing latency.
138
+ Content Understanding applies [ Azure AI speech to text] ( ../speech-service/speech-to-text ) to transcribe spoken words in the input. For a subset of supported languages, it uses [ fast transcription] ( ../speech-service/speech-to-text.md#fast-transcription ) to reduce processing latency.
139
+
140
+ The following table lists the supported languages/locales for fast transcription.
141
+
142
+ | ** Language** | ** Language code** | ** Language** | ** Language code** |
143
+ | :-----| :----:| :-----| :----:|
144
+ | Chinese (Mandarin, Simplified) | ` zh-CN ` | Indonesian (Indonesia) | ` id-ID ` |
145
+ | Danish (Denmark) | ` da-DK ` | Italian (Italy) | ` it-IT ` |
146
+ | English (India) | ` en-IN ` | Japanese (Japan) | ` ja-JP ` |
147
+ | English (United Kingdom) | ` en-GB ` | Korean (Korea) | ` ko-KR ` |
148
+ | English (United States) | ` en-US ` | Polish (Poland) | ` pl-PL ` |
149
+ | Finnish (Finland) | ` fi-FI ` | Portuguese (Brazil) | ` pt-BR ` |
150
+ | French (France) | ` fr-FR ` | Portuguese (Portugal) | ` pt-PT ` |
151
+ | German (Germany) | ` de-DE ` | Spanish (Mexico) | ` es-MX ` |
152
+ | Hebrew (Israel) | ` he-IL ` | Spanish (Spain) | ` es-ES ` |
153
+ | Hindi (India) | ` hi-IN ` | Swedish (Sweden) | ` sv-SE ` |
154
+
155
+ The following table lists all supported languages/locales.
146
156
147
- > [ !NOTE]
148
- > Only spoken words are transcribed. Music, sound effects, and ambient noise are ignored.
157
+ | ** Language** | ** Language code** | ** Language** | ** Language code** |
158
+ | :-----| :----:| :-----| :----:|
159
+ | Afrikaans (South Africa) | ` af-ZA ` | Hungarian (Hungary) | ` hu-HU ` |
160
+ | Albanian (Albania) | ` sq-AL ` | Icelandic (Iceland) | ` is-IS ` |
161
+ | Amharic (Ethiopia) | ` am-ET ` | Indonesian (Indonesia) | ` id-ID ` |
162
+ | Arabic (Algeria) | ` ar-DZ ` | Irish (Ireland) | ` ga-IE ` |
163
+ | Arabic (Bahrain) | ` ar-BH ` | isiZulu (South Africa) | ` zu-ZA ` |
164
+ | Arabic (Egypt) | ` ar-EG ` | Italian (Italy) | ` it-IT ` |
165
+ | Arabic (Iraq) | ` ar-IQ ` | Italian (Switzerland) | ` it-CH ` |
166
+ | Arabic (Israel) | ` ar-IL ` | Japanese (Japan) | ` ja-JP ` |
167
+ | Arabic (Jordan) | ` ar-JO ` | Javanese (Latin, Indonesia) | ` jv-ID ` |
168
+ | Arabic (Kuwait) | ` ar-KW ` | Kannada (India) | ` kn-IN ` |
169
+ | Arabic (Lebanon) | ` ar-LB ` | Kazakh (Kazakhstan) | ` kk-KZ ` |
170
+ | Arabic (Libya) | ` ar-LY ` | Khmer (Cambodia) | ` km-KH ` |
171
+ | Arabic (Morocco) | ` ar-MA ` | Kiswahili (Kenya) | ` sw-KE ` |
172
+ | Arabic (Oman) | ` ar-OM ` | Kiswahili (Tanzania) | ` sw-TZ ` |
173
+ | Arabic (Palestinian Authority) | ` ar-PS ` | Korean (Korea) | ` ko-KR ` |
174
+ | Arabic (Qatar) | ` ar-QA ` | Lao (Laos) | ` lo-LA ` |
175
+ | Arabic (Saudi Arabia) | ` ar-SA ` | Latvian (Latvia) | ` lv-LV ` |
176
+ | Arabic (Syria) | ` ar-SY ` | Lithuanian (Lithuania) | ` lt-LT ` |
177
+ | Arabic (Tunisia) | ` ar-TN ` | Macedonian (North Macedonia) | ` mk-MK ` |
178
+ | Arabic (United Arab Emirates) | ` ar-AE ` | Malay (Malaysia) | ` ms-MY ` |
179
+ | Arabic (Yemen) | ` ar-YE ` | Malayalam (India) | ` ml-IN ` |
180
+ | Armenian (Armenia) | ` hy-AM ` | Maltese (Malta) | ` mt-MT ` |
181
+ | Assamese (India) | ` as-IN ` | Marathi (India) | ` mr-IN ` |
182
+ | Azerbaijani (Latin, Azerbaijan) | ` az-AZ ` | Mongolian (Mongolia) | ` mn-MN ` |
183
+ | Basque | ` eu-ES ` | Nepali (Nepal) | ` ne-NP ` |
184
+ | Bengali (India) | ` bn-IN ` | Norwegian Bokmål (Norway) | ` nb-NO ` |
185
+ | Bosnian (Bosnia and Herzegovina) | ` bs-BA ` | Odia (India) | ` or-IN ` |
186
+ | Bulgarian (Bulgaria) | ` bg-BG ` | Pashto (Afghanistan) | ` ps-AF ` |
187
+ | Burmese (Myanmar) | ` my-MM ` | Persian (Iran) | ` fa-IR ` |
188
+ | Catalan | ` ca-ES ` | Polish (Poland) | ` pl-PL ` |
189
+ | Chinese (Cantonese, Simplified) | ` yue-CN ` | Portuguese (Brazil) | ` pt-BR ` |
190
+ | Chinese (Cantonese, Traditional) | ` zh-HK ` | Portuguese (Portugal) | ` pt-PT ` |
191
+ | Chinese (Jilu Mandarin, Simplified) | ` zh-CN-shandong ` | Punjabi (India) | ` pa-IN ` |
192
+ | Chinese (Mandarin, Simplified) | ` zh-CN ` | Romanian (Romania) | ` ro-RO ` |
193
+ | Chinese (Southwestern Mandarin, Simplified) | ` zh-CN-sichuan ` | Russian (Russia) | ` ru-RU ` |
194
+ | Chinese (Taiwanese Mandarin, Traditional) | ` zh-TW ` | Serbian (Cyrillic, Serbia) | ` sr-RS ` |
195
+ | Chinese (Wu, Simplified) | ` wuu-CN ` | Sinhala (Sri Lanka) | ` si-LK ` |
196
+ | Croatian (Croatia) | ` hr-HR ` | Slovak (Slovakia) | ` sk-SK ` |
197
+ | Czech (Czechia) | ` cs-CZ ` | Slovenian (Slovenia) | ` sl-SI ` |
198
+ | Danish (Denmark) | ` da-DK ` | Somali (Somalia) | ` so-SO ` |
199
+ | Dutch (Belgium) | ` nl-BE ` | Spanish (Argentina) | ` es-AR ` |
200
+ | Dutch (Netherlands) | ` nl-NL ` | Spanish (Bolivia) | ` es-BO ` |
201
+ | English (Australia) | ` en-AU ` | Spanish (Chile) | ` es-CL ` |
202
+ | English (Canada) | ` en-CA ` | Spanish (Colombia) | ` es-CO ` |
203
+ | English (Ghana) | ` en-GH ` | Spanish (Costa Rica) | ` es-CR ` |
204
+ | English (Hong Kong SAR) | ` en-HK ` | Spanish (Cuba) | ` es-CU ` |
205
+ | English (India) | ` en-IN ` | Spanish (Dominican Republic) | ` es-DO ` |
206
+ | English (Ireland) | ` en-IE ` | Spanish (Ecuador) | ` es-EC ` |
207
+ | English (Kenya) | ` en-KE ` | Spanish (El Salvador) | ` es-SV ` |
208
+ | English (New Zealand) | ` en-NZ ` | Spanish (Equatorial Guinea) | ` es-GQ ` |
209
+ | English (Nigeria) | ` en-NG ` | Spanish (Guatemala) | ` es-GT ` |
210
+ | English (Philippines) | ` en-PH ` | Spanish (Honduras) | ` es-HN ` |
211
+ | English (Singapore) | ` en-SG ` | Spanish (Mexico) | ` es-MX ` |
212
+ | English (South Africa) | ` en-ZA ` | Spanish (Nicaragua) | ` es-NI ` |
213
+ | English (Tanzania) | ` en-TZ ` | Spanish (Panama) | ` es-PA ` |
214
+ | English (United Kingdom) | ` en-GB ` | Spanish (Paraguay) | ` es-PY ` |
215
+ | English (United States) | ` en-US ` | Spanish (Peru) | ` es-PE ` |
216
+ | Estonian (Estonia) | ` et-EE ` | Spanish (Puerto Rico) | ` es-PR ` |
217
+ | Filipino (Philippines) | ` fil-PH ` | Spanish (Spain) | ` es-ES ` |
218
+ | Finnish (Finland) | ` fi-FI ` | Spanish (United States)<sup >1</sup > | ` es-US ` |
219
+ | French (Belgium) | ` fr-BE ` | Spanish (Uruguay) | ` es-UY ` |
220
+ | French (Canada)<sup >1</sup > | ` fr-CA ` | Spanish (Venezuela) | ` es-VE ` |
221
+ | French (France) | ` fr-FR ` | Swedish (Sweden) | ` sv-SE ` |
222
+ | French (Switzerland) | ` fr-CH ` | Tamil (India) | ` ta-IN ` |
223
+ | Galician | ` gl-ES ` | Telugu (India) | ` te-IN ` |
224
+ | Georgian (Georgia) | ` ka-GE ` | Thai (Thailand) | ` th-TH ` |
225
+ | German (Austria) | ` de-AT ` | Turkish (Türkiye) | ` tr-TR ` |
226
+ | German (Germany) | ` de-DE ` | Ukrainian (Ukraine) | ` uk-UA ` |
227
+ | German (Switzerland) | ` de-CH ` | Urdu (India) | ` ur-IN ` |
228
+ | Greek (Greece) | ` el-GR ` | Uzbek (Latin, Uzbekistan) | ` uz-UZ ` |
229
+ | Gujarati (India) | ` gu-IN ` | Vietnamese (Vietnam) | ` vi-VN ` |
230
+ | Hebrew (Israel) | ` he-IL ` | Welsh (United Kingdom) | ` cy-GB ` |
231
+ | Hindi (India) | ` hi-IN ` |||
149
232
150
233
151
234
### Field value normalization
152
235
153
236
Different locales have different ways to represent numbers, date, and time. Content Understanding supports normalizing these different representations into standardized ISO forms for the following locales.
154
237
155
- | ** Language** | ** Language code** |
156
- | :-----------| :-----------------|
157
- | Arabic| ` ar-AE ` , ` ar-EG ` , ` ar-SA ` |
158
- | Bengla| ` bn-IN ` |
159
- | Bulgarian| ` bg-BG ` |
160
- | Catalan| ` ca-ES ` |
161
- | Chinese (Simplified) | ` zh-CN ` |
162
- | Chinese (Traditional)| ` zh-TW ` |
163
- | Croatian| ` hr-HR ` |
164
- | Czech| ` cs-CZ ` |
165
- | Danish| ` da-DK ` |
166
- | Dutch| ` nl-NL ` |
167
- | English| ` en-AU ` , ` en-CA ` , ` en-GB ` , ` en-IL ` , ` en-IN ` , ` en-MY ` , ` en-US ` |
168
- | Estonian| ` et-EE ` |
169
- | Finnish| ` fi-FI ` |
170
- | French| ` fr-CA ` , ` fr-FR ` |
171
- | Galician| ` gl-ES ` |
172
- | German| ` de-DE ` |
173
- | Greek| ` el-GR ` |
174
- | Hebrew| ` he-IL ` |
175
- | Hindi| ` hi-IN ` |
176
- | Hungarian| ` hu-HU ` |
177
- | Icelandic| ` is-IS ` |
178
- | Indonesian| ` id-ID ` |
179
- | Italian| ` it-IT ` |
180
- | Japanese| ` ja-JP ` |
181
- | Korean| ` ko-KR ` |
182
- | Latvian| ` lv-LV ` |
183
- | Lithuanian| ` lt-LT ` |
184
- | Malay| ` ms-MY ` |
185
- | Marathi| ` mr-IN ` |
186
- | Nepali| ` ne-IN ` |
187
- | Norwegian| ` no-NO ` |
188
- | Polish| ` pl-PL ` |
189
- | Portuguese| ` pt-BR ` , ` pt-PT ` |
190
- | Romanian| ` ro-RO ` |
191
- | Russian| ` ru-RU ` |
192
- | Serbian| ` sr-RS ` |
193
- | Slovak| ` sk-SK ` |
194
- | Slovenian| ` sl-SI ` |
195
- | Spanish| ` es-AR ` , ` es-ES ` , ` es-MX ` |
196
- | Swedish| ` sv-SE ` |
197
- | Tamil| ` ta-IN ` |
198
- | Thai| ` th-TH ` |
199
- | Turkish| ` tr-TR ` |
200
- | Ukrainian| ` uk-UA ` |
201
- | Vietnamese| ` vi-VN ` |
238
+ | ** Language** | ** Language code** | ** Language** | ** Language code** |
239
+ | :-----| :----:| :-----| :----:|
240
+ | Arabic| ` ar-AE ` , ` ar-EG ` , ` ar-SA ` | Japanese| ` ja-JP ` |
241
+ | Bengla| ` bn-IN ` | Korean| ` ko-KR ` |
242
+ | Bulgarian| ` bg-BG ` | Latvian| ` lv-LV ` |
243
+ | Catalan| ` ca-ES ` | Lithuanian| ` lt-LT ` |
244
+ | Chinese (Simplified) | ` zh-CN ` | Malay| ` ms-MY ` |
245
+ | Chinese (Traditional)| ` zh-TW ` | Marathi| ` mr-IN ` |
246
+ | Croatian| ` hr-HR ` | Nepali| ` ne-IN ` |
247
+ | Czech| ` cs-CZ ` | Norwegian| ` no-NO ` |
248
+ | Danish| ` da-DK ` | Polish| ` pl-PL ` |
249
+ | Dutch| ` nl-NL ` | Portuguese| ` pt-BR ` , ` pt-PT ` |
250
+ | English| ` en-AU ` , ` en-CA ` , ` en-GB ` , ` en-IL ` , ` en-IN ` , ` en-MY ` , ` en-US ` | Romanian| ` ro-RO ` |
251
+ | Estonian| ` et-EE ` | Russian| ` ru-RU ` |
252
+ | Finnish| ` fi-FI ` | Serbian| ` sr-RS ` |
253
+ | French| ` fr-CA ` , ` fr-FR ` | Slovak| ` sk-SK ` |
254
+ | Galician| ` gl-ES ` | Slovenian| ` sl-SI ` |
255
+ | German| ` de-DE ` | Spanish| ` es-AR ` , ` es-ES ` , ` es-MX ` |
256
+ | Greek| ` el-GR ` | Swedish| ` sv-SE ` |
257
+ | Hebrew| ` he-IL ` | Tamil| ` ta-IN ` |
258
+ | Hindi| ` hi-IN ` | Thai| ` th-TH ` |
259
+ | Hungarian| ` hu-HU ` | Turkish| ` tr-TR ` |
260
+ | Icelandic| ` is-IS ` | Ukrainian| ` uk-UA ` |
261
+ | Indonesian| ` id-ID ` | Vietnamese| ` vi-VN ` |
262
+ | Italian| ` it-IT ` |||
202
263
0 commit comments