Skip to content

Commit db32664

Browse files
authored
Update language-support.md
1 parent 28c75dd commit db32664

File tree

1 file changed

+106
-144
lines changed

1 file changed

+106
-144
lines changed

articles/cognitive-services/Computer-vision/language-support.md

Lines changed: 106 additions & 144 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: computer-vision
1010
ms.topic: conceptual
11-
ms.date: 10/27/2021
11+
ms.date: 02/04/2022
1212
ms.author: pafarley
1313
---
1414

@@ -20,159 +20,121 @@ Some capabilities of Computer Vision support multiple languages; any capabilitie
2020

2121
The Computer Vision OCR APIs support many languages. Read can extract text from images and documents with mixed languages, including from the same text line, without requiring a language parameter. See the [Optical Character Recognition (OCR) overview](overview-ocr.md) for more information.
2222

23-
2423
> [!NOTE]
2524
> **Language code optional**
2625
>
2726
> Read OCR's deep-learning-based universal models extract all multi-lingual text in your documents, including text lines with mixed languages, and do not require specifying a language code. Do not provide the language code as the parameter unless you are sure about the language and want to force the service to apply only the relevant model. Otherwise, the service may return incomplete and incorrect text.
2827
29-
The preview model includes any enhancements to the current GA version of the API. See [How to specify the model version](./Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) to use the preview languages and features.
28+
See [How to specify the model version](./Vision-API-How-to-Topics/call-read-api.md#determine-how-to-process-the-data-optional) to use the new languages.
3029

3130
### Handwritten languages
3231

3332
The following table lists the languages supported by Read for handwritten text.
3433

35-
|Language| Language code (optional) | Read |
36-
|:-----|:----:|:-----|
37-
|English|`en`||
38-
|Chinese Simplified |`zh-Hans`|✅ preview |
39-
|French|`fr`|✅ preview|
40-
|German |`de`|✅ preview |
41-
|Italian|`it`|✅ preview |
42-
|Portuguese |`pt`|✅ preview |
43-
|Spanish |`es`|✅ preview |
44-
45-
### Print languages
46-
47-
The following table lists the languages supported by the OCR APIs for printed text.
48-
49-
|Language| Language code (optional) | Read | OCR |
50-
|:-----|:----:|:-----|:---:|
51-
|Afrikaans|`af`|| |
52-
|Albanian |`sq`|| |
53-
|Arabic | `ar`| ||
54-
|Asturian |`ast`|| |
55-
|Azerbaijani (Latin) | `az` | ✅ preview | |
56-
|Basque |`eu`|| |
57-
|Belarusian (Cyrillic) | `be` |✅ preview | |
58-
|Belarusian (Latin) | `be` |✅ preview | |
59-
|Bislama |`bi`|| |
60-
|Bosnian (Latin) |`bs`|✅ preview | |
61-
|Breton |`br`|| |
62-
|Bulgarian |`bg`|✅ preview | |
63-
|Buryat (Cyrillic)|`bua`|✅ preview | |
64-
|Catalan |`ca`|| |
65-
|Cebuano |`ceb`|| |
66-
|Chamorro |`ch`|| |
67-
|Chinese Simplified | `zh-Hans`|||
68-
|Chinese Traditional | `zh-Hant`|||
69-
|Cornish |`kw`|| |
70-
|Corsican |`co`|| |
71-
|Crimean Tatar (Latin)|`crh`|| |
72-
|Croatian |`hr`|✅ preview | |
73-
|Czech | `cs` |||
74-
|Danish | `da` |||
75-
|Dutch | `nl` |||
76-
|English | `en` |||
77-
|Erzya (Cyrillic) |`myv`|✅ preview | |
78-
|Estonian |`et`|| |
79-
|Faroese |`fo`|✅ preview | |
80-
|Fijian |`fj`|| |
81-
|Filipino |`fil`|| |
82-
|Finnish | `fi` |||
83-
|French | `fr` |||
84-
|Friulian | `fur` || |
85-
|Gagauz (Latin) |`gag`|✅ preview | |
86-
|Galician | `gl` || |
87-
|German | `de` |||
88-
|Gilbertese | `gil` || |
89-
|Greek | `el` | ||
90-
|Greenlandic | `kl` || |
91-
|Haitian Creole | `ht` || |
92-
|Hani | `hni` || |
93-
|Hawaiian |`haw`|✅ preview | |
94-
|Hmong Daw (Latin)| `mww` || |
95-
|Hungarian | `hu` |||
96-
|Icelandic |`is`|✅ preview | |
97-
|Inari Sami |`smn`|✅ preview | |
98-
|Indonesian | `id` || |
99-
|Interlingua | `ia` || |
100-
|Inuktitut (Latin) | `iu` || |
101-
|Irish | `ga` || |
102-
|Italian | `it` |||
103-
|Japanese | `ja` |||
104-
|Javanese | `jv` || |
105-
|K'iche' | `quc` || |
106-
|Kabuverdianu | `kea` || |
107-
|Kachin (Latin) | `kac` || |
108-
|Kara-Kalpak (Latin) | `kaa` || |
109-
|Kara-Kalpak (Cyrillic) | `kaa-cyrl` | ✅ preview | |
110-
|Karachay-Balkar |`krc`|✅ preview | |
111-
|Kashubian | `csb` || |
112-
|Kazakh (Cyrillic) |`kk-cyrl`|✅ preview | |
113-
|Kazakh (Latin) |`kk-latn`|✅ preview | |
114-
|Khasi | `kha` || |
115-
|Korean | `ko` |||
116-
|Koryak |`kpy`|✅ preview | |
117-
|Kosraean |`kos`|✅ preview | |
118-
|Kumyk (Cyrillic) |`kum`|✅ preview | |
119-
|Kurdish (Latin)| `ku` || |
120-
|Kyrgyz (Cyrillic) |`ky`|✅ preview | |
121-
|Lakota |`lkt`|✅ preview | |
122-
|Latin|`la`|✅ preview | |
123-
|Lithuanian|`lt`|✅ preview | |
124-
|Lower Sorbian|`dsb`|✅ preview | |
125-
|Lule Sami|`smj`|✅ preview | |
126-
|Luxembourgish | `lb` || |
127-
|Malay (Latin) | `ms` || |
128-
|Maltese|`mt`|✅ preview | |
129-
|Manx | `gv` || |
130-
|Maori|`mi`|✅ preview | |
131-
|Mongolian (Cyrillic)|`mn`|✅ preview | |
132-
|Montenegrin (Cyrillic)|`cnr-cyrl`|✅ preview | |
133-
|Montenegrin (Latin)|`cnr-latn`|✅ preview | |
134-
|Neapolitan | `nap` || |
135-
|Niuean|`niu`|✅ preview | |
136-
|Nogay|`nog`|✅ preview | |
137-
|Northern Sami (Latin)|`sme`|✅ preview | |
138-
|Norwegian | `no` || |
139-
|Occitan | `oc` || |
140-
|Ossetic|`os`|✅ preview | |
141-
|Polish | `pl` |||
142-
|Portuguese | `pt` |||
143-
|Ripuarian|`ksh`|✅ preview | |
144-
|Romanian | `ro` | ✅ preview ||
145-
|Romansh | `rm` || |
146-
|Russian | `ru` |✅ preview ||
147-
|Samoan (Latin)|`sm`|✅ preview | |
148-
|Scots | `sco` || |
149-
|Scottish Gaelic | `gd` || |
150-
|Serbian (Cyrillic) | `sr-cyrl` | ||
151-
|Serbian (Latin) | `sr-latn` | ✅ preview ||
152-
|Skolt Sami|`sms`|✅ preview | |
153-
|Slovak | `sk` | ✅ preview ||
154-
|Slovenian | `sl` |||
155-
|Southern Sami|`sma`|✅ preview | |
156-
|Spanish | `es` |||
157-
|Swahili (Latin) | `sw` || |
158-
|Swedish | `sv` |||
159-
|Tajik (Cyrillic)|`tg`|✅ preview | |
160-
|Tatar (Latin) | `tt` ||
161-
|Tetum | `tet` || |
162-
|Tongan|`to`|✅ preview | |
163-
|Turkish | `tr` |||
164-
|Turkmen (Latin)|`tk`|✅ preview | |
165-
|Tuvan|`tyv`|✅ preview | |
166-
|Upper Sorbian | `hsb` || |
167-
|Uzbek (Cyrillic) | `uz-cyrl` || |
168-
|Uzbek (Latin) | `uz` || |
169-
|Volapük | `vo` || |
170-
|Walser | `wae` || |
171-
|Welsh | `cy` |✅ preview | |
172-
|Western Frisian | `fy` || |
173-
|Yucatec Maya | `yua` || |
174-
|Zhuang | `za` || |
175-
|Zulu | `zu` || |
34+
|Language| Language code (optional) | Language| Language code (optional) |
35+
|:-----|:----:|:-----|:----:|
36+
|English|`en`|Japanese (preview) |`ja`|
37+
|Chinese Simplified (preview) |`zh-Hans`|Korean (preview)|`ko`|
38+
|French (preview) |`fr`|Portuguese (preview)|`pt`|
39+
|German (preview) |`de`|Spanish (preview) |`es`|
40+
|Italian (preview) |`it`|
41+
42+
### Print languages (GA)
43+
44+
This section lists the supported languages in the latest GA version.
45+
46+
|Language| Code (optional) |Language| Code (optional) |
47+
|:-----|:----:|:-----|:----:|
48+
|Afrikaans|`af`|Japanese | `ja` |
49+
|Albanian |`sq`|Javanese | `jv` |
50+
|Asturian |`ast`|K'iche' | `quc` |
51+
|Basque |`eu`|Kabuverdianu | `kea` |
52+
|Bislama |`bi`|Kachin (Latin) | `kac` |
53+
|Breton |`br`|Kara-Kalpak (Latin) | `kaa` |
54+
|Catalan |`ca`|Kashubian | `csb` |
55+
|Cebuano |`ceb`|Khasi | `kha` |
56+
|Chamorro |`ch`|Korean | `ko` |
57+
|Chinese Simplified | `zh-Hans`|Kurdish (Latin) | `ku-latn`
58+
|Chinese Traditional | `zh-Hant`|Luxembourgish | `lb` |
59+
|Cornish |`kw`|Malay (Latin) | `ms` |
60+
|Corsican |`co`|Manx | `gv` |
61+
|Crimean Tatar (Latin)|`crh`|Neapolitan | `nap` |
62+
|Czech | `cs` |Norwegian | `no` |
63+
|Danish | `da` |Occitan | `oc` |
64+
|Dutch | `nl` |Polish | `pl` |
65+
|English | `en` |Portuguese | `pt` |
66+
|Estonian |`et`|Romansh | `rm` |
67+
|Fijian |`fj`|Scots | `sco` |
68+
|Filipino |`fil`|Scottish Gaelic | `gd` |
69+
|Finnish | `fi` |Slovenian | `sl` |
70+
|French | `fr` |Spanish | `es` |
71+
|Friulian | `fur` |Swahili (Latin) | `sw` |
72+
|Galician | `gl` |Swedish | `sv` |
73+
|German | `de` |Tatar (Latin) | `tt` |
74+
|Gilbertese | `gil` |Tetum | `tet` |
75+
|Greenlandic | `kl` |Turkish | `tr` |
76+
|Haitian Creole | `ht` |Upper Sorbian | `hsb` |
77+
|Hani | `hni` |Uzbek (Latin) | `uz` |
78+
|Hmong Daw (Latin)| `mww` |Volapük | `vo` |
79+
|Hungarian | `hu` |Walser | `wae` |
80+
|Indonesian | `id` |Western Frisian | `fy` |
81+
|Interlingua | `ia` |Yucatec Maya | `yua` |
82+
|Inuktitut (Latin) | `iu` |Zhuang | `za` |
83+
|Irish | `ga` |Zulu | `zu` |
84+
|Italian | `it` |
85+
86+
### Print languages (preview)
87+
88+
This section lists the supported languages in the latest preview.
89+
90+
|Language| Code (optional) |Language| Code (optional) |
91+
|:-----|:----:|:-----|:----:|
92+
|Angika (Devanagiri) | `anp`|Lakota | `lkt`
93+
|Arabic | `ar`|Latin | `la`
94+
|Awadhi-Hindi (Devanagiri) | `awa`|Lithuanian | `lt`
95+
|Azerbaijani (Latin) | `az`|Lower Sorbian | `dsb`
96+
|Bagheli | `bfy`|Lule Sami | `smj`
97+
|Belarusian (Cyrillic) | `be`, `be-cyrl`|Mahasu Pahari (Devanagiri) | `bfz`
98+
|Belarusian (Latin) | `be`, `be-latn`|Maltese | `mt`
99+
|Bhojpuri-Hindi (Devanagiri) | `bho`|Malto (Devanagiri) | `kmj`
100+
|Bodo (Devanagiri) | `brx`|Maori | `mi`
101+
|Bosnian (Latin) | `bs`|Marathi | `mr`
102+
|Brajbha | `bra`|Mongolian (Cyrillic) | `mn`
103+
|Bulgarian | `bg`|Montenegrin (Cyrillic) | `cnr-cyrl`
104+
|Bundeli | `bns`|Montenegrin (Latin) | `cnr-latn`
105+
|Buryat (Cyrillic) | `bua`|Nepali | `ne`
106+
|Chamling | `rab`|Niuean | `niu`
107+
|Chhattisgarhi (Devanagiri)| `hne`|Nogay | `nog`
108+
|Croatian | `hr`|Northern Sami (Latin) | `sme`
109+
|Dari | `prs`|Ossetic | `os`
110+
|Dhimal (Devanagiri) | `dhi`|Pashto | `ps`
111+
|Dogri (Devanagiri) | `doi`|Persian | `fa`
112+
|Erzya (Cyrillic) | `myv`|Punjabi (Arabic) | `pa`
113+
|Faroese | `fo`|Ripuarian | `ksh`
114+
|Gagauz (Latin) | `gag`|Romanian | `ro`
115+
|Gondi (Devanagiri) | `gon`|Russian | `ru`
116+
|Gurung (Devanagiri) | `gvr`|Sadri (Devanagiri) | `sck`
117+
|Halbi (Devanagiri) | `hlb`|Samoan (Latin) | `sm`
118+
|Haryanvi | `bgc`|Sanskrit (Devanagari) | `sa`
119+
|Hawaiian | `haw`|Santali(Devanagiri) | `sat`
120+
|Hindi | `hi`|Serbian (Latin) | `sr`, `sr-latn`
121+
|Ho(Devanagiri) | `hoc`|Sherpa (Devanagiri) | `xsr`
122+
|Icelandic | `is`|Sirmauri (Devanagiri) | `srx`
123+
|Inari Sami | `smn`|Skolt Sami | `sms`
124+
|Jaunsari (Devanagiri) | `Jns`|Slovak | `sk`
125+
|Kangri (Devanagiri) | `xnr`|Somali (Arabic) | `so`
126+
|Karachay-Balkar | `krc`|Southern Sami | `sma`
127+
|Kara-Kalpak (Cyrillic) | `kaa-cyrl`|Tajik (Cyrillic) | `tg`
128+
|Kazakh (Cyrillic) | `kk-cyrl`|Thangmi | `thf`
129+
|Kazakh (Latin) | `kk-latn`|Tongan | `to`
130+
|Khaling | `klr`|Turkmen (Latin) | `tk`
131+
|Korku | `kfq`|Tuvan | `tyv`
132+
|Koryak | `kpy`|Urdu | `ur`
133+
|Kosraean | `kos`|Uyghur (Arabic) | `ug`
134+
|Kumyk (Cyrillic) | `kum`|Uzbek (Arabic) | `uz-arab`
135+
|Kurdish (Arabic) | `ku-arab`|Uzbek (Cyrillic) | `uz-cyrl`
136+
|Kurukh (Devanagiri) | `kru`|Welsh | `cy`
137+
|Kyrgyz (Cyrillic) | `ky`
176138

177139
## Image analysis
178140

0 commit comments

Comments
 (0)