Commit 67ed6e9

Merge pull request #281242 from jboback/language-detection-scripts-overview
Language support, overview edit
2 parents 72dde59 + 62a4c07 commit 67ed6e9

2 files changed

+128
-129
lines changed

articles/ai-services/language-service/language-detection/language-support.md

Lines changed: 127 additions & 128 deletions
@@ -22,128 +22,127 @@ If you have content expressed in a less frequently used language, you can try La
2222

2323
## Languages supported by Language Detection
2424

25-
| Language | Language Code |
26-
|---------------------|---------------|
27-
| Afrikaans | `af` |
28-
| Albanian | `sq` |
29-
| Amharic | `am` |
30-
| Arabic | `ar` |
31-
| Armenian | `hy` |
32-
| Assamese | `as` |
33-
| Azerbaijani | `az` |
34-
| Bashkir | `ba` |
35-
| Basque | `eu` |
36-
| Belarusian | `be` |
37-
| Bengali | `bn` |
38-
| Bosnian | `bs` |
39-
| Bulgarian | `bg` |
40-
| Burmese | `my` |
41-
| Catalan | `ca` |
42-
| Central Khmer | `km` |
43-
| Chinese | `zh` |
44-
| Chinese Simplified | `zh_chs` |
45-
| Chinese Traditional | `zh_cht` |
46-
| Chuvash | `cv` |
47-
| Corsican | `co` |
48-
| Croatian | `hr` |
49-
| Czech | `cs` |
50-
| Danish | `da` |
51-
| Dari | `prs` |
52-
| Divehi | `dv` |
53-
| Dutch | `nl` |
54-
| English | `en` |
55-
| Esperanto | `eo` |
56-
| Estonian | `et` |
57-
| Faroese | `fo` |
58-
| Fijian | `fj` |
59-
| Finnish | `fi` |
60-
| French | `fr` |
61-
| Galician | `gl` |
62-
| Georgian | `ka` |
63-
| German | `de` |
64-
| Greek | `el` |
65-
| Gujarati | `gu` |
66-
| Haitian | `ht` |
67-
| Hausa | `ha` |
68-
| Hebrew | `he` |
69-
| Hindi | `hi` |
70-
| Hmong Daw | `mww` |
71-
| Hungarian | `hu` |
72-
| Icelandic | `is` |
73-
| Igbo | `ig` |
74-
| Indonesian | `id` |
75-
| Inuktitut | `iu` |
76-
| Irish | `ga` |
77-
| Italian | `it` |
78-
| Japanese | `ja` |
79-
| Javanese | `jv` |
80-
| Kannada | `kn` |
81-
| Kazakh | `kk` |
82-
| Kinyarwanda | `rw` |
83-
| Kirghiz | `ky` |
84-
| Korean | `ko` |
85-
| Kurdish | `ku` |
86-
| Lao | `lo` |
87-
| Latin | `la` |
88-
| Latvian | `lv` |
89-
| Lithuanian | `lt` |
90-
| Luxembourgish | `lb` |
91-
| Macedonian | `mk` |
92-
| Malagasy | `mg` |
93-
| Malay | `ms` |
94-
| Malayalam | `ml` |
95-
| Maltese | `mt` |
96-
| Maori | `mi` |
97-
| Marathi | `mr` |
98-
| Mongolian | `mn` |
99-
| Nepali | `ne` |
100-
| Norwegian | `no` |
101-
| Norwegian Nynorsk | `nn` |
102-
| Odia | `or` |
103-
| Pasht | `ps` |
104-
| Persian | `fa` |
105-
| Polish | `pl` |
106-
| Portuguese | `pt` |
107-
| Punjabi | `pa` |
108-
| Queretaro Otomi | `otq` |
109-
| Romanian | `ro` |
110-
| Russian | `ru` |
111-
| Samoan | `sm` |
112-
| Serbian | `sr` |
113-
| Shona | `sn` |
114-
| Sindhi | `sd` |
115-
| Sinhala | `si` |
116-
| Slovak | `sk` |
117-
| Slovenian | `sl` |
118-
| Somali | `so` |
119-
| Spanish | `es` |
120-
| Sundanese | `su` |
121-
| Swahili | `sw` |
122-
| Swedish | `sv` |
123-
| Tagalog | `tl` |
124-
| Tahitian | `ty` |
125-
| Tajik | `tg` |
126-
| Tamil | `ta` |
127-
| Tatar | `tt` |
128-
| Telugu | `te` |
129-
| Thai | `th` |
130-
| Tibetan | `bo` |
131-
| Tigrinya | `ti` |
132-
| Tongan | `to` |
133-
| Turkish | `tr` |
134-
| Turkmen | `tk` |
135-
| Upper Sorbian | `hsb` |
136-
| Uyghur | `ug` |
137-
| Ukrainian | `uk` |
138-
| Urdu | `ur` |
139-
| Uzbek | `uz` |
140-
| Vietnamese | `vi` |
141-
| Welsh | `cy` |
142-
| Xhosa | `xh` |
143-
| Yiddish | `yi` |
144-
| Yoruba | `yo` |
145-
| Yucatec Maya | `yua` |
146-
| Zulu | `zu` |
25+
| Language | Language Code | Supported Script Code |
26+
|---------------------|---------------|-----------------------|
27+
| Afrikaans | `af` | `Latn` |
28+
| Albanian | `sq` | `Latn` |
29+
| Amharic | `am` | `Ethi` |
30+
| Arabic | `ar` | `Arab` |
31+
| Armenian | `hy` | `Armn` |
32+
| Assamese | `as` | `Beng`, `Latn` |
33+
| Azerbaijani | `az` | `Latn` |
34+
| Bashkir | `ba` | `Cyrl` |
35+
| Basque | `eu` | `Latn` |
36+
| Belarusian | `be` | `Cyrl` |
37+
| Bengali | `bn` | `Beng`, `Latn` |
38+
| Bosnian | `bs` | `Latn` |
39+
| Bulgarian | `bg` | `Cyrl` |
40+
| Burmese | `my` | `Mymr` |
41+
| Catalan | `ca` | `Latn` |
42+
| Central Khmer | `km` | `Khmr` |
43+
| Chinese Simplified | `zh_chs` | `Hans` |
44+
| Chinese Traditional | `zh_cht` | `Hant` |
45+
| Chuvash | `cv` | `Cyrl` |
46+
| Corsican | `co` | `Latn` |
47+
| Croatian | `hr` | `Latn` |
48+
| Czech | `cs` | `Latn` |
49+
| Danish | `da` | `Latn` |
50+
| Dari | `prs` | `Arab` |
51+
| Divehi | `dv` | `Thaa` |
52+
| Dutch | `nl` | `Latn` |
53+
| English | `en` | `Latn` |
54+
| Esperanto | `eo` | `Latn` |
55+
| Estonian | `et` | `Latn` |
56+
| Faroese | `fo` | `Latn` |
57+
| Fijian | `fj` | `Latn` |
58+
| Finnish | `fi` | `Latn` |
59+
| French | `fr` | `Latn` |
60+
| Galician | `gl` | `Latn` |
61+
| Georgian | `ka` | `Geor` |
62+
| German | `de` | `Latn` |
63+
| Greek | `el` | `Grek` |
64+
| Gujarati | `gu` | `Gujr`, `Latn` |
65+
| Haitian | `ht` | `Latn` |
66+
| Hausa | `ha` | `Latn` |
67+
| Hebrew | `he` | `Hebr` |
68+
| Hindi | `hi` | `Deva`, `Latn` |
69+
| Hmong Daw | `mww` | `Latn` |
70+
| Hungarian | `hu` | `Latn` |
71+
| Icelandic | `is` | `Latn` |
72+
| Igbo | `ig` | `Latn` |
73+
| Indonesian | `id` | `Latn` |
74+
| Inuktitut | `iu` | `Cans`, `Latn` |
75+
| Irish | `ga` | `Latn` |
76+
| Italian | `it` | `Latn` |
77+
| Japanese | `ja` | `Jpan` |
78+
| Javanese | `jv` | `Latn` |
79+
| Kannada | `kn` | `Knda`, `Latn` |
80+
| Kazakh | `kk` | `Cyrl` |
81+
| Kinyarwanda | `rw` | `Latn` |
82+
| Kirghiz | `ky` | `Cyrl` |
83+
| Korean | `ko` | `Hang` |
84+
| Kurdish | `ku` | `Arab` |
85+
| Lao | `lo` | `Laoo` |
86+
| Latin | `la` | `Latn` |
87+
| Latvian | `lv` | `Latn` |
88+
| Lithuanian | `lt` | `Latn` |
89+
| Luxembourgish | `lb` | `Latn` |
90+
| Macedonian | `mk` | `Cyrl` |
91+
| Malagasy | `mg` | `Latn` |
92+
| Malay | `ms` | `Latn` |
93+
| Malayalam | `ml` | `Mlym`, `Latn` |
94+
| Maltese | `mt` | `Latn` |
95+
| Maori | `mi` | `Latn` |
96+
| Marathi | `mr` | `Deva`, `Latn` |
97+
| Mongolian | `mn` | `Cyrl` |
98+
| Nepali | `ne` | `Deva` |
99+
| Norwegian | `no` | `Latn` |
100+
| Norwegian Nynorsk | `nn` | `Latn` |
101+
| Odia | `or` | `Orya`, `Latn` |
102+
| Pashto | `ps` | `Arab` |
103+
| Persian | `fa` | `Arab` |
104+
| Polish | `pl` | `Latn` |
105+
| Portuguese | `pt` | `Latn` |
106+
| Punjabi | `pa` | `Guru`, `Latn` |
107+
| Queretaro Otomi | `otq` | `Latn` |
108+
| Romanian | `ro` | `Latn` |
109+
| Russian | `ru` | `Cyrl` |
110+
| Samoan | `sm` | `Latn` |
111+
| Serbian | `sr` | `Latn`, `Cyrl` |
112+
| Shona | `sn` | `Latn` |
113+
| Sindhi | `sd` | `Arab` |
114+
| Sinhala | `si` | `Sinh` |
115+
| Slovak | `sk` | `Latn` |
116+
| Slovenian | `sl` | `Latn` |
117+
| Somali | `so` | `Latn` |
118+
| Spanish | `es` | `Latn` |
119+
| Sundanese | `su` | `Latn` |
120+
| Swahili | `sw` | `Latn` |
121+
| Swedish | `sv` | `Latn` |
122+
| Tagalog | `tl` | `Latn` |
123+
| Tahitian | `ty` | `Latn` |
124+
| Tajik | `tg` | `Cyrl` |
125+
| Tamil | `ta` | `Taml`, `Latn` |
126+
| Tatar | `tt` | `Cyrl` |
127+
| Telugu | `te` | `Telu`, `Latn` |
128+
| Thai | `th` | `Thai` |
129+
| Tibetan | `bo` | `Tibt` |
130+
| Tigrinya | `ti` | `Ethi` |
131+
| Tongan | `to` | `Latn` |
132+
| Turkish | `tr` | `Latn` |
133+
| Turkmen | `tk` | `Latn` |
134+
| Upper Sorbian | `hsb` | `Latn` |
135+
| Uyghur | `ug` | `Arab` |
136+
| Ukrainian | `uk` | `Cyrl` |
137+
| Urdu | `ur` | `Arab`, `Latn` |
138+
| Uzbek | `uz` | `Latn` |
139+
| Vietnamese | `vi` | `Latn` |
140+
| Welsh | `cy` | `Latn` |
141+
| Xhosa | `xh` | `Latn` |
142+
| Yiddish | `yi` | `Hebr` |
143+
| Yoruba | `yo` | `Latn` |
144+
| Yucatec Maya | `yua` | `Latn` |
145+
| Zulu | `zu` | `Latn` |
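
As a minimal sketch, rows in the three-column format above can be read into a lookup from language code to script codes. The `parse_rows` helper and the `TABLE_ROWS` sample are hypothetical names used for illustration only; the check on script codes assumes the ISO 15924 four-letter form (one uppercase letter followed by three lowercase letters).

```python
import re

# A few rows copied from the table above; the full table has the same shape.
TABLE_ROWS = """\
| Language | Language Code | Supported Script Code |
|---------------------|---------------|-----------------------|
| Assamese | `as` | `Beng`, `Latn` |
| Hindi | `hi` | `Deva`, `Latn` |
| Serbian | `sr` | `Latn`, `Cyrl` |
| Zulu | `zu` | `Latn` |
"""

# ISO 15924 codes are four letters: one uppercase, then three lowercase.
SCRIPT_CODE = re.compile(r"^[A-Z][a-z]{3}$")


def parse_rows(markdown: str) -> dict[str, list[str]]:
    """Map each language code to its list of script codes (hypothetical helper)."""
    scripts_by_language: dict[str, list[str]] = {}
    for line in markdown.splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        if len(cells) != 3:
            continue
        _name, code_cell, scripts_cell = cells
        # Script codes are wrapped in backticks and separated by commas.
        codes = re.findall(r"`([^`]+)`", scripts_cell)
        if not codes or not all(SCRIPT_CODE.match(c) for c in codes):
            continue  # skips the header and separator rows
        scripts_by_language[code_cell.strip("`")] = codes
    return scripts_by_language


if __name__ == "__main__":
    for language, scripts in parse_rows(TABLE_ROWS).items():
        print(language, scripts)
```

A lookup like this makes it easy to spot rows whose script code does not match the expected four-letter pattern before the table is published.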
147146

148147
## Romanized Indic Languages supported by Language Detection
149148

@@ -166,21 +165,21 @@ If you have content expressed in a less frequently used language, you can try La
166165

167166
| Language | Language code | Scripts |
168167
| ------------------------------------- | ---------- | -------------- |
169-
| Bengali (Bengali-Assamese) | `as` | `Latn`, `Beng` |
170-
| Bengali (Bangla) | `bn` | `Latn`, `Beng` |
168+
| Assamese | `as` | `Latn`, `Beng` |
169+
| Bengali | `bn` | `Latn`, `Beng` |
171170
| Gujarati | `gu` | `Latn`, `Gujr` |
172171
| Hindi | `hi` | `Latn`, `Deva` |
173172
| Kannada | `kn` | `Latn`, `Knda` |
174173
| Malayalam | `ml` | `Latn`, `Mlym` |
175174
| Marathi | `mr` | `Latn`, `Deva` |
176175
| Odia | `or` | `Latn`, `Orya` |
177-
| Gurmukhi | `pa` | `Latn`, `Guru` |
176+
| Punjabi | `pa` | `Latn`, `Guru` |
178177
| Tamil | `ta` | `Latn`, `Taml` |
179178
| Telugu | `te` | `Latn`, `Telu` |
180-
| Arabic | `ar` | `Latn`, `Arab` |
181-
| Cyrillic | `tt` | `Latn`, `Cyrl` |
179+
| Urdu | `ur` | `Latn`, `Arab` |
180+
| Tatar | `tt` | `Latn`, `Cyrl` |
182181
| Serbian | `sr` | `Latn`, `Cyrl` |
183-
| Unified Canadian Aboriginal Syllabics | `iu` | `Latn`, `Cans` |
182+
| Inuktitut | `iu` | `Latn`, `Cans` |
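
The rows above pair `Latn` with one other script for each language. As a rough sketch, a caller could use that pairing to find the non-Latin script of a language that also supports romanized input. The `ROMANIZED_SCRIPTS` dictionary holds only a few rows copied from the table, and `native_script` is a hypothetical helper, not part of any Azure SDK.

```python
from typing import Optional

# A subset of the rows above: language code -> (scripts listed in the table).
ROMANIZED_SCRIPTS = {
    "hi": ("Latn", "Deva"),
    "pa": ("Latn", "Guru"),
    "ur": ("Latn", "Arab"),
    "iu": ("Latn", "Cans"),
}


def native_script(language_code: str) -> Optional[str]:
    """Return the non-Latin script listed for a language, or None if unknown."""
    scripts = ROMANIZED_SCRIPTS.get(language_code)
    if scripts is None:
        return None
    # The first script that is not Latin is treated as the native one.
    return next((s for s in scripts if s != "Latn"), None)


if __name__ == "__main__":
    print(native_script("hi"))
    print(native_script("en"))
```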
184183

185184
## Next steps
186185

articles/ai-services/language-service/language-detection/overview.md

Lines changed: 1 addition & 1 deletion
@@ -14,7 +14,7 @@ ms.custom: language-service-language-detection
1414

1515
# What is language detection in Azure AI Language?
1616

17-
Language detection is one of the features offered by [Azure AI Language](../overview.md), a collection of machine learning and AI algorithms in the cloud for developing intelligent applications that involve written language. Language detection is able to detect more than 100 languages in their primary script. In addition, it offers [script detection](./how-to/call-api.md#script-name-and-script-code) to detect multiple scripts per language according to the [ISO 15924 standard](https://wikipedia.org/wiki/ISO_15924) for a [select number of languages](./language-support.md#script-detection).
17+
Language detection is one of the features offered by [Azure AI Language](../overview.md), a collection of machine learning and AI algorithms in the cloud for developing intelligent applications that involve written language. Language detection is able to detect more than 100 languages in their primary script. In addition, it offers [script detection](./how-to/call-api.md#script-name-and-script-code) to detect supported scripts for each detected language according to the [ISO 15924 standard](https://wikipedia.org/wiki/ISO_15924) for a [select number of languages](./language-support.md#script-detection) supported by Azure AI Language Service.
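
To illustrate how a detected script might be consumed, the sketch below reads a script code out of a single detection result. The shape of `SAMPLE_DOCUMENT`, including the `detectedLanguage`, `iso6391Name`, and `scriptCode` field names, is an assumption made for this example and should be checked against the API reference linked above; `script_code_of` is a hypothetical helper.

```python
from typing import Optional

# Assumed shape of one document in a language detection response.
# Field names here are illustrative and not confirmed by this diff.
SAMPLE_DOCUMENT = {
    "id": "1",
    "detectedLanguage": {
        "name": "Hindi",
        "iso6391Name": "hi",
        "confidenceScore": 1.0,
        "script": "Latin",
        "scriptCode": "Latn",
    },
}


def script_code_of(document: dict) -> Optional[str]:
    """Return the ISO 15924 script code if the result carries one, else None."""
    detected = document.get("detectedLanguage", {})
    return detected.get("scriptCode")


if __name__ == "__main__":
    print(script_code_of(SAMPLE_DOCUMENT))
```

Returning `None` rather than raising keeps the helper usable for results that carry no script information.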
1818

1919
This documentation contains the following types of articles:
2020
