Skip to content

Commit c2097a0

Browse files
authored
Merge pull request #283748 from MicrosoftDocs/main
[Out of Band Publish] 08/06 - 13:00 PST
2 parents f0f16da + 48ded78 commit c2097a0

File tree

65 files changed

+1161
-1092
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

65 files changed

+1161
-1092
lines changed

articles/ai-services/language-service/language-detection/language-support.md

Lines changed: 127 additions & 128 deletions
Original file line numberDiff line numberDiff line change
@@ -22,128 +22,127 @@ If you have content expressed in a less frequently used language, you can try La
2222

2323
## Languages supported by Language Detection
2424

25-
| Language | Language Code |
26-
|---------------------|---------------|
27-
| Afrikaans | `af` |
28-
| Albanian | `sq` |
29-
| Amharic | `am` |
30-
| Arabic | `ar` |
31-
| Armenian | `hy` |
32-
| Assamese | `as` |
33-
| Azerbaijani | `az` |
34-
| Bashkir | `ba` |
35-
| Basque | `eu` |
36-
| Belarusian | `be` |
37-
| Bengali | `bn` |
38-
| Bosnian | `bs` |
39-
| Bulgarian | `bg` |
40-
| Burmese | `my` |
41-
| Catalan | `ca` |
42-
| Central Khmer | `km` |
43-
| Chinese | `zh` |
44-
| Chinese Simplified | `zh_chs` |
45-
| Chinese Traditional | `zh_cht` |
46-
| Chuvash | `cv` |
47-
| Corsican | `co` |
48-
| Croatian | `hr` |
49-
| Czech | `cs` |
50-
| Danish | `da` |
51-
| Dari | `prs` |
52-
| Divehi | `dv` |
53-
| Dutch | `nl` |
54-
| English | `en` |
55-
| Esperanto | `eo` |
56-
| Estonian | `et` |
57-
| Faroese | `fo` |
58-
| Fijian | `fj` |
59-
| Finnish | `fi` |
60-
| French | `fr` |
61-
| Galician | `gl` |
62-
| Georgian | `ka` |
63-
| German | `de` |
64-
| Greek | `el` |
65-
| Gujarati | `gu` |
66-
| Haitian | `ht` |
67-
| Hausa | `ha` |
68-
| Hebrew | `he` |
69-
| Hindi | `hi` |
70-
| Hmong Daw | `mww` |
71-
| Hungarian | `hu` |
72-
| Icelandic | `is` |
73-
| Igbo | `ig` |
74-
| Indonesian | `id` |
75-
| Inuktitut | `iu` |
76-
| Irish | `ga` |
77-
| Italian | `it` |
78-
| Japanese | `ja` |
79-
| Javanese | `jv` |
80-
| Kannada | `kn` |
81-
| Kazakh | `kk` |
82-
| Kinyarwanda | `rw` |
83-
| Kirghiz | `ky` |
84-
| Korean | `ko` |
85-
| Kurdish | `ku` |
86-
| Lao | `lo` |
87-
| Latin | `la` |
88-
| Latvian | `lv` |
89-
| Lithuanian | `lt` |
90-
| Luxembourgish | `lb` |
91-
| Macedonian | `mk` |
92-
| Malagasy | `mg` |
93-
| Malay | `ms` |
94-
| Malayalam | `ml` |
95-
| Maltese | `mt` |
96-
| Maori | `mi` |
97-
| Marathi | `mr` |
98-
| Mongolian | `mn` |
99-
| Nepali | `ne` |
100-
| Norwegian | `no` |
101-
| Norwegian Nynorsk | `nn` |
102-
| Odia | `or` |
103-
| Pasht | `ps` |
104-
| Persian | `fa` |
105-
| Polish | `pl` |
106-
| Portuguese | `pt` |
107-
| Punjabi | `pa` |
108-
| Queretaro Otomi | `otq` |
109-
| Romanian | `ro` |
110-
| Russian | `ru` |
111-
| Samoan | `sm` |
112-
| Serbian | `sr` |
113-
| Shona | `sn` |
114-
| Sindhi | `sd` |
115-
| Sinhala | `si` |
116-
| Slovak | `sk` |
117-
| Slovenian | `sl` |
118-
| Somali | `so` |
119-
| Spanish | `es` |
120-
| Sundanese | `su` |
121-
| Swahili | `sw` |
122-
| Swedish | `sv` |
123-
| Tagalog | `tl` |
124-
| Tahitian | `ty` |
125-
| Tajik | `tg` |
126-
| Tamil | `ta` |
127-
| Tatar | `tt` |
128-
| Telugu | `te` |
129-
| Thai | `th` |
130-
| Tibetan | `bo` |
131-
| Tigrinya | `ti` |
132-
| Tongan | `to` |
133-
| Turkish | `tr` |
134-
| Turkmen | `tk` |
135-
| Upper Sorbian | `hsb` |
136-
| Uyghur | `ug` |
137-
| Ukrainian | `uk` |
138-
| Urdu | `ur` |
139-
| Uzbek | `uz` |
140-
| Vietnamese | `vi` |
141-
| Welsh | `cy` |
142-
| Xhosa | `xh` |
143-
| Yiddish | `yi` |
144-
| Yoruba | `yo` |
145-
| Yucatec Maya | `yua` |
146-
| Zulu | `zu` |
25+
| Language | Language Code | Supported Script Code |
26+
|---------------------|---------------|-----------------------|
27+
| Afrikaans | `af` | `Latn` |
28+
| Albanian | `sq` | `Latn` |
29+
| Amharic | `am` | `Ethi` |
30+
| Arabic | `ar` | `Arab` |
31+
| Armenian | `hy` | `Armn` |
32+
| Assamese | `as` | `Beng`, `Latn` |
33+
| Azerbaijani | `az` | `Latn` |
34+
| Bashkir | `ba` | `Cyrl` |
35+
| Basque | `eu` | `Latn` |
36+
| Belarusian | `be` | `Cyrl` |
37+
| Bengali | `bn` | `Beng`, `Latn` |
38+
| Bosnian | `bs` | `Latn` |
39+
| Bulgarian | `bg` | `Cyrl` |
40+
| Burmese | `my` | `Mymr` |
41+
| Catalan | `ca` | `Latn` |
42+
| Central Khmer | `km` | `Khmr` |
43+
| Chinese Simplified | `zh_chs` | `Hans` |
44+
| Chinese Traditional | `zh_cht` | `Hant` |
45+
| Chuvash | `cv` | `Cyrl` |
46+
| Corsican | `co` | `Latn` |
47+
| Croatian | `hr` | `Latn` |
48+
| Czech | `cs` | `Latn` |
49+
| Danish | `da` | `Latn` |
50+
| Dari | `prs` | `Arab` |
51+
| Divehi | `dv` | `Thaa` |
52+
| Dutch | `nl` | `Latn` |
53+
| English | `en` | `Latn` |
54+
| Esperanto | `eo` | `Latn` |
55+
| Estonian | `et` | `Latn` |
56+
| Faroese | `fo` | `Latn` |
57+
| Fijian | `fj` | `Latn` |
58+
| Finnish | `fi` | `Latn` |
59+
| French | `fr` | `Latn` |
60+
| Galician | `gl` | `Latn` |
61+
| Georgian | `ka` | `Gujr` |
62+
| German | `de` | `Latn` |
63+
| Greek | `el` | `Grek` |
64+
| Gujarati | `gu` | `Gujr`, `Latn` |
65+
| Haitian | `ht` | `Latn` |
66+
| Hausa | `ha` | `Latn` |
67+
| Hebrew | `he` | `Hebr` |
68+
| Hindi | `hi` | `Deva`, `Latn` |
69+
| Hmong Daw | `mww` | `Latn` |
70+
| Hungarian | `hu` | `Latn` |
71+
| Icelandic | `is` | `Latn` |
72+
| Igbo | `ig` | `Latn` |
73+
| Indonesian | `id` | `Latn` |
74+
| Inuktitut | `iu` | `Cans`, `Latn` |
75+
| Irish | `ga` | `Latn` |
76+
| Italian | `it` | `Latn` |
77+
| Japanese | `ja` | `Jpan` |
78+
| Javanese | `jv` | `Latn` |
79+
| Kannada | `kn` | `Knda`, `Latn` |
80+
| Kazakh | `kk` | `Cyrl` |
81+
| Kinyarwanda | `rw` | `Latn` |
82+
| Kirghiz | `ky` | `Cyrl` |
83+
| Korean | `ko` | `Hang` |
84+
| Kurdish | `ku` | `Arab` |
85+
| Lao | `lo` | `Laoo` |
86+
| Latin | `la` | `Latn` |
87+
| Latvian | `lv` | `Latn` |
88+
| Lithuanian | `lt` | `Latn` |
89+
| Luxembourgish | `lb` | `Latn` |
90+
| Macedonian | `mk` | `Cyrl` |
91+
| Malagasy | `mg` | `Latn` |
92+
| Malay | `ms` | `Latn` |
93+
| Malayalam | `ml` | `Mlym`, `Latn` |
94+
| Maltese | `mt` | `Latn` |
95+
| Maori | `mi` | `Latn` |
96+
| Marathi | `mr` | `Deva`, `Latn` |
97+
| Mongolian | `mn` | `Cyrl` |
98+
| Nepali | `ne` | `Deva` |
99+
| Norwegian | `no` | `Latn` |
100+
| Norwegian Nynorsk | `nn` | `Latn` |
101+
| Odia | `or` | `Orya`, `Latn` |
102+
| Pashto | `ps` | `Arab` |
103+
| Persian | `fa` | `Arab` |
104+
| Polish | `pl` | `Latn` |
105+
| Portuguese | `pt` | `Latn` |
106+
| Punjabi | `pa` | `Guru`, `Latn` |
107+
| Queretaro Otomi | `otq` | `Latn` |
108+
| Romanian | `ro` | `Latn` |
109+
| Russian | `ru` | `Cyrl` |
110+
| Samoan | `sm` | `Latn` |
111+
| Serbian | `sr` | `Latn`, `Cyrl` |
112+
| Shona | `sn` | `Latn` |
113+
| Sindhi | `sd` | `Arab` |
114+
| Sinhala | `si` | `Sinh` |
115+
| Slovak | `sk` | `Latn` |
116+
| Slovenian | `sl` | `Latn` |
117+
| Somali | `so` | `Latn` |
118+
| Spanish | `es` | `Latn` |
119+
| Sundanese | `su` | `Latn` |
120+
| Swahili | `sw` | `Latn` |
121+
| Swedish | `sv` | `Latn` |
122+
| Tagalog | `tl` | `Latn` |
123+
| Tahitian | `ty` | `Latn` |
124+
| Tajik | `tg` | `Cyrl` |
125+
| Tamil | `ta` | `Taml`, `Latn` |
126+
| Tatar | `tt` | `Cyrl` |
127+
| Telugu | `te` | `Telu`, `Latn` |
128+
| Thai | `th` | `Thai` |
129+
| Tibetan | `bo` | `Tibt` |
130+
| Tigrinya | `ti` | `Ethi` |
131+
| Tongan | `to` | `Latn` |
132+
| Turkish | `tr` | `Latn` |
133+
| Turkmen | `tk` | `Latn` |
134+
| Upper Sorbian | `hsb` | `Latn` |
135+
| Uyghur | `ug` | `Arab` |
136+
| Ukrainian | `uk` | `Latn` |
137+
| Urdu | `ur` | `Arab`, `Latn` |
138+
| Uzbek | `uz` | `Latn` |
139+
| Vietnamese | `vi` | `Latn` |
140+
| Welsh | `cy` | `Latn` |
141+
| Xhosa | `xh` | `Latn` |
142+
| Yiddish | `yi` | `Hebr` |
143+
| Yoruba | `yo` | `Latn` |
144+
| Yucatec Maya | `yua` | `Latn` |
145+
| Zulu | `zu` | `Latn` |
147146

148147
## Romanized Indic Languages supported by Language Detection
149148

@@ -166,21 +165,21 @@ If you have content expressed in a less frequently used language, you can try La
166165

167166
| Language | Script code | Scripts |
168167
| ------------------------------------- | ---------- | -------------- |
169-
| Bengali (Bengali-Assamese) | `as` | `Latn`, `Beng` |
170-
| Bengali (Bangla) | `bn` | `Latn`, `Beng` |
168+
| Assamese | `as` | `Latn`, `Beng` |
169+
| Bengali | `bn` | `Latn`, `Beng` |
171170
| Gujarati | `gu` | `Latn`, `Gujr` |
172171
| Hindi | `hi` | `Latn`, `Deva` |
173172
| Kannada | `kn` | `Latn`, `Knda` |
174173
| Malayalam | `ml` | `Latn`, `Mlym` |
175174
| Marathi | `mr` | `Latn`, `Deva` |
176175
| Oriya | `or` | `Latn`, `Orya` |
177-
| Gurmukhi | `pa` | `Latn`, `Guru` |
176+
| Punjabi | `pa` | `Latn`, `Guru` |
178177
| Tamil | `ta` | `Latn`, `Taml` |
179178
| Telugu | `te` | `Latn`, `Telu` |
180-
| Arabic | `ar` | `Latn`, `Arab` |
181-
| Cyrillic | `tt` | `Latn`, `Cyrl` |
179+
| Urdu | `ur` | `Latn`, `Arab` |
180+
| Tatar | `tt` | `Latn`, `Cyrl` |
182181
| Serbian | `sr` | `Latn`, `Cyrl` |
183-
| Unified Canadian Aboriginal Syllabics | `iu` | `Latn`, `Cans` |
182+
| Inuktitut | `iu` | `Latn`, `Cans` |
184183

185184
## Next steps
186185

articles/ai-services/language-service/language-detection/overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ ms.custom: language-service-language-detection
1414

1515
# What is language detection in Azure AI Language?
1616

17-
Language detection is one of the features offered by [Azure AI Language](../overview.md), a collection of machine learning and AI algorithms in the cloud for developing intelligent applications that involve written language. Language detection is able to detect more than 100 languages in their primary script. In addition, it offers [script detection](./how-to/call-api.md#script-name-and-script-code) to detect multiple scripts per language according to the [ISO 15924 standard](https://wikipedia.org/wiki/ISO_15924) for a [select number of languages](./language-support.md#script-detection).
17+
Language detection is one of the features offered by [Azure AI Language](../overview.md), a collection of machine learning and AI algorithms in the cloud for developing intelligent applications that involve written language. Language detection is able to detect more than 100 languages in their primary script. In addition, it offers [script detection](./how-to/call-api.md#script-name-and-script-code) to detect supported scripts for each detected language according to the [ISO 15924 standard](https://wikipedia.org/wiki/ISO_15924) for a [select number of languages](./language-support.md#script-detection) supported by Azure AI Language Service.
1818

1919
This documentation contains the following types of articles:
2020

articles/ai-services/language-service/summarization/language-support.md

Lines changed: 15 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -31,8 +31,7 @@ Extractive text and document summarization support the following languages:
3131
| Japanese | `ja` | |
3232
| Korean | `ko` | |
3333
| Polish | `pl` | |
34-
| Portuguese (Portugal) | `pt` | |
35-
| Portuguese (Brazil) | `pt-br` | |
34+
| Portuguese | `pt` | `pt-br` also accepted |
3635
| Spanish | `es` | |
3736

3837
Abstractive text and document summarization support the following languages:
@@ -49,8 +48,7 @@ Abstractive text and document summarization support the following languages:
4948
| Japanese | `ja` | |
5049
| Korean | `ko` | |
5150
| Polish | `pl` | |
52-
| Portuguese (Portugal) | `pt` | |
53-
| Portuguese (Brazil) | `pt-br` | |
51+
| Portuguese | `pt` | `pt-br` also accepted |
5452
| Spanish | `es` | |
5553

5654
## Conversation summarization
@@ -60,7 +58,7 @@ Conversation summarization supports the following languages:
6058
| Language | Language code | Notes |
6159
|-----------------------|---------------|---------------------|
6260
| Chinese-Simplified | `zh-hans` | `zh` also accepted |
63-
| Chinese-Traditional | `zh-hant` | |
61+
|Chinese-Traditional |`zh-hant`||
6462
| English | `en` | |
6563
| French | `fr` | |
6664
| German | `de` | |
@@ -69,20 +67,19 @@ Conversation summarization supports the following languages:
6967
| Japanese | `ja` | |
7068
| Korean | `ko` | |
7169
| Polish | `pl` | |
72-
| Portuguese (Portugal) | `pt` | |
73-
| Portuguese (Brazil) | `pt-br` | |
74-
| Dutch, Flemish | `nl` | |
75-
| Swedish | `sv` | |
76-
| Danish | `da` | |
77-
| Finnish | `fi` | |
78-
| Russian | `ru` | |
79-
| Norwegian | `no` | |
80-
| Turkish | `tr` | |
81-
| Arabic | `ar` | |
82-
| Czech | `cs` | |
83-
| Hungarian | `hu` | |
84-
| Thai | `th` | |
70+
| Portuguese | `pt` | `pt-br` also accepted |
8571
| Spanish | `es` | |
72+
|Dutch| `nl`||
73+
|Swedish| `sv`||
74+
|Danish| `da`||
75+
|Finnish| `fi`||
76+
|Russian| `ru`||
77+
|Norwegian| `no`||
78+
|Turkish| `tr`||
79+
|Arabic| `ar`||
80+
|Czech| `cs`||
81+
|Hungarian| `hu`||
82+
|Thai| `th`||
8683

8784
## Custom summarization
8885

0 commit comments

Comments
 (0)