Skip to content

Commit 8d38e88

Browse files
authored
Merge pull request #223136 from Juliako/patch-2
Update language-support.md
2 parents dd46d48 + acf11eb commit 8d38e88

File tree

1 file changed

+109
-95
lines changed

1 file changed

+109
-95
lines changed

articles/azure-video-indexer/language-support.md

Lines changed: 109 additions & 95 deletions
Original file line numberDiff line numberDiff line change
@@ -6,113 +6,127 @@ manager: femila
66
ms.topic: conceptual
77
ms.custom: ignite-2022
88
ms.author: juliako
9-
ms.date: 12/12/2022
9+
ms.date: 01/06/2023
1010
---
1111

1212
# Language support in Azure Video Indexer
1313

1414
This article provides a comprehensive list of language support by service features in Azure Video Indexer. For the list and definitions of all the features, see [Overview](video-indexer-overview.md).
1515

16-
The following list contains the source languages for transcription that are supported by the Azure Video Indexer API.
16+
Some languages are supported only through the API (see [Get Supported Languages](https://api-portal.videoindexer.ai/api-details#api=Operations&operation=Get-Supported-Languages)) and not through the Video Indexer website or widgets. To make sure a language is supported for search, transcription, or translation by the Azure Video Indexer website and widgets, see the [frontend language
17+
support table](#language-support-in-frontend-experiences) further below.
1718

18-
> [!NOTE]
19-
> Some languages are supported only through the API and not through the Video Indexer website or widgets.
20-
>
21-
> To make sure a language is supported for search, transcription, or translation by the Azure Video Indexer website and widgets, see the [frontend language
22-
> support table](#language-support-in-frontend-experiences) further below.
19+
## API language support
20+
21+
The API returns a list of supported languages with the following values:
2322

24-
## General language support
23+
```json
24+
"name": "Language",
25+
"languageCode": "Code",
26+
"isRightToLeft": true/false,
27+
"isSourceLanguage": true/false,
28+
"isAutoDetect": true/false
29+
```
2530

26-
This section describes languages supported by Azure Video Indexer API.
31+
Some notes for the above values are:
2732

28-
- Transcription (source language of the video/audio file)
29-
- Language identification (LID)
30-
- Multi-language identification (MLID)
31-
- Translation
33+
- Supported source language:
3234

33-
The following insights are translated, otherwise will remain in English:
35+
If `isSourceLanguage` is `false`, the language is supported for translation only.
36+
If `isSourceLanguage` is `true`, the language is supported as source for transcription, translation, and search.
37+
- Language identification (auto detection):
38+
39+
If `isAutoDetect` set to `true`, the language is supported for language identification (LID) and multi-language identification (MLID).
40+
- The following insights are translated, otherwise will remain in English:
3441

35-
- Transcript
36-
- Keywords
37-
- Topics
38-
- Labels
39-
- Frame patterns (Only to Hebrew as of now)
40-
- Language customization
41-
42-
| **Language** | **Code** | **Transcription** | **LID** | **MLID** | **Translation** | **Customization** (language model) |
43-
|:------------------------------:|:--------------:|:-----------------:|:-----------:|:-----------:|:-------------:|:---------------:|
44-
| Afrikaans | `af-ZA` | | | || |
45-
| Arabic (Israel) | `ar-IL` || | |||
46-
| Arabic (Jordan) | `ar-JO` ||||||
47-
| Arabic (Kuwait) | `ar-KW` ||||||
48-
| Arabic (Lebanon) | `ar-LB` || | |||
49-
| Arabic (Oman) | `ar-OM` ||||||
50-
| Arabic (Palestinian Authority) | `ar-PS` || | |||
51-
| Arabic (Qatar) | `ar-QA` ||||||
52-
| Arabic (Saudi Arabia) | `ar-SA` ||||||
53-
| Arabic (United Arab Emirates) | `ar-AE` ||||||
54-
| Arabic Egypt | `ar-EG` ||||||
55-
| Arabic Modern Standard (Bahrain) | `ar-BH` ||||||
56-
| Arabic Syrian Arab Republic | `ar-SY` ||||||
57-
| Bangla | `bn-BD` | | | || |
58-
| Bosnian | `bs-Latn` | | | || |
59-
| Bulgarian | `bg-BG` | | | || |
60-
| Catalan | `ca-ES` | | | || |
61-
| Chinese (Cantonese Traditional) | `zh-HK` ||||||
62-
| Chinese (Simplified) | `zh-Hans` ||| |||
63-
| Chinese (Simplified) | `zh-CK` ||| |||
64-
| Chinese (Traditional) | `zh-Hant` | | | || |
65-
| Croatian | `hr-HR` | | | || |
66-
| Czech | `cs-CZ` ||||||
67-
| Danish | `da-DK` ||||||
68-
| Dutch | `nl-NL` ||||||
69-
| English Australia | `en-AU` ||||||
70-
| English United Kingdom | `en-GB` ||||||
71-
| English United States | `en-US` ||||||
72-
| Estonian | `et-EE` | | | || |
73-
| Fijian | `en-FJ` | | | || |
74-
| Filipino | `fil-PH` | | | || |
75-
| Finnish | `fi-FI` ||||||
76-
| French | `fr-FR` ||||||
77-
| French (Canada) | `fr-CA` ||||||
78-
| German | `de-DE` ||||||
79-
| Greek | `el-GR` | | | || |
80-
| Haitian | `fr-HT` | | | || |
81-
| Hebrew | `he-IL` ||||||
82-
| Hindi | `hi-IN` ||||||
83-
| Hungarian | `hu-HU` | | | || |
84-
| Indonesian | `id-ID` | | | || |
85-
| Italian | `it-IT` ||||||
86-
| Japanese | `ja-JP` ||||||
87-
| Kiswahili | `sw-KE` | | | || |
88-
| Korean | `ko-KR` ||||||
89-
| Latvian | `lv-LV` | | | || |
90-
| Lithuanian | `lt-LT` | | | || |
91-
| Malagasy | `mg-MG` | | | || |
92-
| Malay | `ms-MY` | | | || |
93-
| Maltese | `mt-MT` | | | || |
94-
| Norwegian | `nb-NO` ||||||
95-
| Persian | `fa-IR` || | |||
96-
| Polish | `pl-PL` ||||||
97-
| Portuguese | `pt-BR` ||||||
98-
| Portuguese (Portugal) | `pt-PT` ||||||
99-
| Romanian | `ro-RO` | | | || |
100-
| Russian | `ru-RU` ||||||
101-
| Samoan | `en-WS` | | | || |
102-
| Serbian (Cyrillic) | `sr-Cyrl-RS` | | | || |
103-
| Serbian (Latin) | `sr-Latn-RS` | | | || |
104-
| Slovak | `sk-SK` | | | || |
105-
| Slovenian | `sl-SI` | | | || |
106-
| Spanish | `es-ES` ||||||
107-
| Spanish (Mexico) | `es-MX` || | |||
108-
| Swedish | `sv-SE` ||||||
109-
| Tamil | `ta-IN` | | | || |
110-
| Thai | `th-TH` ||||||
111-
| Tongan | `to-TO` | | | || |
112-
| Turkish | `tr-TR` ||||||
113-
| Ukrainian | `uk-UA` ||||| |
114-
| Urdu | `ur-PK` | | | || |
115-
| Vietnamese | `vi-VN` ||||| |
42+
- Transcript
43+
- Keywords
44+
- Topics
45+
- Labels
46+
- Frame patterns (Only to Hebrew as of now)
47+
48+
| **Language** | **Code** | **Supported source language** | **Language identification** | **Customization** (language model) |
49+
|:--------------:|:--------------:|:-----------------------------:|:----------------------------:|:-----------:|
50+
| Afrikaans | `af-ZA` | || |
51+
| Arabic (Israel) | `ar-IL` || ||
52+
| Arabic (Iraq) | `ar-IQ` ||| |
53+
| Arabic (Jordan) | `ar-JO` ||||
54+
| Arabic (Kuwait) | `ar-KW` ||||
55+
| Arabic (Lebanon) | `ar-LB` || ||
56+
| Arabic (Oman) | `ar-OM` ||||
57+
| Arabic (Paestinian Authority) | `ar-PS` || ||
58+
| Arabic (Qatar) | `ar-QA` ||||
59+
| Arabic (Saudi Arabia) | `ar-SA` ||||
60+
| Arabic (United Arab Emirates) | `ar-AE` ||||
61+
| Arabic Egypt | `ar-EG` ||||
62+
| Arabic Modern Standard (Bahrain) | `ar-BH` ||||
63+
| Arabic Syrian Arab Republic | `ar-SY` ||||
64+
| Armenian | `hy-AM` || | |
65+
| Bangla | `bn-BD` | || |
66+
| Bosnian | `bs-Latn` | || |
67+
| Bulgarian | `bg-BG` ||| |
68+
| Catalan | `ca-ES` ||| |
69+
| Chinese (Cantonese Traditional) | `zh-HK` ||||
70+
| Chinese (Simplified) | `zh-Hans` ||||
71+
| Chinese (Simplified) | `zh-CK` ||||
72+
| Chinese (Traditional) | `zh-Hant` | || |
73+
| Croatian | `hr-HR` ||| |
74+
| Czech | `cs-CZ` ||||
75+
| Danish | `da-DK` ||||
76+
| Dutch | `nl-NL` ||||
77+
| English Australia | `en-AU` ||||
78+
| English United Kingdom | `en-GB` ||||
79+
| English United States | `en-US` ||||
80+
| Estonian | `et-EE` ||| |
81+
| Fijian | `en-FJ` | || |
82+
| Filipino | `fil-PH` | || |
83+
| Finnish | `fi-FI` ||||
84+
| French | `fr-FR` ||||
85+
| French (Canada) | `fr-CA` ||||
86+
| German | `de-DE` ||||
87+
| Greek | `el-GR` ||| |
88+
| Gujarati | `gu-IN` ||| |
89+
| Haitian | `fr-HT` | || |
90+
| Hebrew | `he-IL` ||||
91+
| Hindi | `hi-IN` ||||
92+
| Hungarian | `hu-HU` | || |
93+
| Icelandic | `is-IS` || | |
94+
| Indonesian | `id-ID` | || |
95+
| Irish | `ga-IE` ||| |
96+
| Italian | `it-IT` ||||
97+
| Japanese | `ja-JP` ||||
98+
| Kannada | `kn-IN` ||| |
99+
| Kiswahili | `sw-KE` | || |
100+
| Korean | `ko-KR` ||||
101+
| Latvian | `lv-LV` ||| |
102+
| Lithuanian | `lt-LT` | || |
103+
| Malagasy | `mg-MG` | || |
104+
| Malay | `ms-MY` || | |
105+
| Malayalam | `ml-IN` ||| |
106+
| Maltese | `mt-MT` | || |
107+
| Norwegian | `nb-NO` ||||
108+
| Persian | `fa-IR` || ||
109+
| Polish | `pl-PL` ||||
110+
| Portuguese | `pt-BR` ||||
111+
| Portuguese (Portugal) | `pt-PT` ||||
112+
| Romanian | `ro-RO` ||| |
113+
| Russian | `ru-RU` ||||
114+
| Samoan | `en-WS` | || |
115+
| Serbian (Cyrillic) | `sr-Cyrl-RS` | || |
116+
| Serbian (Latin) | `sr-Latn-RS` | || |
117+
| Slovak | `sk-SK` ||| |
118+
| Slovenian | `sl-SI` ||| |
119+
| Spanish | `es-ES` ||||
120+
| Spanish (Mexico) | `es-MX` ||||
121+
| Swedish | `sv-SE` ||||
122+
| Tamil | `ta-IN` ||| |
123+
| Telugu | `te-IN` ||| |
124+
| Thai | `th-TH` ||||
125+
| Tongan | `to-TO` | || |
126+
| Turkish | `tr-TR` ||||
127+
| Ukrainian | `uk-UA` ||| |
128+
| Urdu | `ur-PK` | | | |
129+
| Vietnamese | `vi-VN` ||| |
116130

117131
**Default languages supported by Language identification (LID)**: German (de-DE) , English United States (en-US) , Spanish (es-ES) , French (fr-FR), Italian (it-IT) , Japanese (ja-JP), Portuguese (pt-BR), Russian (ru-RU), Chinese (Simplified) (zh-Hans).
118132

0 commit comments

Comments
 (0)