articles/media-services/video-indexer/language-identification-model.md
Lines changed: 5 additions & 3 deletions
@@ -9,13 +9,15 @@ manager: femila
 ms.service: media-services
 ms.subservice: video-indexer
 ms.topic: article
-ms.date: 09/12/2019
+ms.date: 04/12/2020
 ms.author: ellbe
 ---

 # Automatically identify the spoken language with the language identification model

-Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language. Currently LID supports English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Portuguese (Brazilian).
+Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language. Currently, LID supports English, Spanish, French, German, Italian, Mandarin Chinese, Japanese, Russian, and Portuguese (Brazilian).
+
+Make sure to review the [Guidelines and limitations](#guidelines-and-limitations) section below.

 ## Choosing auto language identification on indexing

@@ -45,7 +47,7 @@ Model dominant language is available in the insights JSON as the `sourceLanguage

 ## Guidelines and limitations

-*Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Brazilian Portuguese.
+* Arabic (Modern Standard and Levantine), Hindi, and Korean are not supported in automatic language identification.
 * If the audio contains languages other than those in the supported list above, the result is unpredictable.
 * If Video Indexer cannot identify the language with a high enough confidence (`>0.6`), the fallback language is English.
 * There is currently no support for files with mixed-language audio. If the audio contains mixed languages, the result is unpredictable.
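
The confidence fallback described in the guidelines above can be illustrated with a minimal Python sketch. This is not Video Indexer's implementation: the function name `choose_transcription_language` and both constants are hypothetical, and treating the threshold as strictly greater than 0.6 is an assumption read from the `>0.6` notation.

```python
# Hypothetical names throughout; this only illustrates the documented rule.
CONFIDENCE_THRESHOLD = 0.6      # from the guideline's `>0.6`
FALLBACK_LANGUAGE = "English"   # documented fallback language


def choose_transcription_language(detected_language: str, confidence: float) -> str:
    """Return the language to transcribe in, given an LID result.

    Assumption: the detected language is used only when the confidence is
    strictly greater than the threshold; otherwise the fallback applies.
    """
    if confidence > CONFIDENCE_THRESHOLD:
        return detected_language
    return FALLBACK_LANGUAGE


if __name__ == "__main__":
    print(choose_transcription_language("Spanish", 0.92))
    print(choose_transcription_language("Japanese", 0.41))
```

Under these assumptions, a confident Spanish detection is kept, while a low-confidence Japanese detection falls back to English.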