Skip to content

Commit 41f8302

Browse files
authored
Update language-identification-model.md
1 parent e2d74a8 commit 41f8302

File tree

1 file changed

+5
-3
lines changed

1 file changed

+5
-3
lines changed

articles/media-services/video-indexer/language-identification-model.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -9,13 +9,15 @@ manager: femila
99
ms.service: media-services
1010
ms.subservice: video-indexer
1111
ms.topic: article
12-
ms.date: 09/12/2019
12+
ms.date: 04/12/2020
1313
ms.author: ellbe
1414
---
1515

1616
# Automatically identify the spoken language with language identification model
1717

18-
Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language. Currently LID supports English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Portuguese (Brazilian).
18+
Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language. Currently LID supports English, Spanish, French, German, Italian, Mandarin Chines, Japanese, Russian, and Portuguese (Brazilian).
19+
20+
Make sure to review the [Guidelines and limitations](#guidelines-and-limitations) section below.
1921

2022
## Choosing auto language identification on indexing
2123

@@ -45,7 +47,7 @@ Model dominant language is available in the insights JSON as the `sourceLanguage
4547

4648
## Guidelines and limitations
4749

48-
* Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Brazilian Portuguese.
50+
* Not supported in LID: Arabic (Modern Standard and Levantine), Hindi, and Korean are not supported in automatic language identification.
4951
* If the audio contains languages other than the supported list above, the result is unexpected.
5052
* If Video Indexer cannot identify the language with a high enough confidence (`>0.6`), the fallback language is English.
5153
* There is no current support for file with mixed languages audio. If the audio contains mixed languages, the result is unexpected.

0 commit comments

Comments
 (0)