Skip to content

Commit e3fa476

Browse files
authored
Merge pull request #111078 from Juliako/patch-23
Update language-identification-model.md
2 parents 22b4a83 + 407a5a0 commit e3fa476

File tree

2 files changed

+13
-6
lines changed

2 files changed

+13
-6
lines changed

articles/media-services/video-indexer/language-identification-model.md

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -9,13 +9,17 @@ manager: femila
99
ms.service: media-services
1010
ms.subservice: video-indexer
1111
ms.topic: article
12-
ms.date: 09/12/2019
12+
ms.date: 04/12/2020
1313
ms.author: ellbe
1414
---
1515

1616
# Automatically identify the spoken language with language identification model
1717

18-
Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language. Currently LID supports English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Portuguese (Brazilian).
18+
Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language.
19+
20+
Currently LID supports: English, Spanish, French, German, Italian, Mandarin Chines, Japanese, Russian, and Portuguese (Brazilian).
21+
22+
Make sure to review the [Guidelines and limitations](#guidelines-and-limitations) section below.
1923

2024
## Choosing auto language identification on indexing
2125

@@ -45,7 +49,10 @@ Model dominant language is available in the insights JSON as the `sourceLanguage
4549

4650
## Guidelines and limitations
4751

48-
* Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Brazilian Portuguese.
52+
* Automatic language identification (LID) supports the following languages:
53+
54+
English, Spanish, French, German, Italian, Mandarin Chines, Japanese, Russian, and Portuguese (Brazilian).
55+
* Even though Video Indexer supports Arabic (Modern Standard and Levantine), Hindi, and Korean, these languages are not supported in LID.
4956
* If the audio contains languages other than the supported list above, the result is unexpected.
5057
* If Video Indexer cannot identify the language with a high enough confidence (`>0.6`), the fallback language is English.
5158
* There is no current support for file with mixed languages audio. If the audio contains mixed languages, the result is unexpected.

articles/media-services/video-indexer/video-indexer-overview.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ manager: femila
99
ms.service: media-services
1010
ms.subservice: video-indexer
1111
ms.topic: article
12-
ms.date: 02/02/2020
12+
ms.date: 04/12/2020
1313
ms.author: juliako
1414
---
1515

@@ -66,9 +66,9 @@ The following list shows the insights you can retrieve from your videos using Vi
6666

6767
### Audio insights
6868

69-
* **Automatic language detection**: Automatically identifies the dominant spoken language. Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Brazilian Portuguese. If the language can't be identified with confidence, Video Indexer assumes the spoken language is English. For more information, see [Language identification model](language-identification-model.md).
69+
* **Audio transcription**: Converts speech to text in 12 languages and allows extensions. Supported languages include English, Spanish, French, German, Italian, Mandarin Chines, Japanese, Arabic, Russian, Brazilian Portuguese, Hindi, and Korean.
70+
* **Automatic language detection**: Automatically identifies the dominant spoken language. Supported languages include English, Spanish, French, German, Italian, Mandarin Chines, Japanese, Russian, and Brazilian Portuguese. If the language can't be identified with confidence, Video Indexer assumes the spoken language is English. For more information, see [Language identification model](language-identification-model.md).
7071
* **Multi-language speech identification and transcription** (preview): Automatically identifies the spoken language in different segments from audio. It sends each segment of the media file to be transcribed and then combines the transcription back to one unified transcription. For more information, see [Automatically identify and transcribe multi-language content](multi-language-identification-transcription.md).
71-
* **Audio transcription**: Converts speech to text in 12 languages and allows extensions. Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Arabic, Russian, Brazilian Portuguese, Hindi, and Korean.
7272
* **Closed captioning**: Creates closed captioning in three formats: VTT, TTML, SRT.
7373
* **Two channel processing**: Auto detects separate transcript and merges to single timeline.
7474
* **Noise reduction**: Clears up telephony audio or noisy recordings (based on Skype filters).

0 commit comments

Comments
 (0)