articles/media-services/video-indexer/language-identification-model.md
Lines changed: 10 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -9,13 +9,17 @@ manager: femila
9
9
ms.service: media-services
10
10
ms.subservice: video-indexer
11
11
ms.topic: article
12
-
ms.date: 09/12/2019
12
+
ms.date: 04/12/2020
13
13
ms.author: ellbe
14
14
---
15
15
16
16
# Automatically identify the spoken language with language identification model
17
17
18
-
Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language. Currently LID supports English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Portuguese (Brazilian).
18
+
Video Indexer supports automatic language identification (LID), which is the process of automatically identifying the spoken language content from audio and sending the media file to be transcribed in the dominant identified language.
19
+
20
+
Currently LID supports: English, Spanish, French, German, Italian, Mandarin Chinese, Japanese, Russian, and Portuguese (Brazilian).
21
+
22
+
Make sure to review the [Guidelines and limitations](#guidelines-and-limitations) section below.
19
23
20
24
## Choosing auto language identification on indexing
21
25
@@ -45,7 +49,10 @@ Model dominant language is available in the insights JSON as the `sourceLanguage
45
49
46
50
## Guidelines and limitations
47
51
48
-
* Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Brazilian Portuguese.
52
+
* Automatic language identification (LID) supports the following languages:
articles/media-services/video-indexer/video-indexer-overview.md
Lines changed: 3 additions & 3 deletions
@@ -9,7 +9,7 @@ manager: femila
9
9
ms.service: media-services
10
10
ms.subservice: video-indexer
11
11
ms.topic: article
12
-
ms.date: 02/02/2020
12
+
ms.date: 04/12/2020
13
13
ms.author: juliako
14
14
---
15
15
@@ -66,9 +66,9 @@ The following list shows the insights you can retrieve from your videos using Vi
66
66
67
67
### Audio insights
68
68
69
-
* **Automatic language detection**: Automatically identifies the dominant spoken language. Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Russian, and Brazilian Portuguese. If the language can't be identified with confidence, Video Indexer assumes the spoken language is English. For more information, see [Language identification model](language-identification-model.md).
69
+
* **Audio transcription**: Converts speech to text in 12 languages and allows extensions. Supported languages include English, Spanish, French, German, Italian, Mandarin Chinese, Japanese, Arabic, Russian, Brazilian Portuguese, Hindi, and Korean.
70
+
* **Automatic language detection**: Automatically identifies the dominant spoken language. Supported languages include English, Spanish, French, German, Italian, Mandarin Chinese, Japanese, Russian, and Brazilian Portuguese. If the language can't be identified with confidence, Video Indexer assumes the spoken language is English. For more information, see [Language identification model](language-identification-model.md).
70
71
* **Multi-language speech identification and transcription** (preview): Automatically identifies the spoken language in different segments from audio. It sends each segment of the media file to be transcribed and then combines the transcription back to one unified transcription. For more information, see [Automatically identify and transcribe multi-language content](multi-language-identification-transcription.md).
71
-
* **Audio transcription**: Converts speech to text in 12 languages and allows extensions. Supported languages include English, Spanish, French, German, Italian, Chinese (Simplified), Japanese, Arabic, Russian, Brazilian Portuguese, Hindi, and Korean.
72
72
* **Closed captioning**: Creates closed captioning in three formats: VTT, TTML, SRT.
73
73
* **Two channel processing**: Auto detects separate transcript and merges to single timeline.
74
74
* **Noise reduction**: Clears up telephony audio or noisy recordings (based on Skype filters).
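
The fallback described in the Automatic language detection bullet (assume English when no language is identified with confidence) can be illustrated with a minimal sketch. This is not Video Indexer's implementation or API: the function name `pick_transcription_language`, the per-language `scores` dictionary, and the `0.5` threshold are all hypothetical, chosen only to show the shape of the rule. The language set is the one listed in the changed documentation.

```python
# Hypothetical sketch of the documented fallback rule. The names, the score
# format, and the threshold are assumptions, not Video Indexer's actual API.

# Languages the changed docs list as supported by LID.
LID_LANGUAGES = {
    "English", "Spanish", "French", "German", "Italian",
    "Mandarin Chinese", "Japanese", "Russian", "Portuguese (Brazilian)",
}


def pick_transcription_language(scores, threshold=0.5, fallback="English"):
    """Return the dominant supported language, or the fallback.

    scores: mapping of language name -> confidence in [0, 1] (assumed format).
    threshold: assumed cutoff below which the result is not trusted.
    """
    # Ignore languages that LID does not support.
    supported = {lang: s for lang, s in scores.items() if lang in LID_LANGUAGES}
    if not supported:
        return fallback
    # Dominant language is the one with the highest confidence.
    best = max(supported, key=supported.get)
    # Fall back to English when the dominant language is not confident enough.
    return best if supported[best] >= threshold else fallback
```

For example, `pick_transcription_language({"Spanish": 0.9, "English": 0.1})` selects Spanish, while a low-confidence or unsupported-only input falls back to English.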