Commit 1a7191a

Merge pull request #239241 from Juliako/patch-9
Update audio-effects-detection.md
2 parents 40b6b74 + ee3b6b3 commit 1a7191a

1 file changed

+17
-13
lines changed

articles/azure-video-indexer/audio-effects-detection.md

Lines changed: 17 additions & 13 deletions
@@ -2,7 +2,7 @@
 title: Enable audio effects detection
 description: Audio Effects Detection is one of Azure Video Indexer AI capabilities that detects various acoustics events and classifies them into different acoustic categories (for example, gunshot, screaming, crowd reaction and more).
 ms.topic: how-to
-ms.date: 01/04/2022
+ms.date: 05/24/2023
 ms.author: juliako
 ---

@@ -18,31 +18,36 @@ Some scenarios where this feature is useful:

 ## Supported audio categories

-**Audio effect detection** can detect and classify 7 different categories. In the next table, you can find the different categories split in to the different presets, divided to **Standard** and **Advanced**. For more information, see [pricing](https://azure.microsoft.com/pricing/details/media-services/).
+**Audio effect detection** can detect and classify different categories. In the following table, you can find the different categories split in to the different presets, divided to **Standard** and **Advanced**. For more information, see [pricing](https://azure.microsoft.com/pricing/details/video-indexer/).
+
+The following table shows which categories are supported depending on **Preset Name** (**Audio Only** / **Video + Audio** vs. **Advance Audio** / **Advance Video + Audio**). When you are using the **Advanced** indexing, categories appear in the **Insights** pane of the website.

 |Indexing type |Standard indexing| Advanced indexing|
 |---|---|---|
-|**Preset Name** |**"Audio Only"** <br/>**"Video + Audio"** |**"Advance Audio"**<br/> **"Advance Video + Audio"**|
-|**Appear in insights pane**|| V|
 | Crowd Reactions || V|
 | Silence| V| V|
 | Gunshot or explosion ||V |
 | Breaking glass ||V|
 | Alarm or siren|| V |
 | Laughter|| V |
-| Dog barking|| V|
+| Dog || V|
+| Bell ringing|| V|
+| Bird|| V|
+| Car|| V|
+| Engine|| V|
+| Crying|| V|
+| Music playing|| V|
+| Screaming|| V|
+| Thunderstorm || V|

 ## Result formats

-The audio effects are retrieved in the insights JSON that includes the category ID, type, name, and set of instances per category along with their specific timeframe and confidence score.
-
-The `name` parameter will be presented in the language in which the JSON was indexed, while the type will always remain the same.
+The audio effects are retrieved in the insights JSON that includes the category ID, type, and set of instances per category along with their specific timeframe and confidence score.

 ```json
 audioEffects: [{
 id: 0,
 type: "Gunshot or explosion",
-name: "Gunshot",
 instances: [{
 confidence: 0.649,
 adjustedStart: "0:00:13.9",
@@ -60,7 +65,6 @@ audioEffects: [{
 }, {
 id: 1,
 type: "CrowdReactions",
-name: "Crowd Reactions",
 instances: [{
 confidence: 0.6816,
 adjustedStart: "0:00:47.9",
@@ -89,7 +93,7 @@ In order to set the index process to include the detection of audio effects, sel

 ## Closed Caption

-When audio effects are retrieved in the closed caption files, they will be retrieved in square brackets the following structure:
+When audio effects are retrieved in the closed caption files, they are retrieved in square brackets the following structure:

 |Type| Example|
 |---|---|
@@ -99,7 +103,7 @@ When audio effects are retrieved in the closed caption files, they will be retri
 |TXT |[Gunshot or explosion]|
 |CSV |0.9047,00:00:00.000,00:00:03.671, [Gunshot or explosion]|

-Audio Effects in closed captions file will be retrieved with the following logic employed:
+Audio Effects in closed captions file is retrieved with the following logic employed:

 * `Silence` event type will not be added to the closed captions
 * Maximum duration to show an event I 5 seconds
@@ -113,7 +117,7 @@ Audio effects can be added to the closed captions files supported by Azure Video
 > :::image type="content" source="./media/audio-effects-detection/close-caption.jpg" alt-text="Audio Effects in CC":::

 > [!NOTE]
-> When using [update transcript](https://api-portal.videoindexer.ai/api-details#api=Operations&operation=Update-Video-Transcript) from closed caption files or [update custom language model](https://api-portal.videoindexer.ai/api-details#api=Operations&operation=Update-Language-Model) from closed caption files, audio effects included in those files will be ignored.
+> When using [update transcript](https://api-portal.videoindexer.ai/api-details#api=Operations&operation=Update-Video-Transcript) from closed caption files or [update custom language model](https://api-portal.videoindexer.ai/api-details#api=Operations&operation=Update-Language-Model) from closed caption files, audio effects included in those files are ignored.

 ## Limitations and assumptions
