Skip to content

Commit 3ee4bde

Browse files
authored
Merge pull request #3342 from MicrosoftDocs/main
3/5/2025 AM Publish
2 parents c00700d + 5aa22af commit 3ee4bde

22 files changed

+533
-162
lines changed

articles/ai-services/speech-service/includes/how-to/professional-voice/train-voice/speech-studio.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -208,7 +208,7 @@ It was Janet Maslin.
208208

209209
Each paragraph of the utterance results in a separate audio. If you want to combine all sentences into one audio, make them a single paragraph.
210210

211-
>[!NOTE]
211+
> [!NOTE]
212212
> The generated audio files are a combination of the automatic test scripts and custom test scripts.
213213
214214
### Update engine version for your voice model
Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,47 @@
1+
---
2+
title: include file
3+
description: include file
4+
author: eric-urban
5+
ms.author: eur
6+
ms.service: azure-ai-speech
7+
ms.topic: include
8+
ms.date: 3/4/2025
9+
ms.custom: references_regions
10+
---
11+
12+
[!INCLUDE [Feature preview](../../../../includes/preview-feature.md)]
13+
14+
In this article, you learn how to use video translation with Azure AI Speech in the [Azure AI Foundry portal](https://ai.azure.com/).
15+
16+
## Try out video translation
17+
18+
To try out video translation, follow these steps:
19+
20+
1. Go to the [model catalog in Azure AI Foundry portal](https://ai.azure.com/explore/models).
21+
22+
1. Enter "Azure-AI-Speech" in the catalog search box.
23+
24+
:::image type="content" source="../../../media/video-translation/search-model-catalog.png" alt-text="Screenshot of the model catalog in Azure AI Foundry portal." lightbox="../../../media/video-translation/search-model-catalog.png":::
25+
26+
1. Select **Azure-AI-Speech** and you're taken to the **Azure-AI-Speech** try out page.
27+
1. Select **Speech capabilities by scenario** > **Video translation**.
28+
29+
:::image type="content" source="../../../media/video-translation/capabilities-by-scenario.png" alt-text="Screenshot of filtering Speech service capabilities by scenario." lightbox="../../../media/video-translation/capabilities-by-scenario.png":::
30+
31+
1. Under the **Sample option** to the right, select personal or prebuilt voice.
32+
33+
1. Select the **Play** button to hear the translated audio. Select the original video tab to play the original audio.
34+
35+
:::image type="content" source="../../../media/video-translation/compare-original-translated.png" alt-text="Screenshot of selecting a voice type on the new project page." lightbox="../../../media/video-translation/compare-original-translated.png":::
36+
37+
The voice type options are:
38+
- **Prebuilt voice**: The service automatically selects the most suitable prebuilt voice by matching the speaker's voice in the video with prebuilt voices.
39+
- **Personal voice**: Use the personal voice that matches the voice of the speakers in the video.
40+
41+
> [!NOTE]
42+
> To use personal voice via the API, you need to apply for [access](https://aka.ms/customneural).
43+
44+
45+
## Related content
46+
47+
- [Video translation overview](../../../video-translation-overview.md)

articles/ai-services/speech-service/includes/how-to/video-translation/rest.md

Lines changed: 126 additions & 77 deletions
Large diffs are not rendered by default.

articles/ai-services/speech-service/includes/how-to/video-translation/speech-studio.md

Lines changed: 50 additions & 30 deletions
Original file line numberDiff line numberDiff line change
@@ -5,15 +5,20 @@ author: eric-urban
55
ms.author: eur
66
ms.service: azure-ai-speech
77
ms.topic: include
8-
ms.date: 10/11/2024
9-
ms.custom: include
8+
ms.date: 3/4/2025
9+
ms.custom: references_regions
1010
---
1111

1212
[!INCLUDE [Feature preview](../../../../includes/preview-feature.md)]
1313

14-
In this article, you learn how to use Azure AI Speech video translation in the studio.
14+
In this article, you learn how to use video translation with Azure AI Speech in the [Speech Studio](https://aka.ms/speechstudio).
1515

16-
All it takes to get started is an original video. See if video translation supports your [language](../../../language-support.md?tabs=speech-translation#video-translation) and [region](../../../video-translation-overview.md#supported-regions-and-languages).
16+
## Pre-requisites
17+
18+
- An Azure subscription. If you don't have an Azure subscription, create a free account before you begin.
19+
- A Speech resource [in a supported region](../../../video-translation-overview.md#supported-regions-and-languages). If you don't have a Speech resource, create one in the [Azure portal](https://portal.azure.com/).
20+
- You need a video file in .mp4 format, less than 500 MB, and shorter than 60 minutes. For testing purposes, you can use the sample video file provided by Microsoft at [https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4](https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4).
21+
- Make sure video translation supports your [source and target language](../../../language-support.md?tabs=speech-translation#video-translation).
1722

1823
## Create a video translation project
1924

@@ -27,59 +32,74 @@ To create a video translation project, follow these steps:
2732

2833
1. On the **Create and Manage Projects** page, select **Create a project**.
2934

30-
1. On the **New project** page, select **Voice type**.
35+
1. On the **New project** page, select a **Voice type**.
3136

32-
:::image type="content" source="../../../media/video-translation/select-voice-type.png" alt-text="Screenshot of selecting voice type on the new project page.":::
37+
:::image type="content" source="../../../media/video-translation/select-voice-type.png" alt-text="Screenshot of selecting a voice type on the new project page." lightbox="../../../media/video-translation/select-voice-type.png":::
3338

34-
You can select **Prebuilt neural voice** or **Personal voice** for **Voice type**. For prebuilt neural voice, the system automatically
35-
selects the most suitable prebuilt voice by matching the speaker's voice in the video with prebuilt voices. For personal voice, the system offers a model that generates high-quality voice replication in a few seconds.
39+
The voice type options are:
40+
- **Prebuilt neural voice**: The service automatically selects the most suitable prebuilt voice by matching the speaker's voice in the video with prebuilt voices.
41+
- **Personal voice**: Use the personal voice that matches the voice of the speakers in the video.
3642

37-
>[!NOTE]
43+
> [!NOTE]
3844
> To use personal voice, you need to apply for [access](https://aka.ms/customneural).
3945
40-
1. Upload your video file by dragging and dropping the video file or selecting the file manually.
46+
1. Upload your video file by dragging and dropping the video file or selecting the file manually. The video must be in .mp4 format, less than 500 MB, and shorter than 60 minutes.
47+
48+
1. Provide the **Project name**, **Number of speakers**, **Language of the video**, and **Translate to** language.
4149

42-
:::image type="content" source="../../../media/video-translation/upload-video-file.png" alt-text="Screenshot of uploading your video file on the new project page.":::
50+
1. Select the boxes to acknowledge the pricing information and code of conduct.
4351

44-
Ensure the video is in .mp4 format, less than 500 MB, and shorter than 60 minutes.
45-
46-
1. Provide **Project name**, and select **Number of speakers**, **Language of the video**, **Translate to** language.
52+
1. Select **Next: Advanced settings** if you want to adjust the advanced settings.
4753

48-
:::image type="content" source="../../../media/video-translation/provide-video-information.png" alt-text="Screenshot of providing video information on the new project page.":::
49-
50-
If you want to use your own subtitle files, select **Add subtitle file**. You can choose to upload either the source subtitle file or the target subtitle file. The subtitle file can be in WebVTT or JSON format. You can download a sample VTT file for your reference by selecting **Download sample VTT file**.
54+
:::image type="content" source="../../../media/video-translation/provide-video-information.png" alt-text="Screenshot of providing video information on the new project page." lightbox="../../../media/video-translation/provide-video-information.png":::
55+
56+
1. Optionally, you can adjust the following settings:
57+
58+
- **Lexicon file**: This option allows you to add custom words or phrases that the system should recognize and pronounce correctly. You can create a lexicon file in the [audio content creation tool in the Speech Studio](https://aka.ms/speechstudio) and select it here.
59+
- **Burn subtitles**: This option allows you to add subtitles to the video. The subtitle file can be in WebVTT or JSON format. You can download a sample WebVTT file for your reference by selecting **Download sample VTT file**.
5160

52-
:::image type="content" source="../../../media/video-translation/add-subtitle-file.png" alt-text="Screenshot of adding subtitle file on the new project page.":::
61+
:::image type="content" source="../../../media/video-translation/provide-video-information-advanced.png" alt-text="Screenshot of providing lexicon and subtitle information while creating a new project." lightbox="../../../media/video-translation/provide-video-information-advanced.png":::
62+
63+
If you want to use your own subtitle files, select **Subtitle** > **Upload your own**. You can choose to upload either the source subtitle file or the target subtitle file.
64+
- Automatic subtitles: Results in both source and target language subtitles.
65+
- Upload source language subtitles: Results in both source and target language subtitles.
66+
- Upload target language subtitles: Results in only target language subtitles.
5367

54-
1. After reviewing the pricing information and code of conduct, then proceed to create the project.
68+
1. Select **Create**.
5569

56-
Once the upload is complete, you can check the processing status on the project tab.
70+
Once the upload to Azure Blob Storage is complete, you can check the processing status on the project tab.
5771

58-
After the project is created, you can select the project to review detailed settings and make adjustments according to your preferences.
72+
After the project is created, you can select the project to review detailed settings and make adjustments according to your preferences.
5973

6074
## Check and adjust voice settings
6175

62-
On the project details page, the project offers two tabs **Translated** and **Original** under **Video**, allowing you to compare them side by side.
76+
On the project details page, you can see the **Translated** and **Original** tabs under **Video**. You can compare the original and translated videos by selecting the corresponding tab. The translated video is generated automatically, and you can play it to check the translation quality.
6377

64-
On the right side of the video, you can view both the original script and the translated script. Hovering over each part of the original script triggers the video to automatically jump to the corresponding segment of the original video, while hovering over each part of the translated script triggers the video to jump to the corresponding translated segment.
78+
To the right side of the video, you can view both the original script and the translated script. Hovering over each part of the original script triggers the video to automatically jump to the corresponding segment of the original video, while hovering over each part of the translated script triggers the video to jump to the corresponding translated segment.
6579

6680
You can also add or remove segments as needed. When you want to add a segment, ensure that the new segment timestamp doesn't overlap with the previous and next segment, and the segment end time should be larger than the start time. The correct format of timestamp should be `hh:mm:ss.ms`. Otherwise, you can't apply the changes.
6781

68-
You can adjust the time frame of the scripts directly using the audio waveform below the video. After selecting **Apply changes**, the adjustments will be applied.
82+
You can adjust the time frame of the scripts directly using the audio waveform below the video. The adjustments will be applied after you select **Apply changes**.
6983

7084
If you encounter segments with an "unidentified" voice name, it might be because the system couldn't accurately detect the voice, especially in situations where speaker voices overlap. In such cases, it's advisable to manually change the voice name.
7185

72-
:::image type="content" source="../../../media/video-translation/voice-unidentified.png" alt-text="Screenshot of one segment with unidentified voice name.":::
86+
:::image type="content" source="../../../media/video-translation/voice-unidentified.png" alt-text="Screenshot of one segment with unidentified voice name." lightbox="../../../media/video-translation/voice-unidentified.png":::
87+
88+
If you want to adjust the voice, select **Voice settings** to make some changes. On the **Voice settings** page, you can adjust the voice type, gender, and the voice. Select the voice sample on the right of **Voice** to determine your voice selection. If you find there's missing voice, you can add the new voice name by selecting **Add speaker**. After changing the settings, select **Update**.
89+
90+
:::image type="content" source="../../../media/video-translation/voice-settings.png" alt-text="Screenshot of adjusting voice settings on the voice settings page." lightbox="../../../media/video-translation/voice-settings.png":::
7391

74-
If you want to adjust the voice, select **Voice settings** to make some changes. On the **Voice settings** page, you can adjust the voice type, gender, and the voice. Select the voice sample on the right of **Voice** to determine your voice selection. If you find there is missing voice, you can add the new voice name by selecting **Add speaker**. After changing the settings, select **Update**.
92+
You can make multiple changes to the video, including adjusting the voice settings, adding or removing segments, and changing the time frame of the scripts. You're only charged after you select **Apply changes** to apply your changes. You can select **Save** to save work in progress without incurring any charges.
7593

76-
:::image type="content" source="../../../media/video-translation/voice-settings.png" alt-text="Screenshot of adjusting voice settings on the voice settings page.":::
94+
:::image type="content" source="../../../media/video-translation/apply-changes.png" alt-text="Screenshot of selecting apply changes button after making all changes." lightbox="../../../media/video-translation/apply-changes.png":::
7795

78-
If you make changes multiple times but haven't finished, you only need to save the changes you've made by selecting **Save**. After making all changes, select **Apply changes** to apply them to the video. You'll be charged only after you select **Apply changes**.
96+
## Translate to another language
7997

80-
:::image type="content" source="../../../media/video-translation/apply-changes.png" alt-text="Screenshot of selecting apply changes button after making all changes.":::
98+
You can keep the current translation project and translate the original video into another language.
8199

82-
You can translate the original video into a new language by selecting **New language**. On the **Translate** page, you can choose a new translated language and voice type. Once the video file has been translated, a new project is automatically created.
100+
1. Open your project.
101+
1. Select **+ New language**.
102+
1. On the new **Translate** page that appears, choose a new translated language and voice type. Once the video is translated, a new project is automatically created.
83103

84104
## Related content
85105

0 commit comments

Comments
 (0)