Skip to content

Commit 075180e

Browse files
committed
update video translation guide
1 parent 144f6f1 commit 075180e

12 files changed

+100
-7
lines changed

articles/ai-services/speech-service/includes/how-to/video-translation/ai-foundry.md

Lines changed: 96 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -16,26 +16,34 @@ In this article, you learn how to use video translation with Azure AI Speech in
1616
> [!TIP]
1717
> Try out video translation in the [Azure AI Foundry portal](https://ai.azure.com/) before using the API. Use the [video translation REST API](?pivots=rest-api) to integrate video translation into your applications. For more information about the API, see [Video translation REST API](/rest/api/aiservices/videotranslation/translation-operations/create-translation).
1818
19+
## Pre-requisites
20+
21+
- An Azure subscription. If you don't have an Azure subscription, create a free account before you begin.
22+
- An Azure AI Services resource for Speech [in a supported region](../../../video-translation-overview.md#supported-regions-and-languages). If you don't have a Speech resource, create one in the [Azure portal](https://portal.azure.com/).
23+
- An [Azure Blob Storage](/azure/storage/blobs/storage-blobs-overview) account.
24+
- You need a video file in .mp4 format, less than 5 GB, and shorter than 4 hours. For testing purposes, you can use the sample video file provided by Microsoft at [https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4](https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4).
25+
- Make sure video translation supports your [source and target language](../../../language-support.md?tabs=speech-translation#video-translation).
26+
1927
## Try out video translation
2028

21-
To try out video translation, follow these steps:
29+
To try out the video translation demo, follow these steps:
2230

2331
1. Go to the [model catalog in Azure AI Foundry portal](https://ai.azure.com/explore/models).
2432

2533
1. Enter and search for "Azure-AI-Speech" in the catalog search box.
2634

27-
:::image type="content" source="../../../media/video-translation/search-model-catalog.png" alt-text="Screenshot of the model catalog in Azure AI Foundry portal." lightbox="../../../media/video-translation/search-model-catalog.png":::
35+
:::image type="content" source="../../../media/video-translation/foundry-portal/search-model-catalog.png" alt-text="Screenshot of the model catalog in Azure AI Foundry portal." lightbox="../../../media/video-translation/foundry-portal/search-model-catalog.png":::
2836

2937
1. Select **Azure-AI-Speech** and you're taken to the **Azure-AI-Speech** try out page.
3038
1. Select **Speech capabilities by scenario** > **Video translation**.
3139

32-
:::image type="content" source="../../../media/video-translation/capabilities-by-scenario.png" alt-text="Screenshot of filtering Speech service capabilities by scenario." lightbox="../../../media/video-translation/capabilities-by-scenario.png":::
40+
:::image type="content" source="../../../media/video-translation/foundry-portal/capabilities-by-scenario.png" alt-text="Screenshot of filtering Speech service capabilities by scenario." lightbox="../../../media/video-translation/foundry-portal/capabilities-by-scenario.png":::
3341

3442
1. Under the **Sample option** to the right, select personal or prebuilt voice.
3543

3644
1. Select the **Play** button to hear the translated audio. Select the original video tab to play the original audio.
3745

38-
:::image type="content" source="../../../media/video-translation/compare-original-translated.png" alt-text="Screenshot of selecting a voice type on the new project page." lightbox="../../../media/video-translation/compare-original-translated.png":::
46+
:::image type="content" source="../../../media/video-translation/foundry-portal/compare-original-translated.png" alt-text="Screenshot of selecting a voice type on the new project page." lightbox="../../../media/video-translation/foundry-portal/compare-original-translated.png":::
3947

4048
The voice type options are:
4149
- **Prebuilt voice**: The service automatically selects the most suitable prebuilt voice by matching the speaker's voice in the video with prebuilt voices.
@@ -45,6 +53,90 @@ To try out video translation, follow these steps:
4553
> To use personal voice via the API, you need to apply for [access](https://aka.ms/customneural).
4654
4755

56+
## Create a video translation project
57+
58+
To create a video translation project, follow these steps:
59+
60+
1. Go to the [model catalog in Azure AI Foundry portal](https://ai.azure.com/explore/models).
61+
62+
1. Enter and search for "Azure-AI-Speech" in the catalog search box.
63+
64+
:::image type="content" source="../../../media/video-translation/foundry-portal/search-model-catalog.png" alt-text="Screenshot of the model catalog in Azure AI Foundry portal." lightbox="../../../media/video-translation/foundry-portal/search-model-catalog.png":::
65+
66+
1. Select **Azure-AI-Speech** and you're taken to the **Azure-AI-Speech** try out page.
67+
1. Select **Speech capabilities by scenario** > **Video translation**.
68+
69+
:::image type="content" source="../../../media/video-translation/foundry-portal/capabilities-by-scenario.png" alt-text="Screenshot of filtering Speech service capabilities by scenario." lightbox="../../../media/video-translation/foundry-portal/capabilities-by-scenario.png":::
70+
71+
1. Select **My videos** > **Upload video**.
72+
73+
:::image type="content" source="../../../media/video-translation/foundry-portal/select-upload-video.png" alt-text="Screenshot of selecting the upload video button." lightbox="../../../media/video-translation/foundry-portal/select-upload-video.png":::
74+
75+
1. On the **Upload video** page, select a **Voice type**.
76+
77+
:::image type="content" source="../../../media/video-translation/foundry-portal/select-voice-type.png" alt-text="Screenshot of selecting a voice type on the new project page." lightbox="../../../media/video-translation/foundry-portal/select-voice-type.png":::
78+
79+
The voice type options are:
80+
- **Prebuilt neural voice**: The service automatically selects the most suitable prebuilt voice by matching the speaker's voice in the video with prebuilt voices.
81+
- **Personal voice**: Use the personal voice that matches the voice of the speakers in the video.
82+
83+
> [!NOTE]
84+
> To use personal voice, you need to apply for [access](https://aka.ms/customneural).
85+
86+
1. Upload your video file by dragging and dropping the video file or selecting the file manually. The video must be in .mp4 format, less than 5 GB, and shorter than 4 hours.
87+
88+
1. Provide the **Number of speakers**, **Language of the video**, and **Translate to** language.
89+
90+
1. Select the boxes to acknowledge the pricing information and code of conduct.
91+
92+
1. Select **Next: Advanced settings** if you want to adjust the advanced settings.
93+
94+
:::image type="content" source="../../../media/video-translation/foundry-portal/provide-video-information.png" alt-text="Screenshot of providing video information on the new project page." lightbox="../../../media/video-translation/foundry-portal/provide-video-information.png":::
95+
96+
1. Optionally, you can adjust the following settings:
97+
98+
- **Lexicon file**: This option allows you to add custom words or phrases that the system should recognize and pronounce correctly. You can create a lexicon file in the [audio content creation tool in the Speech Studio](https://aka.ms/speechstudio) and select it here.
99+
- **Burn subtitles**: This option allows you to add subtitles to the video. The subtitle file can be in WebVTT or JSON format. You can download a sample WebVTT file for your reference by selecting **Download sample VTT file**.
100+
101+
:::image type="content" source="../../../media/video-translation/foundry-portal/provide-video-information-advanced.png" alt-text="Screenshot of providing lexicon and subtitle information while creating a new project." lightbox="../../../media/video-translation/foundry-portal/provide-video-information-advanced.png":::
102+
103+
If you want to use your own subtitle files, select **Subtitle** > **Upload your own**. You can choose to upload either the source subtitle file or the target subtitle file.
104+
- Automatic subtitles: Results in both source and target language subtitles.
105+
- Upload source language subtitles: Results in both source and target language subtitles.
106+
- Upload target language subtitles: Results in only target language subtitles.
107+
108+
1. Select **Create**.
109+
110+
Once the upload to Azure Blob Storage is complete, you can check the processing status on the project tab.
111+
112+
After the project is created, you can select the project to review detailed settings and make adjustments according to your preferences.
113+
114+
## Check and adjust voice settings
115+
116+
Select **My videos** and you should see a video labeled with **Succeeded** status.
117+
118+
:::image type="content" source="../../../media/video-translation/foundry-portal/select-succeeded-video.png" alt-text="Screenshot of selecting a successfully translated video." lightbox="../../../media/video-translation/foundry-portal/select-succeeded-video.png":::
119+
120+
Select the video to see the **Translated** and **Original** tabs under **Video**. You can compare the original and translated videos by selecting the corresponding tab. The translated video is generated automatically, and you can play it to check the translation quality.
121+
122+
:::image type="content" source="../../../media/video-translation/foundry-portal/compare-original-translated-custom.png" alt-text="Screenshot of reviewing details of the translated video." lightbox="../../../media/video-translation/foundry-portal/compare-original-translated-custom.png":::
123+
124+
To the right side of the video, you can view both the original script and the translated script. Hovering over each part of the original script triggers the video to automatically jump to the corresponding segment of the original video, while hovering over each part of the translated script triggers the video to jump to the corresponding translated segment.
125+
126+
You can make multiple changes to the video, including adjusting the voice settings, adding or removing segments, and changing the time frame of the scripts. You're only charged after you select **Apply changes** to apply your changes. You can select **Save** to save work in progress without incurring any charges.
127+
128+
If you encounter segments with an "unidentified" voice name, it might be because the system couldn't accurately detect the voice, especially in situations where speaker voices overlap. In such cases, it's advisable to manually change the voice name.
129+
130+
:::image type="content" source="../../../media/video-translation/foundry-portal/voice-unidentified.png" alt-text="Screenshot of one segment with unidentified voice name." lightbox="../../../media/video-translation/foundry-portal/voice-unidentified.png":::
131+
132+
## Translate to another language
133+
134+
You can keep the current translation project and translate the original video into another language.
135+
136+
1. Select **My videos** and then select the tile for your video translation.
137+
1. Select **+ New language**.
138+
1. On the new **Translate to new language** page that appears, choose a new translation language and voice type. Once the video is translated, a new project is automatically created.
139+
48140
## Related content
49141

50142
- [Video translation overview](../../../video-translation-overview.md)

articles/ai-services/speech-service/includes/how-to/video-translation/speech-studio.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -14,8 +14,9 @@ In this article, you learn how to use video translation with Azure AI Speech in
1414
## Pre-requisites
1515

1616
- An Azure subscription. If you don't have an Azure subscription, create a free account before you begin.
17-
- A Speech resource [in a supported region](../../../video-translation-overview.md#supported-regions-and-languages). If you don't have a Speech resource, create one in the [Azure portal](https://portal.azure.com/).
18-
- You need a video file in .mp4 format, less than 500 MB, and shorter than 60 minutes. For testing purposes, you can use the sample video file provided by Microsoft at [https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4](https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4).
17+
- An Azure AI Services resource for Speech [in a supported region](../../../video-translation-overview.md#supported-regions-and-languages). If you don't have a Speech resource, create one in the [Azure portal](https://portal.azure.com/).
18+
- An [Azure Blob Storage](/azure/storage/blobs/storage-blobs-overview) account.
19+
- You need a video file in .mp4 format, less than 5 GB, and shorter than 4 hours. For testing purposes, you can use the sample video file provided by Microsoft at [https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4](https://speechstudioprodpublicsa.blob.core.windows.net/ttsvoice/VideoTranslation/PublicDoc/SampleData/es-ES-TryOutOriginal.mp4).
1920
- Make sure video translation supports your [source and target language](../../../language-support.md?tabs=speech-translation#video-translation).
2021

2122
## Create a video translation project
@@ -41,7 +42,7 @@ To create a video translation project, follow these steps:
4142
> [!NOTE]
4243
> To use personal voice, you need to apply for [access](https://aka.ms/customneural).
4344
44-
1. Upload your video file by dragging and dropping the video file or selecting the file manually. The video must be in .mp4 format, less than 500 MB, and shorter than 60 minutes.
45+
1. Upload your video file by dragging and dropping the video file or selecting the file manually. The video must be in .mp4 format, less than 5 GB, and shorter than 4 hours.
4546

4647
1. Provide the **Project name**, **Number of speakers**, **Language of the video**, and **Translate to** language.
4748

Loading
589 KB
Loading
Loading
162 KB
Loading
130 KB
Loading
164 KB
Loading

0 commit comments

Comments
 (0)