Merge pull request #281710 from sally-baolian/patch-273

prmerger-automator[bot] · web-flow · commit 702aad260052 · 2024-07-25T13:57:33.000Z
TTS Avatar GA release on July 25th in Pacific time
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/avatar-gestures-with-ssml.md b/articles/ai-services/speech-service/text-to-speech-avatar/avatar-gestures-with-ssml.md
@@ -11,9 +11,7 @@ ms.author: eur
 author: eric-urban
 ---
 
-# Customize text to speech avatar gestures with SSML (preview)
-
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
+# Customize text to speech avatar gestures with SSML
 
 The [Speech Synthesis Markup Language (SSML)](../speech-synthesis-markup-structure.md) with input text determines the structure, content, and other characteristics of the text to speech output. Most SSML tags can also work in text to speech avatar. Furthermore, text to speech avatar batch mode provides avatar gestures insertion ability by using the SSML bookmark element with the format `<bookmark mark='gesture.*'/>`. 
 
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar-properties.md b/articles/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar-properties.md
@@ -11,9 +11,7 @@ ms.author: eur
 author: eric-urban
 ---
 
-# Batch synthesis properties for text to speech avatar (preview)
-
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
+# Batch synthesis properties for text to speech avatar
 
 Batch synthesis properties can be grouped as: avatar related properties, batch job related properties, and text to speech related properties,  which are described in the following tables.
 
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar.md b/articles/ai-services/speech-service/text-to-speech-avatar/batch-synthesis-avatar.md
@@ -11,11 +11,9 @@ ms.author: eur
 author: eric-urban
 ---
 
-# How to use batch synthesis for text to speech avatar (preview)
+# How to use batch synthesis for text to speech avatar
 
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
-
-The batch synthesis API for text to speech avatar (preview) allows for the asynchronous synthesis of text into a talking avatar as a video file. Publishers and video content platforms can utilize this API to create avatar video content in a batch. That approach can be suitable for various use cases such as training materials, presentations, or advertisements.
+The batch synthesis API for text to speech avatar allows for the asynchronous synthesis of text into a talking avatar as a video file. Publishers and video content platforms can utilize this API to create avatar video content in a batch. That approach can be suitable for various use cases such as training materials, presentations, or advertisements.
 
 The synthetic avatar video will be generated asynchronously after the system receives text input. The generated video output can be downloaded in batch mode synthesis. You submit text for synthesis, poll for the synthesis status, and download the video output when the status indicates success. The text input formats must be plain text or Speech Synthesis Markup Language (SSML) text. 
 
@@ -27,10 +25,10 @@ To perform batch synthesis, you can use the following REST API operations.
 
 | Operation            | Method  | REST API call                                      |
 |----------------------|---------|---------------------------------------------------|
-| [Create batch synthesis](#create-a-batch-synthesis-request) | PUT    | avatar/batchsyntheses/{SynthesisId}?api-version=2024-04-15-preview |
-| [Get batch synthesis](#get-batch-synthesis)    | GET     | avatar/batchsyntheses/{SynthesisId}?api-version=2024-04-15-preview |
-| [List batch synthesis](#list-batch-synthesis)   | GET     | avatar/batchsyntheses/?api-version=2024-04-15-preview |
-| [Delete batch synthesis](#delete-batch-synthesis) | DELETE  | avatar/batchsyntheses/{SynthesisId}?api-version=2024-04-15-preview |
+| [Create batch synthesis](#create-a-batch-synthesis-request) | PUT    | avatar/batchsyntheses/{SynthesisId}?api-version=2024-08-01 |
+| [Get batch synthesis](#get-batch-synthesis)    | GET     | avatar/batchsyntheses/{SynthesisId}?api-version=2024-08-01 |
+| [List batch synthesis](#list-batch-synthesis)   | GET     | avatar/batchsyntheses/?api-version=2024-08-01 |
+| [Delete batch synthesis](#delete-batch-synthesis) | DELETE  | avatar/batchsyntheses/{SynthesisId}?api-version=2024-08-01 |
 
 You can refer to the code samples on [GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch-avatar).
 
@@ -67,7 +65,7 @@ curl -v -X PUT -H "Ocp-Apim-Subscription-Key: YourSpeechKey" -H "Content-Type: a
         "talkingAvatarCharacter": "lisa",
         "talkingAvatarStyle": "graceful-sitting"
     }
-}'  "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/my-job-01?api-version=2024-04-15-preview"
+}'  "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/my-job-01?api-version=2024-08-01"
 ```
 
 You should receive a response body in the following format:
@@ -106,7 +104,7 @@ To retrieve the status of a batch synthesis job, make an HTTP GET request using
 Replace `YourSynthesisId` with your batch synthesis ID, `YourSpeechKey` with your Speech resource key, and `YourSpeechRegion` with your Speech resource region.
 
 ```azurecli-interactive
-curl -v -X GET "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/YourSynthesisId?api-version=2024-04-15-preview" -H "Ocp-Apim-Subscription-Key: YourSpeechKey"
+curl -v -X GET "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/YourSynthesisId?api-version=2024-08-01" -H "Ocp-Apim-Subscription-Key: YourSpeechKey"
 ```
 
 You should receive a response body in the following format:
@@ -157,7 +155,7 @@ To list all batch synthesis jobs for your Speech resource, make an HTTP GET requ
 Replace `YourSpeechKey` with your Speech resource key and `YourSpeechRegion` with your Speech resource region. Optionally, you can set the `skip` and `top` (page size) query parameters in the URL. The default value for `skip` is 0, and the default value for `maxpagesize` is 100.
 
 ```azurecli-interactive
-curl -v -X GET "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses?skip=0&maxpagesize=2&api-version=2024-04-15-preview" -H "Ocp-Apim-Subscription-Key: YourSpeechKey"
+curl -v -X GET "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses?skip=0&maxpagesize=2&api-version=2024-08-01" -H "Ocp-Apim-Subscription-Key: YourSpeechKey"
 ```
 
 You receive a response body in the following format:
@@ -232,7 +230,7 @@ You receive a response body in the following format:
             }
         }
     ],
-    "nextLink": "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/?api-version=2024-04-15-preview&skip=2&maxpagesize=2"
+    "nextLink": "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/?api-version=2024-08-01&skip=2&maxpagesize=2"
 }
 ```
 
@@ -283,7 +281,7 @@ After you have retrieved the audio output results and no longer need the batch s
 To delete a batch synthesis job, make an HTTP DELETE request using the following URI format. Replace `YourSynthesisId` with your batch synthesis ID, `YourSpeechKey` with your Speech resource key, and `YourSpeechRegion` with your Speech resource region.
 
 ```azurecli-interactive
-curl -v -X DELETE "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/YourSynthesisId?api-version=2024-04-15-preview" -H "Ocp-Apim-Subscription-Key: YourSpeechKey"
+curl -v -X DELETE "https://YourSpeechRegion.api.cognitive.microsoft.com/avatar/batchsyntheses/YourSynthesisId?api-version=2024-08-01" -H "Ocp-Apim-Subscription-Key: YourSpeechKey"
 ```
 
 The response headers include `HTTP/1.1 204 No Content` if the delete request was successful.
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/custom-avatar-create.md b/articles/ai-services/speech-service/text-to-speech-avatar/custom-avatar-create.md
@@ -11,9 +11,7 @@ ms.author: eur
 author: eric-urban
 ---
 
-# How to create a custom text to speech avatar (preview)
-
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
+# How to create a custom text to speech avatar
 
 Getting started with a custom text to speech avatar is a straightforward process. All it takes are a few of video files. If you'd like to train a [custom neural voice](../custom-neural-voice.md) for the same actor, you can do so separately.
 
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/custom-avatar-record-video-samples.md b/articles/ai-services/speech-service/text-to-speech-avatar/custom-avatar-record-video-samples.md
@@ -11,9 +11,7 @@ ms.author: v-baolianzou
 keywords: how to record video samples for custom text to speech avatar
 ---
 
-# How to record video samples for custom text to speech avatar (preview)
-
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
+# How to record video samples for custom text to speech avatar
 
 This article provides instructions on preparing high-quality video samples for creating a custom text to speech avatar.
 
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/real-time-synthesis-avatar.md b/articles/ai-services/speech-service/text-to-speech-avatar/real-time-synthesis-avatar.md
@@ -1,5 +1,5 @@
 ---
-title: Real-time synthesis for text to speech avatar (preview) - Speech service
+title: Real-time synthesis for text to speech avatar - Speech service
 titleSuffix: Azure AI services
 description: Learn how to use text to speech avatar with real-time synthesis.
 manager: nitinme
@@ -11,11 +11,9 @@ ms.author: eur
 author: eric-urban
 ---
 
-# How to do real-time synthesis for text to speech avatar (preview)
+# How to do real-time synthesis for text to speech avatar
 
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
-
-In this how-to guide, you learn how to use text to speech avatar (preview) with real-time synthesis. The synthetic avatar video will be generated in almost real time after the system receives the text input.
+In this how-to guide, you learn how to use text to speech avatar with real-time synthesis. The synthetic avatar video will be generated in almost real time after the system receives the text input.
 
 ## Prerequisites
 
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/what-is-custom-text-to-speech-avatar.md b/articles/ai-services/speech-service/text-to-speech-avatar/what-is-custom-text-to-speech-avatar.md
@@ -11,9 +11,7 @@ ms.author: eur
 author: eric-urban
 ---
 
-# What is custom text to speech avatar? (preview)
-
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
+# What is custom text to speech avatar?
 
 Custom text to speech avatar allows you to create a customized, one-of-a-kind synthetic talking avatar for your application. With custom text to speech avatar, you can build a unique and natural-looking avatar for your product or brand by providing video recording data of your selected actors. If you also create a [custom neural voice](#custom-voice-and-custom-text-to-speech-avatar) for the same actor and use it as the avatar's voice, the avatar will be even more realistic.
 
@@ -41,7 +39,7 @@ Here's an overview of the steps to create a custom text to speech avatar:
 
 1. **Prepare training data:** Ensure that the video recording is in the right format. It's a good idea to shoot the video recording in a professional-quality video shooting studio to get a clean background image. The quality of the resulting avatar heavily depends on the recorded video used for training. Factors like speaking rate, body posture, facial expression, hand gestures, consistency in the actor's position, and lighting of the video recording are essential to create an engaging custom text to speech avatar.
 
-1. **Train the avatar model:** We'll start training the custom text to speech model after verifying the consent statement of the avatar talent. In the preview stage of this service, this step will be done manually by Microsoft. You'll be notified after the model is successfully trained.
+1. **Train the avatar model:** We'll start training the custom text to speech model after verifying the consent statement of the avatar talent. This step is currently manually done by Microsoft. You'll be notified after the model is successfully trained.
 
 1. **Deploy and use your avatar model in your APPs**
 
diff --git a/articles/ai-services/speech-service/text-to-speech-avatar/what-is-text-to-speech-avatar.md b/articles/ai-services/speech-service/text-to-speech-avatar/what-is-text-to-speech-avatar.md
@@ -12,9 +12,7 @@ author: eric-urban
 ms.custom: references_regions
 ---
 
-# Text to speech avatar overview (preview)
-
-[!INCLUDE [Text to speech avatar preview](../includes/text-to-speech-avatar-preview.md)]
+# Text to speech avatar overview
 
 Text to speech avatar converts text into a digital video of a photorealistic human (either a prebuilt avatar or a [custom text to speech avatar](#custom-text-to-speech-avatar)) speaking with a natural-sounding voice. The text to speech avatar video can be synthesized asynchronously or in real time. Developers can build applications integrated with text to speech avatar through an API, or use a content creation tool on Speech Studio to create video content without coding.
 
@@ -44,7 +42,7 @@ The voice in the synthetic video could be a prebuilt neural voice available on A
 
 Both batch synthesis and real-time synthesis resolution are 1920 x 1080, and the frames per second (FPS) are 25. Batch synthesis codec can be h264 or h265 if the format is mp4 and can set codec as vp9 if the format is `webm`; only `webm` can contain an alpha channel. Real-time synthesis codec is h264. Video bitrate can be configured for both batch synthesis and real-time synthesis in the request; the default value is 2000000; more detailed configurations can be found in the sample code.
 
-|                  | Batch synthesis  | Real-Time synthesis |
+|                  | Batch synthesis  | Real-time synthesis |
 |------------------|------------------|----------------------|
 | **Resolution**   | 1920 x 1080      | 1920 x 1080          |
 | **FPS**          | 25               | 25                   |