Skip to content

Commit b420910

Browse files
authored
Merge pull request #201797 from sally-baolian/patch-31
Update long-audio-api.md
2 parents 8792797 + 18ba385 commit b420910

File tree

2 files changed

+9
-0
lines changed

2 files changed

+9
-0
lines changed

articles/cognitive-services/Speech-Service/long-audio-api.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -35,6 +35,9 @@ When preparing your text file, make sure it:
3535
* For plain text, each paragraph is separated by hitting **Enter/Return**. See [plain text input example](https://github.com/Azure-Samples/Cognitive-Speech-TTS/blob/master/CustomVoice-API-Samples/Java/en-US.txt).
3636
* For SSML text, each SSML piece is considered a paragraph. Separate SSML pieces by different paragraphs. See [SSML text input example](https://github.com/Azure-Samples/Cognitive-Speech-TTS/blob/master/CustomVoice-API-Samples/Java/SSMLTextInputSample.txt).
3737

38+
> [!NOTE]
39+
> When using SSML text, be sure to use the [supported SSML elements](speech-synthesis-markup.md?tabs=csharp#supported-ssml-elements) except the `audio` and `mstts:backgroundaudio` elements. The `audio` and `mstts:backgroundaudio` elements are not supported by Long Audio API. The `audio` element will be ignored without any error message. The `mstts:backgroundaudio` element will cause the systhesis task failure. If your synthesis task fails, download the audio result (.zip file) and check the error report with suffix name "err.txt" within the zip file for details.
40+
3841
## Sample code
3942

4043
The rest of this page focuses on Python, but sample code for the Long Audio API is available on GitHub for the following programming languages:

articles/cognitive-services/Speech-Service/speech-synthesis-markup.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -771,6 +771,9 @@ Any audio included in the SSML document must meet these requirements:
771771
* The combined total time for all text and audio files in a single response can't exceed 600 seconds.
772772
* The audio must not contain any customer-specific or other sensitive information.
773773

774+
> [!NOTE]
775+
> The 'audio' element is not supported by the Long Audio API.
776+
774777
**Syntax**
775778

776779
```xml
@@ -807,6 +810,9 @@ If the background audio provided is shorter than the text-to-speech or the fade
807810

808811
Only one background audio file is allowed per SSML document. You can intersperse `audio` tags within the `voice` element to add more audio to your SSML document.
809812

813+
> [!NOTE]
814+
> The `mstts:backgroundaudio` element is not supported by the Long Audio API.
815+
810816
**Syntax**
811817

812818
```xml

0 commit comments

Comments
 (0)