Skip to content

Commit aaf81a5

Browse files
Merge pull request #209778 from yulin-li/yulin/format
[CogSvc] update new added audio formats
2 parents 73b6bc7 + ac1f0b4 commit aaf81a5

File tree

1 file changed

+32
-30
lines changed

1 file changed

+32
-30
lines changed

articles/cognitive-services/Speech-Service/rest-text-to-speech.md

Lines changed: 32 additions & 30 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
22
title: Text-to-speech API reference (REST) - Speech service
33
titleSuffix: Azure Cognitive Services
4-
description: Learn how to use the REST API to convert text into synthesized speech.
4+
description: Learn how to use the REST API to convert text into synthesized speech.
55
services: cognitive-services
66
author: eric-urban
77
manager: nitinme
@@ -151,9 +151,9 @@ This response has been truncated to illustrate the structure of a response.
151151
],
152152
"Status": "Preview"
153153
},
154-
154+
155155
...
156-
156+
157157
{
158158
"Name": "Microsoft Server Speech Text to Speech Voice (ga-IE, OrlaNeural)",
159159
"DisplayName": "Orla",
@@ -274,33 +274,35 @@ If the HTTP status is `200 OK`, the body of the response contains an audio file
274274

275275
This is a list of supported audio formats that are sent in each request as the `X-Microsoft-OutputFormat` header. Each format incorporates a bit rate and encoding type. The Speech service supports 48-kHz, 24-kHz, 16-kHz, and 8-kHz audio outputs. Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
276276

277-
|Streaming |Non-Streaming |
278-
|----------------------------------|-------------------------|
279-
|audio-16khz-16bit-32kbps-mono-opus|riff-8khz-8bit-mono-alaw |
280-
|audio-16khz-32kbitrate-mono-mp3 |riff-8khz-8bit-mono-mulaw|
281-
|audio-16khz-64kbitrate-mono-mp3 |riff-8khz-16bit-mono-pcm |
282-
|audio-16khz-128kbitrate-mono-mp3 |riff-24khz-16bit-mono-pcm|
283-
|audio-24khz-16bit-24kbps-mono-opus|riff-48khz-16bit-mono-pcm|
284-
|audio-24khz-16bit-48kbps-mono-opus| |
285-
|audio-24khz-48kbitrate-mono-mp3 | |
286-
|audio-24khz-96kbitrate-mono-mp3 | |
287-
|audio-24khz-160kbitrate-mono-mp3 | |
288-
|audio-48khz-96kbitrate-mono-mp3 | |
289-
|audio-48khz-192kbitrate-mono-mp3 | |
290-
|ogg-16khz-16bit-mono-opus | |
291-
|ogg-24khz-16bit-mono-opus | |
292-
|ogg-48khz-16bit-mono-opus | |
293-
|raw-8khz-8bit-mono-alaw | |
294-
|raw-8khz-8bit-mono-mulaw | |
295-
|raw-8khz-16bit-mono-pcm | |
296-
|raw-16khz-16bit-mono-pcm | |
297-
|raw-16khz-16bit-mono-truesilk | |
298-
|raw-24khz-16bit-mono-pcm | |
299-
|raw-24khz-16bit-mono-truesilk | |
300-
|raw-48khz-16bit-mono-pcm | |
301-
|webm-16khz-16bit-mono-opus | |
302-
|webm-24khz-16bit-24kbps-mono-opus | |
303-
|webm-24khz-16bit-mono-opus | |
277+
| Streaming | Non-Streaming |
278+
| ---------------------------------- | --------------------------- |
279+
| audio-16khz-16bit-32kbps-mono-opus | riff-8khz-8bit-mono-alaw |
280+
| audio-16khz-32kbitrate-mono-mp3 | riff-8khz-8bit-mono-mulaw |
281+
| audio-16khz-64kbitrate-mono-mp3 | riff-8khz-16bit-mono-pcm |
282+
| audio-16khz-128kbitrate-mono-mp3 | riff-22050hz-16bit-mono-pcm |
283+
| audio-24khz-16bit-24kbps-mono-opus | riff-24khz-16bit-mono-pcm |
284+
| audio-24khz-16bit-48kbps-mono-opus | riff-44100hz-16bit-mono-pcm |
285+
| audio-24khz-48kbitrate-mono-mp3 | riff-48khz-16bit-mono-pcm |
286+
| audio-24khz-96kbitrate-mono-mp3 | |
287+
| audio-24khz-160kbitrate-mono-mp3 | |
288+
| audio-48khz-96kbitrate-mono-mp3 | |
289+
| audio-48khz-192kbitrate-mono-mp3 | |
290+
| ogg-16khz-16bit-mono-opus | |
291+
| ogg-24khz-16bit-mono-opus | |
292+
| ogg-48khz-16bit-mono-opus | |
293+
| raw-8khz-8bit-mono-alaw | |
294+
| raw-8khz-8bit-mono-mulaw | |
295+
| raw-8khz-16bit-mono-pcm | |
296+
| raw-16khz-16bit-mono-pcm | |
297+
| raw-16khz-16bit-mono-truesilk | |
298+
| raw-22050hz-16bit-mono-pcm | |
299+
| raw-24khz-16bit-mono-pcm | |
300+
| raw-24khz-16bit-mono-truesilk | |
301+
| raw-44100hz-16bit-mono-pcm | |
302+
| raw-48khz-16bit-mono-pcm | |
303+
| webm-16khz-16bit-mono-opus | |
304+
| webm-24khz-16bit-24kbps-mono-opus | |
305+
| webm-24khz-16bit-mono-opus | |
304306

305307
> [!NOTE]
306308
> en-US-AriaNeural, en-US-JennyNeural and zh-CN-XiaoxiaoNeural are available in public preview in 48Khz output. Other voices support 24khz upsampled to 48khz output.

0 commit comments

Comments
 (0)