You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/rest-text-to-speech.md
+44-31Lines changed: 44 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -272,37 +272,50 @@ If the HTTP status is `200 OK`, the body of the response contains an audio file
272
272
273
273
## Audio outputs
274
274
275
-
This is a list of supported audio formats that are sent in each request as the `X-Microsoft-OutputFormat` header. Each format incorporates a bit rate and encoding type. The Speech service supports 48-kHz, 24-kHz, 16-kHz, and 8-kHz audio outputs. Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
The supported streaming and non-streaming audio formats are sent in each request as the `X-Microsoft-OutputFormat` header. Each format incorporates a bit rate and encoding type. The Speech service supports 48-kHz, 24-kHz, 16-kHz, and 8-kHz audio outputs. Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
276
+
277
+
#### [Streaming](#tab/streaming)
278
+
279
+
amr-wb-16000hz
280
+
audio-16khz-16bit-32kbps-mono-opus
281
+
audio-16khz-32kbitrate-mono-mp3
282
+
audio-16khz-64kbitrate-mono-mp3
283
+
audio-16khz-128kbitrate-mono-mp3
284
+
audio-24khz-16bit-24kbps-mono-opus
285
+
audio-24khz-16bit-48kbps-mono-opus
286
+
audio-24khz-48kbitrate-mono-mp3
287
+
audio-24khz-96kbitrate-mono-mp3
288
+
audio-24khz-160kbitrate-mono-mp3
289
+
audio-48khz-96kbitrate-mono-mp3
290
+
audio-48khz-192kbitrate-mono-mp3
291
+
ogg-16khz-16bit-mono-opus
292
+
ogg-24khz-16bit-mono-opus
293
+
ogg-48khz-16bit-mono-opus
294
+
raw-8khz-8bit-mono-alaw
295
+
raw-8khz-8bit-mono-mulaw
296
+
raw-8khz-16bit-mono-pcm
297
+
raw-16khz-16bit-mono-pcm
298
+
raw-16khz-16bit-mono-truesilk
299
+
raw-22050hz-16bit-mono-pcm
300
+
raw-24khz-16bit-mono-pcm
301
+
raw-24khz-16bit-mono-truesilk
302
+
raw-44100hz-16bit-mono-pcm
303
+
raw-48khz-16bit-mono-pcm
304
+
webm-16khz-16bit-mono-opus
305
+
webm-24khz-16bit-24kbps-mono-opus |
306
+
webm-24khz-16bit-mono-opus
307
+
308
+
#### [NonStreaming](#tab/nonstreaming)
309
+
310
+
riff-8khz-8bit-mono-alaw
311
+
riff-8khz-8bit-mono-mulaw
312
+
riff-8khz-16bit-mono-pcm
313
+
riff-22050hz-16bit-mono-pcm
314
+
riff-24khz-16bit-mono-pcm
315
+
riff-44100hz-16bit-mono-pcm
316
+
riff-48khz-16bit-mono-pcm
317
+
318
+
***
306
319
307
320
> [!NOTE]
308
321
> en-US-AriaNeural, en-US-JennyNeural and zh-CN-XiaoxiaoNeural are available in public preview in 48Khz output. Other voices support 24khz upsampled to 48khz output.
0 commit comments