You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/rest-text-to-speech.md
+48-31Lines changed: 48 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -272,37 +272,54 @@ If the HTTP status is `200 OK`, the body of the response contains an audio file
272
272
273
273
## Audio outputs
274
274
275
-
This is a list of supported audio formats that are sent in each request as the `X-Microsoft-OutputFormat` header. Each format incorporates a bit rate and encoding type. The Speech service supports 48-kHz, 24-kHz, 16-kHz, and 8-kHz audio outputs. Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
The supported streaming and non-streaming audio formats are sent in each request as the `X-Microsoft-OutputFormat` header. Each format incorporates a bit rate and encoding type. The Speech service supports 48-kHz, 24-kHz, 16-kHz, and 8-kHz audio outputs. Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
276
+
277
+
#### [Streaming](#tab/streaming)
278
+
279
+
```
280
+
amr-wb-16000hz
281
+
audio-16khz-16bit-32kbps-mono-opus
282
+
audio-16khz-32kbitrate-mono-mp3
283
+
audio-16khz-64kbitrate-mono-mp3
284
+
audio-16khz-128kbitrate-mono-mp3
285
+
audio-24khz-16bit-24kbps-mono-opus
286
+
audio-24khz-16bit-48kbps-mono-opus
287
+
audio-24khz-48kbitrate-mono-mp3
288
+
audio-24khz-96kbitrate-mono-mp3
289
+
audio-24khz-160kbitrate-mono-mp3
290
+
audio-48khz-96kbitrate-mono-mp3
291
+
audio-48khz-192kbitrate-mono-mp3
292
+
ogg-16khz-16bit-mono-opus
293
+
ogg-24khz-16bit-mono-opus
294
+
ogg-48khz-16bit-mono-opus
295
+
raw-8khz-8bit-mono-alaw
296
+
raw-8khz-8bit-mono-mulaw
297
+
raw-8khz-16bit-mono-pcm
298
+
raw-16khz-16bit-mono-pcm
299
+
raw-16khz-16bit-mono-truesilk
300
+
raw-22050hz-16bit-mono-pcm
301
+
raw-24khz-16bit-mono-pcm
302
+
raw-24khz-16bit-mono-truesilk
303
+
raw-44100hz-16bit-mono-pcm
304
+
raw-48khz-16bit-mono-pcm
305
+
webm-16khz-16bit-mono-opus
306
+
webm-24khz-16bit-24kbps-mono-opus
307
+
webm-24khz-16bit-mono-opus
308
+
```
309
+
310
+
#### [NonStreaming](#tab/nonstreaming)
311
+
312
+
```
313
+
riff-8khz-8bit-mono-alaw
314
+
riff-8khz-8bit-mono-mulaw
315
+
riff-8khz-16bit-mono-pcm
316
+
riff-22050hz-16bit-mono-pcm
317
+
riff-24khz-16bit-mono-pcm
318
+
riff-44100hz-16bit-mono-pcm
319
+
riff-48khz-16bit-mono-pcm
320
+
```
321
+
322
+
***
306
323
307
324
> [!NOTE]
308
325
> en-US-AriaNeural, en-US-JennyNeural and zh-CN-XiaoxiaoNeural are available in public preview in 48Khz output. Other voices support 24khz upsampled to 48khz output.
0 commit comments