|
1 | 1 | ---
|
2 | 2 | title: Text-to-speech API reference (REST) - Speech service
|
3 | 3 | titleSuffix: Azure Cognitive Services
|
4 |
| -description: Learn how to use the REST API to convert text into synthesized speech. |
| 4 | +description: Learn how to use the REST API to convert text into synthesized speech. |
5 | 5 | services: cognitive-services
|
6 | 6 | author: eric-urban
|
7 | 7 | manager: nitinme
|
@@ -151,9 +151,9 @@ This response has been truncated to illustrate the structure of a response.
|
151 | 151 | ],
|
152 | 152 | "Status": "Preview"
|
153 | 153 | },
|
154 |
| - |
| 154 | + |
155 | 155 | ...
|
156 |
| - |
| 156 | + |
157 | 157 | {
|
158 | 158 | "Name": "Microsoft Server Speech Text to Speech Voice (ga-IE, OrlaNeural)",
|
159 | 159 | "DisplayName": "Orla",
|
@@ -274,33 +274,35 @@ If the HTTP status is `200 OK`, the body of the response contains an audio file
|
274 | 274 |
|
275 | 275 | This is a list of supported audio formats that are sent in each request as the `X-Microsoft-OutputFormat` header. Each format incorporates a bit rate and encoding type. The Speech service supports 48-kHz, 24-kHz, 16-kHz, and 8-kHz audio outputs. Prebuilt neural voices are created from samples that use a 24-khz sample rate. All voices can upsample or downsample to other sample rates when synthesizing.
|
276 | 276 |
|
277 |
| -|Streaming |Non-Streaming | |
278 |
| -|----------------------------------|-------------------------| |
279 |
| -|audio-16khz-16bit-32kbps-mono-opus|riff-8khz-8bit-mono-alaw | |
280 |
| -|audio-16khz-32kbitrate-mono-mp3 |riff-8khz-8bit-mono-mulaw| |
281 |
| -|audio-16khz-64kbitrate-mono-mp3 |riff-8khz-16bit-mono-pcm | |
282 |
| -|audio-16khz-128kbitrate-mono-mp3 |riff-24khz-16bit-mono-pcm| |
283 |
| -|audio-24khz-16bit-24kbps-mono-opus|riff-48khz-16bit-mono-pcm| |
284 |
| -|audio-24khz-16bit-48kbps-mono-opus| | |
285 |
| -|audio-24khz-48kbitrate-mono-mp3 | | |
286 |
| -|audio-24khz-96kbitrate-mono-mp3 | | |
287 |
| -|audio-24khz-160kbitrate-mono-mp3 | | |
288 |
| -|audio-48khz-96kbitrate-mono-mp3 | | |
289 |
| -|audio-48khz-192kbitrate-mono-mp3 | | |
290 |
| -|ogg-16khz-16bit-mono-opus | | |
291 |
| -|ogg-24khz-16bit-mono-opus | | |
292 |
| -|ogg-48khz-16bit-mono-opus | | |
293 |
| -|raw-8khz-8bit-mono-alaw | | |
294 |
| -|raw-8khz-8bit-mono-mulaw | | |
295 |
| -|raw-8khz-16bit-mono-pcm | | |
296 |
| -|raw-16khz-16bit-mono-pcm | | |
297 |
| -|raw-16khz-16bit-mono-truesilk | | |
298 |
| -|raw-24khz-16bit-mono-pcm | | |
299 |
| -|raw-24khz-16bit-mono-truesilk | | |
300 |
| -|raw-48khz-16bit-mono-pcm | | |
301 |
| -|webm-16khz-16bit-mono-opus | | |
302 |
| -|webm-24khz-16bit-24kbps-mono-opus | | |
303 |
| -|webm-24khz-16bit-mono-opus | | |
| 277 | +| Streaming | Non-Streaming | |
| 278 | +| ---------------------------------- | --------------------------- | |
| 279 | +| audio-16khz-16bit-32kbps-mono-opus | riff-8khz-8bit-mono-alaw | |
| 280 | +| audio-16khz-32kbitrate-mono-mp3 | riff-8khz-8bit-mono-mulaw | |
| 281 | +| audio-16khz-64kbitrate-mono-mp3 | riff-8khz-16bit-mono-pcm | |
| 282 | +| audio-16khz-128kbitrate-mono-mp3 | riff-22050hz-16bit-mono-pcm | |
| 283 | +| audio-24khz-16bit-24kbps-mono-opus | riff-24khz-16bit-mono-pcm | |
| 284 | +| audio-24khz-16bit-48kbps-mono-opus | riff-44100hz-16bit-mono-pcm | |
| 285 | +| audio-24khz-48kbitrate-mono-mp3 | riff-48khz-16bit-mono-pcm | |
| 286 | +| audio-24khz-96kbitrate-mono-mp3 | | |
| 287 | +| audio-24khz-160kbitrate-mono-mp3 | | |
| 288 | +| audio-48khz-96kbitrate-mono-mp3 | | |
| 289 | +| audio-48khz-192kbitrate-mono-mp3 | | |
| 290 | +| ogg-16khz-16bit-mono-opus | | |
| 291 | +| ogg-24khz-16bit-mono-opus | | |
| 292 | +| ogg-48khz-16bit-mono-opus | | |
| 293 | +| raw-8khz-8bit-mono-alaw | | |
| 294 | +| raw-8khz-8bit-mono-mulaw | | |
| 295 | +| raw-8khz-16bit-mono-pcm | | |
| 296 | +| raw-16khz-16bit-mono-pcm | | |
| 297 | +| raw-16khz-16bit-mono-truesilk | | |
| 298 | +| raw-22050hz-16bit-mono-pcm | | |
| 299 | +| raw-24khz-16bit-mono-pcm | | |
| 300 | +| raw-24khz-16bit-mono-truesilk | | |
| 301 | +| raw-44100hz-16bit-mono-pcm | | |
| 302 | +| raw-48khz-16bit-mono-pcm | | |
| 303 | +| webm-16khz-16bit-mono-opus | | |
| 304 | +| webm-24khz-16bit-24kbps-mono-opus | | |
| 305 | +| webm-24khz-16bit-mono-opus | | |
304 | 306 |
|
305 | 307 | > [!NOTE]
|
306 | 308 | > en-US-AriaNeural, en-US-JennyNeural and zh-CN-XiaoxiaoNeural are available in public preview in 48Khz output. Other voices support 24khz upsampled to 48khz output.
|
|
0 commit comments