You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/long-audio-api.md
+7-7Lines changed: 7 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ manager: nitinme
8
8
ms.service: cognitive-services
9
9
ms.subservice: speech-service
10
10
ms.topic: how-to
11
-
ms.date: 09/25/2022
11
+
ms.date: 11/11/2022
12
12
ms.author: eur
13
13
---
14
14
@@ -126,7 +126,7 @@ Prepare an input text file, in either plain text or SSML text, then add the foll
126
126
127
127
> [!NOTE]
128
128
> `concatenateResult` is an optional parameter. If this parameter isn't set, the audio outputs will be generated per paragraph. You can also concatenate the audios into one output by including the parameter.
129
-
> `outputFormat` is also optional. By default, the audio output is set to `riff-16khz-16bit-mono-pcm`. For more information about supported audio output formats, see [Audio output formats](#audio-output-formats).
129
+
> `outputFormat` is also optional. By default, the audio output is set to `riff-24khz-16bit-mono-pcm`. For more information about supported audio output formats, see [Audio output formats](#audio-output-formats).
130
130
131
131
```python
132
132
defsubmit_synthesis():
@@ -150,7 +150,7 @@ def submit_synthesis():
150
150
'description': 'sample description',
151
151
'locale': locale,
152
152
'voices': json.dumps(voice_identities),
153
-
'outputformat': 'riff-16khz-16bit-mono-pcm',
153
+
'outputformat': 'riff-24khz-16bit-mono-pcm',
154
154
'concatenateresult': True,
155
155
}
156
156
@@ -228,7 +228,7 @@ response.status_code: 200
228
228
}
229
229
],
230
230
"properties": {
231
-
"outputFormat": "riff-16khz-16bit-mono-pcm",
231
+
"outputFormat": "riff-24khz-16bit-mono-pcm",
232
232
"concatenateResult": false,
233
233
"totalDuration": "PT5M57.252S",
234
234
"billableCharacterCount": 3048
@@ -339,7 +339,7 @@ response.status_code: 200
339
339
}
340
340
],
341
341
"properties": {
342
-
"outputFormat": "riff-16khz-16bit-mono-pcm",
342
+
"outputFormat": "riff-24khz-16bit-mono-pcm",
343
343
"concatenateResult": false,
344
344
"totalDuration": "PT1S",
345
345
"billableCharacterCount": 5
@@ -359,7 +359,7 @@ response.status_code: 200
359
359
}
360
360
],
361
361
"properties": {
362
-
"outputFormat": "riff-16khz-16bit-mono-pcm",
362
+
"outputFormat": "riff-24khz-16bit-mono-pcm",
363
363
"concatenateResult": false,
364
364
"totalDuration": "PT5M57.252S",
365
365
"billableCharacterCount": 3048
@@ -456,7 +456,7 @@ The Long audio API is available in multiple regions with unique endpoints.
456
456
We support flexible audio output formats. You can generate audio outputs per paragraph or concatenate the audio outputs into a single output by setting the `concatenateResult` parameter. The following audio output formats are supported by the Long Audio API:
457
457
458
458
> [!NOTE]
459
-
> The default audio format is riff-16khz-16bit-mono-pcm.
459
+
> The default audio format is riff-24khz-16bit-mono-pcm.
460
460
>
461
461
> The sample rate for long audio voices is 24kHz, not 48kHz. Other sample rates can be obtained through upsampling or downsampling when synthesizing.
0 commit comments