You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/how-to-custom-voice-prepare-data.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -73,7 +73,7 @@ Follow these guidelines when preparing audio.
73
73
| File format | RIFF (.wav), grouped into a .zip file |
74
74
| File name | Numeric, with .wav extension. No duplicate file names allowed. |
75
75
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
76
-
| Sample format | PCM, 16-bit |
76
+
| Sample format | PCM, at least 16-bit |
77
77
| Audio length | Shorter than 15 seconds |
78
78
| Archive format | .zip |
79
79
| Maximum archive size | 2048 MB |
@@ -114,9 +114,10 @@ Follow these guidelines when preparing audio for segmentation.
114
114
115
115
| Property | Value |
116
116
| -------- | ----- |
117
-
| File format | RIFF (.wav) with a sampling rate of at least 16 khz-16-bit in PCM or .mp3 with a bit rate of at least 256 KBps, grouped into a .zip file |
117
+
| File format | RIFF (.wav) or .mp3, grouped into a .zip file |
118
118
| File name | ASCII and Unicode characters supported. No duplicate names allowed. |
119
119
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
120
+
| Sample format |RIFF(.wav): PCM, at least 16-bit<br>mp3: at least 256 KBps bit rate|
120
121
| Audio length | Longer than 20 seconds |
121
122
| Archive format | .zip |
122
123
| Maximum archive size | 2048 MB |
@@ -153,9 +154,10 @@ Follow these guidelines when preparing audio.
153
154
154
155
| Property | Value |
155
156
| -------- | ----- |
156
-
| File format | RIFF (.wav) with a sampling rate of at least 16 khz-16-bit in PCM or .mp3 with a bit rate of at least 256 KBps, grouped into a .zip file |
157
+
| File format | RIFF (.wav) or .mp3, grouped into a .zip file |
157
158
| File name | ASCII and Unicode characters supported. No duplicate name allowed. |
158
159
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
160
+
| Sample format |RIFF(.wav): PCM, at least 16-bit<br>mp3: at least 256 KBps bit rate|
<sup>3</sup> For the free (F0) pricing tier, see also the monthly allowances at the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/).<br/>
106
113
<sup>4</sup> See [additional explanations](#detailed-description-quota-adjustment-and-best-practices) and [best practices](#general-best-practices-to-mitigate-throttling-during-autoscaling).<br/>
107
114
<sup>5</sup> See [additional explanations](#detailed-description-quota-adjustment-and-best-practices), [best practices](#general-best-practices-to-mitigate-throttling-during-autoscaling), and [adjustment instructions](#text-to-speech-increase-concurrent-request-limit-for-custom-neural-voices).<br/>
0 commit comments