Skip to content

Commit 1ee419c

Browse files
authored
Merge pull request #191067 from sally-baolian/patch-2
Update how-to-custom-voice-prepare-data.md
2 parents 51b3274 + 8726c1f commit 1ee419c

File tree

2 files changed

+16
-7
lines changed

2 files changed

+16
-7
lines changed

articles/cognitive-services/Speech-Service/how-to-custom-voice-prepare-data.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -73,7 +73,7 @@ Follow these guidelines when preparing audio.
7373
| File format | RIFF (.wav), grouped into a .zip file |
7474
| File name | Numeric, with .wav extension. No duplicate file names allowed. |
7575
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
76-
| Sample format | PCM, 16-bit |
76+
| Sample format | PCM, at least 16-bit |
7777
| Audio length | Shorter than 15 seconds |
7878
| Archive format | .zip |
7979
| Maximum archive size | 2048 MB |
@@ -114,9 +114,10 @@ Follow these guidelines when preparing audio for segmentation.
114114

115115
| Property | Value |
116116
| -------- | ----- |
117-
| File format | RIFF (.wav) with a sampling rate of at least 16 khz-16-bit in PCM or .mp3 with a bit rate of at least 256 KBps, grouped into a .zip file |
117+
| File format | RIFF (.wav) or .mp3, grouped into a .zip file |
118118
| File name | ASCII and Unicode characters supported. No duplicate names allowed. |
119119
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
120+
| Sample format |RIFF(.wav): PCM, at least 16-bit<br>mp3: at least 256 KBps bit rate|
120121
| Audio length | Longer than 20 seconds |
121122
| Archive format | .zip |
122123
| Maximum archive size | 2048 MB |
@@ -153,9 +154,10 @@ Follow these guidelines when preparing audio.
153154
154155
| Property | Value |
155156
| -------- | ----- |
156-
| File format | RIFF (.wav) with a sampling rate of at least 16 khz-16-bit in PCM or .mp3 with a bit rate of at least 256 KBps, grouped into a .zip file |
157+
| File format | RIFF (.wav) or .mp3, grouped into a .zip file |
157158
| File name | ASCII and Unicode characters supported. No duplicate name allowed. |
158159
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
160+
| Sample format |RIFF(.wav): PCM, at least 16-bit<br>mp3: at least 256 KBps bit rate|
159161
| Audio length | No limit |
160162
| Archive format | .zip |
161163
| Maximum archive size | 2048 MB |

articles/cognitive-services/Speech-Service/speech-services-quotas-and-limits.md

Lines changed: 11 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -68,13 +68,13 @@ In the following tables, the parameters without the **Adjustable** row aren't ad
6868

6969
| Quota | Free (F0)<sup>3</sup> | Standard (S0) |
7070
|--|--|--|
71-
| *Max number of transactions per second (TPS) per Speech service resource* | | |
72-
| Real-time API. Prebuilt neural voices and custom neural voices. | 200<sup>4</sup> | 200<sup>4</sup> |
71+
| **Max number of transactions per second (TPS) per Speech service resource** | | |
72+
| Real-time API. Prebuilt neural voices and custom neural voices. | 20 per 60 seconds | 200<sup>4</sup> |
7373
| Adjustable | No<sup>4</sup> | Yes<sup>4</sup> |
74-
| *HTTP-specific quotas* | | |
74+
| **HTTP-specific quotas** | | |
7575
| Max audio length produced per request | 10 min | 10 min |
7676
| Max total number of distinct `<voice>` and `<audio>` tags in SSML | 50 | 50 |
77-
| *Websocket specific quotas* | | |
77+
| **Websocket specific quotas** | | |
7878
| Max audio length produced per turn | 10 min | 10 min |
7979
| Max total number of distinct `<voice>` and `<audio>` tags in SSML | 50 | 50 |
8080
| Max SSML message size per turn | 64 KB | 64 KB |
@@ -102,6 +102,13 @@ In the following tables, the parameters without the **Adjustable** row aren't ad
102102
| Default value | N/A | 10 |
103103
| Adjustable | N/A | Yes<sup>5</sup> |
104104

105+
#### Audio Content Creation tool
106+
107+
| Quota | Free (F0)| Standard (S0) |
108+
|--|--|--|
109+
| File size | 3,000 characters per file | 20,000 characters per file |
110+
| Export to audio library | 1 concurrent task | N/A |
111+
105112
<sup>3</sup> For the free (F0) pricing tier, see also the monthly allowances at the [pricing page](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/).<br/>
106113
<sup>4</sup> See [additional explanations](#detailed-description-quota-adjustment-and-best-practices) and [best practices](#general-best-practices-to-mitigate-throttling-during-autoscaling).<br/>
107114
<sup>5</sup> See [additional explanations](#detailed-description-quota-adjustment-and-best-practices), [best practices](#general-best-practices-to-mitigate-throttling-during-autoscaling), and [adjustment instructions](#text-to-speech-increase-concurrent-request-limit-for-custom-neural-voices).<br/>

0 commit comments

Comments
 (0)