Skip to content

Commit 13ee1b2

Browse files
authored
Merge pull request #197339 from sally-baolian/patch-21
Update how-to-custom-voice-prepare-data.md
2 parents cc98caa + d157b35 commit 13ee1b2

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

articles/cognitive-services/Speech-Service/how-to-custom-voice-prepare-data.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -64,14 +64,14 @@ To produce a good voice model, create the recordings in a quiet room with a high
6464
6565
### Audio files
6666

67-
Each audio file should contain a single utterance (a single sentence or a single turn of a dialog system), less than 15 seconds long. All files must be in the same spoken language. Multi-language custom Text-to-Speech voices aren't supported, with the exception of the Chinese-English bi-lingual. Each audio file must have a unique numeric filename with the filename extension .wav.
67+
Each audio file should contain a single utterance (a single sentence or a single turn of a dialog system), less than 15 seconds long. All files must be in the same spoken language. Multi-language custom Text-to-Speech voices aren't supported, with the exception of the Chinese-English bi-lingual. Each audio file must have a unique filename with the filename extension .wav.
6868

6969
Follow these guidelines when preparing audio.
7070

7171
| Property | Value |
7272
| -------- | ----- |
7373
| File format | RIFF (.wav), grouped into a .zip file |
74-
| File name | Numeric, with .wav extension. No duplicate file names allowed. |
74+
| File name | File name characters supported by Windows OS, with .wav extension.<br>The characters \ / : * ? " < > \| aren't allowed. <br>It can't start or end with a space, and can't start with a dot. <br>No duplicate file names allowed. |
7575
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
7676
| Sample format | PCM, at least 16-bit |
7777
| Audio length | Shorter than 15 seconds |
@@ -115,7 +115,7 @@ Follow these guidelines when preparing audio for segmentation.
115115
| Property | Value |
116116
| -------- | ----- |
117117
| File format | RIFF (.wav) or .mp3, grouped into a .zip file |
118-
| File name | ASCII and Unicode characters supported. No duplicate names allowed. |
118+
| File name | File name characters supported by Windows OS, with .wav extension. <br>The characters \ / : * ? " < > \| aren't allowed. <br>It can't start or end with a space, and can't start with a dot. <br>No duplicate file names allowed. |
119119
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
120120
| Sample format |RIFF(.wav): PCM, at least 16-bit<br>mp3: at least 256 KBps bit rate|
121121
| Audio length | Longer than 20 seconds |
@@ -155,7 +155,7 @@ Follow these guidelines when preparing audio.
155155
| Property | Value |
156156
| -------- | ----- |
157157
| File format | RIFF (.wav) or .mp3, grouped into a .zip file |
158-
| File name | ASCII and Unicode characters supported. No duplicate name allowed. |
158+
| File name | File name characters supported by Windows OS, with .wav extension. <br>The characters \ / : * ? " < > \| aren't allowed. <br>It can't start or end with a space, and can't start with a dot. <br>No duplicate file names allowed. |
159159
| Sampling rate | For creating a custom neural voice, 24,000 Hz is required. |
160160
| Sample format |RIFF(.wav): PCM, at least 16-bit<br>mp3: at least 256 KBps bit rate|
161161
| Audio length | No limit |

0 commit comments

Comments
 (0)