Skip to content

Commit 4d1643d

Browse files
authored
Merge pull request #231326 from sally-baolian/patch-108
Update how-to-custom-voice-prepare-data.md
2 parents 735be74 + ee84742 commit 4d1643d

7 files changed

+11
-7
lines changed

articles/cognitive-services/Speech-Service/how-to-custom-voice-prepare-data.md

Lines changed: 11 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -44,17 +44,21 @@ To upload training data, follow these steps:
4444
4545
Data files are automatically validated when you select **Submit**. Data validation includes series of checks on the audio files to verify their file format, size, and sampling rate. If there are any errors, fix them and submit again.
4646

47-
After you upload the data, you can check the details in the training set detail view. On the **Overview** tab, you can further check the pronunciation scores and the noise level for each of your data. The pronunciation score ranges from 0-100. A score below 70 normally indicates a speech error or script mismatch. A heavy accent can reduce your pronunciation score and affect the generated digital voice.
47+
After you upload the data, you can check the details in the training set detail view. On the detail page, you can further check the pronunciation issue and the noise level for each of your data. The pronunciation score at the sentence level ranges from 0-100. A score below 70 normally indicates a speech error or script mismatch. Utterances with an overall score lower than 70 will be rejected. A heavy accent can reduce your pronunciation score and affect the generated digital voice.
4848

4949
## Resolve data issues online
5050

5151
After upload, you can check the data details of the training set. Before continuing to [train your voice model](how-to-custom-voice-create-voice.md), you should try to resolve any data issues.
5252

5353
You can resolve data issues per utterance in Speech Studio.
5454

55-
1. On the **Data details** page, select individual utterances you want to edit, then click **Edit**.
55+
1. On the detail page, go to the **Accepted data** or **Rejected data** page. Select individual utterances you want to change, then click **Edit**.
5656

57-
:::image type="content" source="media/custom-voice/cnv-edit-trainingset.png" alt-text="Screenshot of selecting edit button on the Data details page.":::
57+
:::image type="content" source="media/custom-voice/cnv-edit-trainingset.png" alt-text="Screenshot of selecting edit button on the accepted data or rejected data details page.":::
58+
59+
You can choose which data issues to be displayed based on your criteria.
60+
61+
:::image type="content" source="media/custom-voice/cnv-issues-display-criteria.png" alt-text="Screenshot of choosing which data issues to be displayed":::
5862

5963
1. Edit window will be displayed.
6064

@@ -70,7 +74,7 @@ You can resolve data issues per utterance in Speech Studio.
7074

7175
:::image type="content" source="media/custom-voice/cnv-edit-trainingset-upload-recording.png" alt-text="Screenshot that shows how to upload recording file on the Edit transcript and recording file window.":::
7276

73-
1. After the data in a training set are updated, you need to check the data quality by clicking **Analyze data** before using this training set for training.
77+
1. After you've made changes to your data, you need to check the data quality by clicking **Analyze data** before using this dataset for training.
7478

7579
You can't select this training set for training model before the analysis is complete.
7680

@@ -84,7 +88,7 @@ The issues are divided into three types. Refer to the following tables to check
8488

8589
**Auto-rejected**
8690

87-
Data with these errors won't be used for training. Imported data with errors will be ignored, so you don't need to delete them. You can resubmit the corrected data for training.
91+
Data with these errors won't be used for training. Imported data with errors will be ignored, so you don't need to delete them. You can [fix these data errors online](#resolve-data-issues-online) or upload the corrected data again for training.
8892

8993
| Category | Name | Description |
9094
| --------- | ----------- | --------------------------- |
@@ -109,15 +113,15 @@ The following errors are fixed automatically, but you should review and confirm
109113
| --------- | ----------- | --------------------------- |
110114
| Mismatch |Silence auto fixed |The start silence is detected to be shorter than 100 ms, and has been extended to 100 ms automatically. Download the normalized dataset and review it. |
111115
| Mismatch |Silence auto fixed | The end silence is detected to be shorter than 100 ms, and has been extended to 100 ms automatically. Download the normalized dataset and review it.|
116+
| Script | Text auto normalized|Text is automatically normalized for digits, symbols, and abbreviations. Review the script and audio to make sure they match.|
112117

113118
**Manual check required**
114119

115120
Unresolved errors listed in the next table affect the quality of training, but data with these errors won't be excluded during training. For higher-quality training, it's a good idea to fix these errors manually.
116121

117122
| Category | Name | Description |
118123
| --------- | ----------- | --------------------------- |
119-
| Script | Non-normalized text|This script contains digits. Expand them to normalized words, and match with the audio. For example, normalize *123* to *one hundred and twenty-three*.|
120-
| Script | Non-normalized text|This script contains symbols. Normalize the symbols to match the audio. For example, normalize *50%* to *fifty percent*.|
124+
| Script | Non-normalized text |This script contains symbols. Normalize the symbols to match the audio. For example, normalize */* to *slash*.|
121125
| Script | Not enough question utterances| At least 10 percent of the total utterances should be question sentences. This helps the voice model properly express a questioning tone.|
122126
| Script | Not enough exclamation utterances| At least 10 percent of the total utterances should be exclamation sentences. This helps the voice model properly express an excited tone.|
123127
| Script | No valid end punctuation| Add one of the following at the end of the line: full stop (half-width '.' or full-width '。'), exclamation point (half-width '!' or full-width '!' ), or question mark ( half-width '?' or full-width '?').|
-12.9 KB
Loading
-2.65 KB
Loading
229 Bytes
Loading
-755 Bytes
Loading
-11.5 KB
Loading
12.2 KB
Loading

0 commit comments

Comments
 (0)