Skip to content

Commit f2ce388

Browse files
Merge pull request #226952 from ArcherAZ/patch-55
Update how-to-custom-speech-human-labeled-transcriptions.md
2 parents 08ea112 + eff6512 commit f2ce388

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

articles/cognitive-services/Speech-Service/how-to-custom-speech-human-labeled-transcriptions.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,11 +18,11 @@ Human-labeled transcriptions are word-by-word transcriptions of an audio file. Y
1818

1919
A large sample of transcription data is required to improve recognition. We suggest providing between 1 and 20 hours of audio data. The Speech service will use up to 20 hours of audio for training. This guide is broken up by locale, with sections for US English, Mandarin Chinese, and German.
2020

21-
The transcriptions for all WAV files are contained in a single plain-text file. Each line of the transcription file contains the name of one of the audio files, followed by the corresponding transcription. The file name and transcription are separated by a tab (`\t`).
21+
The transcriptions for all WAV files are contained in a single plain-text file (.txt or .tsv). Each line of the transcription file contains the name of one of the audio files, followed by the corresponding transcription. The file name and transcription are separated by a tab (`\t`).
2222

2323
For example:
2424

25-
```tsv
25+
```txt
2626
speech01.wav speech recognition is awesome
2727
speech02.wav the quick brown fox jumped all over the place
2828
speech03.wav the lazy dog was not amused

0 commit comments

Comments
 (0)