2 People - Live-streaming Shopping Style Average Tone Speech Synthesis Corpus. It was recorded by native Chinese speakers. The corpus covers 'Welcome', 'Product Introduction', 'Interaction' and other text categories related to livestream shopping, with balanced phonemes and tones. Professional phoneticians participated in the annotation.
For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1399?source=Gihtub
48,000 Hz, 24-bit, uncompressed WAV, mono channel
professional recording studio
live shopping text, with balanced syllables, phonemes and tones
professional voice actors, one man and one woman, five hours per person
microphone
Mandarin
word and pinyin transcription, prosodic boundary annotation
speech synthesis
Commercial License
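
As a minimal sketch, the stated audio format (48,000 Hz, 24-bit, mono, uncompressed WAV) could be checked against a downloaded file with Python's standard `wave` module. The function name `check_wav_format` and the file path in the usage note are hypothetical and not part of the dataset; the corpus directory layout is not described here, so no file names are assumed.

```python
import wave

# Expected values taken from the format line of this README.
EXPECTED_SAMPLE_RATE = 48_000  # Hz
EXPECTED_SAMPLE_WIDTH = 3      # bytes per sample, i.e. 24-bit
EXPECTED_CHANNELS = 1          # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV header and the stated format.

    Hypothetical helper for illustration; an empty list means the header
    matches 48,000 Hz, 24-bit, mono.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_SAMPLE_RATE:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems
```

Calling `check_wav_format("sample.wav")` on a placeholder path returns an empty list when the header matches. The `wave` module reads only uncompressed PCM data and raises `wave.Error` for header variants it does not understand, so a failure to open a file is not by itself evidence that the audio deviates from the stated format.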