Skip to content

Nexdata-AI/2-People-Live-streaming-Shopping-Style-Average-Tone-Speech-Synthesis-Corpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 

Repository files navigation

2-People-Live-streaming-Shopping-Style-Average-Tone-Speech-Synthesis-Corpus

Description

2 People - Live-streaming Shopping Style Average Tone Speech Synthesis Corpus. It is recorded by Chinese native speaker. Corpus coverage includes 'Welcome', 'Product Introduction', 'Interaction' and other text categories related to livestream shopping, phonemes and tones are balanced. Professional phonetician participates in the annotation.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1399?source=Gihtub

Specifications

Format

48,000Hz, 24bit, uncompressed wav, mono channel

Recording environment

professional recording studio

Recording content

live shopping text, and the syllables, phonemes and tones are balanced;

Speaker

professional voice actor, one man and one woman, five hours per people

Device

microphone

Language

Mandarin

Annotation

word and pinyin transcription, prosodic boundary annotation

Application scenarios

speech synthesis

Licensing Information

Commercial License

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published