4 People - Cantonese Average Tone Speech Synthesis Corpus,recorded by native of Guangdong. The corpus contain educational, game and general colloquial content. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1568?source=Github
48,000Hz, 24bit, uncompressed wav, mono channel;
professional recording studio;
contains educational, game and general colloquial content;
professional voice actor, two male and two female, 2 hours per person;
word and phoneme transcription, prosodic boundary annotation;
microphone;
Cantonese;
speech synthesis.
Commercial License