Nexdata-AI/4-People-Cantonese-Average-Tone-Speech-Synthesis-Corpus

4-People-Cantonese-Average-Tone-Speech-Synthesis-Corpus

Description

The 4 People - Cantonese Average Tone Speech Synthesis Corpus was recorded by native speakers from Guangdong. The corpus contains educational, game, and general colloquial content, and its phoneme coverage is balanced. Professional phoneticians participated in the annotation. The corpus is designed to match the research and development needs of speech synthesis.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1568?source=Github

Specifications

Format

48,000 Hz, 24-bit, uncompressed WAV, mono channel;
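
As a rough illustration of how the stated format could be checked on a delivered file, here is a minimal sketch using Python's standard `wave` module. The function name `check_wav_format` and the constant names are hypothetical and not part of the corpus. Note that `wave` reads only PCM data, and some 24-bit files use the WAVE_FORMAT_EXTENSIBLE header, which older Python versions may reject.

```python
import wave

# Expected values, taken from the Format line of this README.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 3  # 24-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV header and the stated format.

    An empty list means the header matches. This is a hypothetical helper,
    not a tool shipped with the corpus.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems
```

Calling `check_wav_format("sample.wav")`, where `sample.wav` is a placeholder file name, would return an empty list for a file that matches the specification.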

Recording environment

professional recording studio;

Recording content

contains educational, game and general colloquial content;

Speaker

professional voice actors, two male and two female, 2 hours per person;

Annotation

word and phoneme transcription, prosodic boundary annotation;

Device

microphone;

Language

Cantonese;

Application scenarios

speech synthesis.

Licensing Information

Commercial License
