4-People-Cantonese-Average-Tone-Speech-Synthesis-Corpus

Description

4 People - Cantonese Average Tone Speech Synthesis Corpus，recorded by native of Guangdong. The corpus contain educational, game and general colloquial content. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1568?source=Github

Specifications

Format

48,000Hz, 24bit, uncompressed wav, mono channel;

Recording environment

professional recording studio;

Recording content

contains educational, game and general colloquial content;

Speaker

professional voice actor, two male and two female, 2 hours per person;

Annotation

word and phoneme transcription, prosodic boundary annotation;

Device

microphone;

Language

Cantonese;

Application scenarios

speech synthesis.

Licensing Information

Commercial License

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

4-People-Cantonese-Average-Tone-Speech-Synthesis-Corpus

Description

Specifications

Format

Recording environment

Recording content

Speaker

Annotation

Device

Language

Application scenarios

Licensing Information

About

Uh oh!

Releases

Packages

Nexdata-AI/4-People-Cantonese-Average-Tone-Speech-Synthesis-Corpus

Folders and files

Latest commit

History

Repository files navigation

4-People-Cantonese-Average-Tone-Speech-Synthesis-Corpus

Description

Specifications

Format

Recording environment

Recording content

Speaker

Annotation

Device

Language

Application scenarios

Licensing Information

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages