3-People-Italian-Average-Tone-Speech-Synthesis-Corpus

Description

3 People - Italian Average Tone Speech Synthesis Corpus. The corpus is recorded by native Italian speakers with authentic accents and covers both customer service and general speaking styles. Phoneme coverage is balanced, and the annotation is produced with professional phonetician involvement. The corpus is designed to meet the research and development needs of speech synthesis.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1368?source=Github

Specifications

Format

48,000 Hz, 24-bit, uncompressed WAV, mono channel;
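As a quick sanity check, the stated format can be compared against a file's header. The following is a minimal sketch using Python's standard `wave` module; the function names are illustrative and not part of the corpus, and it assumes the files are plain PCM WAV (older Python versions may reject WAV files that use the extensible header variant).

```python
import wave

# Expected values taken from the Format specification:
# 48,000 Hz sample rate, 24-bit samples (3 bytes), mono.
EXPECTED = {"sample_rate": 48000, "sample_width_bytes": 3, "channels": 1}


def read_wav_format(path):
    """Return sample rate, sample width in bytes, and channel count of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "sample_rate": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }


def matches_spec(path):
    """True if the file header matches the corpus format specification."""
    return read_wav_format(path) == EXPECTED
```

Calling `matches_spec` on each file (with whatever path layout the delivered corpus uses) flags any recording that was resampled or converted after delivery.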

Recording environment

professional recording studio;

Speaker

Italian, 1 male and 2 female speakers, 4 hours per person;

Style

customer service and general styles, 2 hours/style/person;
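The per-speaker and per-style figures can be combined into a rough total. The sketch below is a back-of-the-envelope calculation, assuming the nominal durations from the specification and raw PCM data only (WAV headers and any actual deviation in recorded length are ignored).

```python
# Nominal figures from the Speaker, Style, and Format specifications.
SPEAKERS = 3
STYLES = 2
HOURS_PER_STYLE_PER_SPEAKER = 2
SAMPLE_RATE_HZ = 48_000
BYTES_PER_SAMPLE = 3  # 24-bit
CHANNELS = 1          # mono

hours_per_speaker = STYLES * HOURS_PER_STYLE_PER_SPEAKER         # 4 hours
total_hours = SPEAKERS * hours_per_speaker                       # 12 hours
bytes_per_second = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS  # 144,000 bytes/s
total_bytes = total_hours * 3600 * bytes_per_second
total_gb = total_bytes / 1e9                                     # decimal gigabytes

print(f"{total_hours} hours, about {total_gb:.2f} GB of raw audio")
```

Under these assumptions the corpus amounts to 12 hours of audio, or roughly 6.2 GB uncompressed.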

Annotation

word and phoneme transcription, four-level prosodic boundary annotation;

Device

microphone;

Language

Italian;

Application scenarios

speech synthesis

Licensing Information

Commercial License
