3 People - Italian Average Tone Speech Synthesis Corpus. It is recorded by native Italian, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1368?source=Github
48,000Hz, 24bit, uncompressed wav, mono channel;
professional recording studio;
Italian, 1 male and 2 female, 4 hours per person
customer service and general styles, 2 hours/style/person;
word and phoneme transcription, four-level prosodic boundary annotation;
microphone;
Italian;
speech synthesis
Commercial License