Chinese Multi-emotional Modal particle and Natural Conversation Speech Synthesis Corpus, is recorded by multiple native Chinese voice actors. It not only includes sentences rich in modal particles that align with daily expression habits, but also encompasses free conversation data on given topics. In each conversation, the audio of each speaker is independently stored in their respective tracks. Professional phoneticians have annotated information such as text content, meeting the precise requirements for speech synthesis research and development to a full extent.
For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1833?source=Github
Modal particle: 48kHz, 24bit, wav, mono; Natural Conversation: 48kHz, 24bit, wav, stereo(each speaker's speech occupying his/her own sound track)
Recording studio
- Read texts containing modal particles in a natural way; 2. Have a natural conversation based on given topic
Transcription text
Microphone
100 professional voice actors
Chinese
Speech synthesis
Commercial License