Spanish(Mexico) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1715?source=Github
16kHz, 16 bit, wav, mono channel;
including interview, self-meida,variety show, etc.
Low background noise;
Mexico(MEX),etc.;
es-MX,etc.
Spanish(Mexico), etc;
Transcription text, timestamp, speaker ID, gender, noise.
Word Accuracy Rate (WAR) 98% (Tags, gender, speakerID, accent, topic are other non-speech annotations are not included in accuracy statistics due to subjectivity)
Commercial License