Skip to content

Nexdata-AI/268-Hours-Arabic-Saudi-Full-Duplex-Multi-Channel-Customer-Service-Speech-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 

Repository files navigation

268-Hours-Arabic-Saudi-Full-Duplex-Multi-Channel-Customer-Service-Speech-Dataset

Description

Arabic(Saudi) Multi-stream Spontaneous Dialogue Smartphone speech dataset-Customer Service. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(268 native speakers), geographicly speaking, enhancing model performance in real and complex tasks.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1627?source=Github

Specifications

Format

16kHz, 16 bit, wav, mono channel;

Content category

Recorders in free conversation without a set topic;

Recording condition

Low background noise (indoor);

Recording device

Android smartphone, iPhone;

Speaker

268 native speakers in total, 41% male and 59% female;

Country

Kingdom of Saudi Arabia;

Language

Arabic;

Features of annotation

Transcription text, timestamp, speaker ID, gender;

Accuracy Rate

Word Accuracy Rate (WAR) 95%

Licensing Information

Commercial License

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published