This data was collected, in collaboration with Hussam Eldeen Hassan, for a project in the Speech Recognition course of the AMMI program, which is taught by Gabriel Synnaeve, Neil Zeghidour, Emmanuel Dupoux, Laurent Besacier and Morgane Rivera.
This data contains 2h5m of speech read from the text of a novel (ba'd hadha alqoronfl: بعض هذا القرنفل) in standard (official) Arabic. It was collected using the Lig-Aikuma Android app. It is assembled from 16 text files, each containing about 50 medium-length sentences. Each text file has a corresponding recording folder that contains one recording (.wav) for each sentence in that file (i.e. about 50 sound recordings).
Some recordings contain background noise. You may find some typos in the text data due to missing Arabic diacritics, and you may find that some recordings are corrupted (crashed) due to failures in the Lig-Aikuma app.
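Because some recordings may be unreadable, it can help to scan the recording folders before training. The following is a minimal sketch, not part of the original data release: the function name `scan_recordings` and the root folder name `data` are hypothetical, and it assumes the .wav files are plain PCM, which is the only format Python's standard `wave` module can open (other encodings would be reported as unreadable).

```python
import wave
from pathlib import Path


def scan_recordings(root):
    """Return (readable, unreadable, total_seconds) for all .wav files under root."""
    readable, unreadable = [], []
    total_seconds = 0.0
    for path in sorted(Path(root).rglob("*.wav")):
        try:
            with wave.open(str(path), "rb") as wav:
                frames = wav.getnframes()
                rate = wav.getframerate()
        except (wave.Error, EOFError, OSError):
            # Empty, truncated, or non-PCM files end up here.
            unreadable.append(path)
            continue
        if frames == 0 or rate == 0:
            # A valid header with no audio is treated as a failed recording.
            unreadable.append(path)
            continue
        readable.append(path)
        total_seconds += frames / rate
    return readable, unreadable, total_seconds


if __name__ == "__main__":
    # "data" is a hypothetical path; point it at the folder holding the recordings.
    ok, bad, seconds = scan_recordings("data")
    print(f"{len(ok)} readable, {len(bad)} unreadable, {seconds / 3600:.2f} hours")
    for path in bad:
        print("unreadable:", path)
```

The reported total duration can be compared with the stated 2h5m as a rough sanity check on the download.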
THIS DATA IS NOT ALLOWED TO BE USED FOR ANY OTHER PURPOSE WITHOUT THE PERMISSION OF THE OWNERS.