Skip to content
Change the repository type filter

All

    Repositories list

    • masader

      Public
      The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.
      JavaScript
      GNU General Public License v3.0
      3619411Updated Jan 30, 2026Jan 30, 2026
    • Python
      MIT License
      7521Updated Oct 13, 2025Oct 13, 2025
    • CIDAR

      Public
      Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
      Jupyter Notebook
      Apache License 2.0
      84500Updated Apr 3, 2025Apr 3, 2025
    • Calliar

      Public
      A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.
      Jupyter Notebook
      MIT License
      2015420Updated Jun 24, 2024Jun 24, 2024
    • dar

      Public
      A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.
      Python
      Apache License 2.0
      21110Updated Jun 23, 2024Jun 23, 2024
    • HTML
      2010Updated May 10, 2024May 10, 2024
    • .github

      Public
      1100Updated Apr 13, 2024Apr 13, 2024
    • CIDAR-v2

      Public
      Jupyter Notebook
      2630Updated Mar 30, 2024Mar 30, 2024
    • Python
      1110Updated Mar 3, 2024Mar 3, 2024
    • ARBML

      Public
      Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and notebooks.
      JavaScript
      MIT License
      49422100Updated Mar 1, 2024Mar 1, 2024
    • Taqyim

      Public
      Python intefrace for evaluation on chatgpt models
      Jupyter Notebook
      MIT License
      41910Updated Feb 13, 2024Feb 13, 2024
    • evals

      Public
      Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
      Jupyter Notebook
      MIT License
      2.9k201Updated Feb 13, 2024Feb 13, 2024
    • nmatheg

      Public
      A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the…
      Jupyter Notebook
      52210Updated Jan 27, 2024Jan 27, 2024
    • tkseem

      Public
      Arabic Tokenization Library. It provides many tokenization algorithms.
      Jupyter Notebook
      MIT License
      2111043Updated Jan 4, 2024Jan 4, 2024
    • klaam

      Public
      Arabic speech recognition, classification and text-to-speech.
      Jupyter Notebook
      MIT License
      85424141Updated Sep 30, 2023Sep 30, 2023
    • tnkeeh

      Public
      Arabic cleaning, normalization and segmentation library.
      Python
      MIT License
      97420Updated Sep 28, 2023Sep 28, 2023
    • mat-bpe

      Public
      Jupyter Notebook
      0000Updated Aug 6, 2023Aug 6, 2023
    • Ashaar

      Public
      Arabic poetry analysis and generation.
      Jupyter Notebook
      42300Updated Jul 23, 2023Jul 23, 2023
    • Jupyter Notebook
      0200Updated Jun 4, 2023Jun 4, 2023
    • Python
      Apache License 2.0
      0300Updated Apr 3, 2023Apr 3, 2023
    • atmatah

      Public
      a repository containing scripts to automate processes, for instance configuring web-apps on remote machines
      Jinja
      MIT License
      0040Updated Jan 25, 2023Jan 25, 2023
    • qawafi

      Public
      Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.
      Jupyter Notebook
      MIT License
      103550Updated Jan 3, 2023Jan 3, 2023
    • 0000Updated Dec 31, 2022Dec 31, 2022
    • whisperar

      Public
      Python
      34010Updated Dec 25, 2022Dec 25, 2022
    • Bohour

      Public
      Bohour, a package that abstracts arabic poetry science, Aroud
      Python
      MIT License
      0220Updated Dec 2, 2022Dec 2, 2022
    • adawat

      Public
      Jupyter Notebook
      GNU General Public License v3.0
      0600Updated Nov 17, 2022Nov 17, 2022
    • rasm

      Public
      Arabic Art using GANs
      Python
      Other
      31700Updated Aug 3, 2022Aug 3, 2022
    • bayanat

      Public
      Explore the content of Arabic text datasets.
      Jupyter Notebook
      MIT License
      31920Updated May 23, 2022May 23, 2022
    • Research

      Public
      Support Arabic people working on research by creating an environment for ideas in NLP and speech.
      01130Updated Apr 25, 2021Apr 25, 2021
    • MetRec

      Public
      Arabic Poetry Metric Classification Using Bidirectional Gated Recurrent Neural Networks
      Jupyter Notebook
      MIT License
      21100Updated Jun 3, 2020Jun 3, 2020