Skip to content

Latest commit

 

History

History
741 lines (688 loc) · 42.4 KB

File metadata and controls

741 lines (688 loc) · 42.4 KB

Arabic Romanization Master Reference

Version: 1.1.0
Last Updated: 2025-10-22

Overview

This document serves as the comprehensive source of truth for Arabic script romanization across all Adobe Arabic character set modules. Each table shows how Arabic script characters are romanized according to different international and academic standards.

✅ VERIFICATION STATUS: All romanization data has been verified against official source documents. Current validation accuracy: 100.0% (2421/2421 mappings across 52 standards, verified 2025-10-22).

Document Structure

This document organizes Arabic-script romanization standards by language family and geographic region:

  • Arabic Core Romanization (71 characters, 12 standards): Traditional Arabic script letters
  • Persian Romanization (50 letters, 8 standards): Persian script letters
  • Panjab Region Romanization (89 letters, 10 standards): Urdu and Punjabi script letters including aspirated digraphs
  • Ottoman Turkish Romanization (50 characters, 3 standards): Turkish/Ottoman Turkish script letters
  • Uyghur/Kazakh/Kyrgyz Romanization (AAR3R+AAR3P) (54 characters, 4 standards): Turkic languages using Arabic script
  • Kashmiri/Saraiki/Balti Romanization (AAR4R) (72 letters, 3 standards): Nastaliq script style
  • Arabic Extended Romanization (AAR5R) (157 letters, 13 standards): Pashto, Sindhi, Kurdish (Sorani), and Balochi script letters

Arabic Core Romanization (71 characters: 63 single + 8 combinations)

Traditional Arabic script letters and their romanization across international standards

Standards Used:

  • BGN/PCGN: Board on Geographic Names/Permanent Committee on Geographical Names Arabic 2019
  • UNGEGN: United Nations Group of Experts on Geographical Names Arabic 1972/2018
  • ALA-LC: American Library Association-Library of Congress Arabic romanization
  • DIN: Deutsches Institut für Normung 31635 Arabic romanization
  • ISO 233-2: ISO 233-2:1993 Simplified Arabic romanization
  • DMG: Deutsche Morgenländische Gesellschaft Arabic romanization
  • Wehr/Cowan: Hans Wehr Dictionary (English edition) romanization system
  • Brill: Brill Simple Arabic Transliteration System 1.0
  • EI3: Encyclopedia of Islam Third Edition Arabic transliteration
  • OSU: Ohio State University Egyptian Arabic IPA
  • UMich: University of Michigan Arabic IPA
  • Wikipedia: Wikipedia Modern Standard Arabic IPA

Note: For complete URL references and detailed documentation, see Arabic International Standards, Arabic Academic Systems, and Phonetic Representation Systems in arabic-roman-standards.md.

Unicode Character BGN/PCGN UNGEGN ALA-LC DIN ISO 233-2 DMG Wehr/Cowan Brill EI3 OSU UMich WP
0627 ا ā ā ā ā ʾ ā ā ā ā ā
0671 ٱ ʼ - a/i ʾ - - - - - - - -
0628 ب b b b b b b b b b b b b
066E ٮ - - - - - - - - - - - -
067E پ p - p p - p p - - - - -
062A ت t t t t t t t t t t t t
0629 ة ah/at - h/t/tan h/t h/t ah/at a/ah/āh/at/āt a/at - - a/ah
062B ث th th th ṯ/s̱ th th θ θ θ
062C ج j j j ǧ ǧ ǧ j ǧ j g/ʒ ɡ d͡ʒ/ʤ
0686 چ ch - ch/zh č - č - - - - - -
062D ح ẖ/ḩ ħ ħ/ʜ/ḥ ħ
062E خ kh kh kh kh kh x/ꭓ x x
062F د d d d d d d d d d d d d
0630 ذ dh dh dh ḏ/ẕ dh dh ð ð ð
0631 ر r r r r r r r r r ɾ/r r r
0632 ز z z z z z z z z z z z z
0698 ژ - - zh ž - ž zh - - - - -
0633 س s s s s s s s s s s s s
0634 ش sh sh sh š š š sh š sh ʃ ʃ/š ʃ
0635 ص ş s̱/ş ş/sˠ/sˤ/ṣ
0636 ض ḏ/ḑ ḍ/ż ď/dˠ/dˤ/ḍ
0637 ط ţ ṯ/ṭ ţ/t̃/tˠ/tˤ/ṭ
0638 ظ d͟h/z̧ đ/ð̃/ðˠ/ðˤ/ẓ ðˤ
0639 ع ʻ ʿ ʻ ʿ ʿ ʿ ʿ ʿ ʿ ʕ ʕ/ʢ/ʻ/ʿ ʕ
063A غ gh gh gh ġ ġ ġ gh ġ gh ɣ ɣ ɣ
0641 ف f f f f f f f f f f f f
06A1 ڡ - - - - - - - - - - - -
06A4 ڤ v - v v - - - - - - - -
06A5 ڥ - - v v - - - - - - - -
06CB ۋ - - v - - - - - - - - -
0642 ق q q q q q q q q q q q/ɢ q
066F ٯ - - - - - - - - - - - -
0643 ك - k k k k k k k k k k
06A9 ک k - - - - - - - - - - -
06AA ڪ - - - - - - - - - - - -
06AD ڭ - - - - - - - - - - -
06AF گ g - g g - g g - - - - -
0644 ل l l l l l l l l l l/ɫ l l
0645 م m m m m m m m m m m m m
0646 ن n n n n n n n n n n n n
0647 ه h h h h h h h h h h h h
0648 و w/ū w w w w w/v/ū w/ū w ū/uww w/o w/ū w
064A ي y/ay y y y y y/ī y/ī y ī/iyy j/e y/ī j
0649 ى á/ī - á ā ā ā ā ī/iyy - -
06A2 ڢ - - f f - - - - - - - -
06A7 ڧ - - q q - - - - - - - -
06A8 ڨ g - - - - - - - - - - -
06B4 ڴ* g - - - - - - - - - - -
0627+064E اَ - - ā - - - - - - - - -
0627+0648 او āw/aw - - - - - - - - - - -
0627+064A ای āy/ī - - - - - - - - - - -
0648+064F وُ - - ū - - - - - - - - -
0648+0652+064E وَْ - - aw - - - - - - - - -
064A+0650 يِ - - ī - - - - - - - - -
064A+0652+064E يَْ - - ay - - - - - - - - -
0649+0650 ىِ - - ī - - - - - - - - -
0649+064E ىَ - - á - - - - - - - - -
0649+0652+064E ىَْ - - ay - - - - - - - - -
064E َ a - a a a a a a - a/æ/ɑ a a
064F ُ u - u u u u u u - u/ʊ u u
0650 ِ i - i i i i i i - i/ɨ/ʉ/ɪ i i
0622 آ ʼā/ā ʾā ā ʾā ʾā/ā ʾā ʾā - - - - ʔaː
0623 أ ʼ ʾa a ʾ/a ʾa ʾa ʾa - - - - -
0625 إ ʼ ʾi i ʾ/i ʾi ʾi ʾi - - - - -
0624 ؤ ʼ - u - ʾu ʾu ʾu - - - - -
0626 ئ ʼ - i - ʾi ʾi ʾi - - - - -
0621 ء ʼ ʾ ʼ ʾ ˈ/ˌ ʾ ʾ ʾ ʾ ʔ ʔ/ʼ ʔ
064B ً - - - an an an - - - - - an
064C ٌ - - - un un un - - - - - un
064D ٍ - - - in in in - - - - - in

Note: * 06B4 (ڴ): The BGN/PCGN Arabic 2019 document contains a Unicode labeling error (Table 3, row 6) where it lists "06B4" but shows character ڭ (06AD). Character 06B4 (GAF WITH THREE DOTS ABOVE) does not appear in BGN/PCGN Arabic 2019, and is not part of any Arabic Core romanization standard. It is used in extended Arabic scripts (Sindhi) and appears in ALA-LC Sindhi standard instead.


Persian Romanization (50 letters)

Persian script letters and their romanization

Standards Used:

  • ISO 233-3: ISO 233-3:2023 Persian romanization standard
  • UNGEGN: United Nations Group of Experts on Geographical Names Persian 2012
  • BGN/PCGN: Board on Geographic Names/Permanent Committee on Geographical Names Persian 1958/2019
  • DIN: Deutsches Institut für Normung 31635 Persian romanization
  • ALA-LC: American Library Association-Library of Congress Persian romanization
  • EI3: Encyclopedia of Islam Third Edition Persian transliteration
  • DMG: Deutsche Morgenländische Gesellschaft Persian romanization
  • OSU: Ohio State University Persian IPA

Note: For complete URL references and detailed documentation, see Persian/Farsi Standards in arabic-roman-standards.md.

Note:

  • Characters marked with - are not covered by that particular standard
  • Persian-specific characters: پ چ ژ گ
  • Some characters represent different phonemes in Persian vs Arabic
Unicode Character ISO 233-3 UNGEGN BGN/PCGN DIN ALA-LC EI3 DMG OSU
0627 ا ā ā ā ā ā ā ā ɒ
0628 ب b b b b b b b b
067E پ p p p p p p p
062A ت t t t t t t t
0629 ة - h - - - at - -
062B ث s th s
062C ج j j j ǧ j j j d͡ʒ
0686 چ č č ch č ch ch č t͡ʃ
062D ح h h
062E خ x x kh kh kh x
062F د d d d d d d d d
0630 ذ dh z dh ḏ/ẕ -
0631 ر r r r r r r r ɾ/r/ɹ
0632 ز z z z z z z z -
0698 ژ ž ž zh ž zh zh ž ʒ
0633 س s s s s s s s -
0634 ش š š sh š sh sh š ʃ
0635 ص ş s ş -
0636 ض z -
0637 ط ţ t ţ -
0638 ظ z z
0639 ع ʻ ʻ ʻ ʿ ʿ ʿ ʿ -
063A غ ġ q gh ġ gh gh ġ -
0641 ف f f f f f f f f
0642 ق q q q q q q q ɢ
0643 ك - - - - - - k -
06A9 ک k k k k k k k
06AF گ g g g g g g g ɡ
0644 ل l l l l l l l l
0645 م m m m m m m m m
0646 ن n n n n n n n n
0647 ه h h h h h h h/e h/æ
0648 و v v v/ū/ow w/ū v/ū/aw v/ū v/ū/o v/u/oʊ
064A ي - - - y/ī y/ī/ay - y/ī -
0649 ى - - y/ī/ey y/ī - - - i/eɪ
06CC ی y y y y/ī - ī/iyyi y/ī j/i/eɪ
0674 ٴ - - ʼ/e/ye - - - - -
0674 ٴ - - ʼ/e/ye - - - - -
062E+0648 خو - - - - - khw - -
062E+0648 خو - - - - - khw - -
064E َ - a a - - - a æ
064F ُ - o o - - - u u
0650 ِ - e e - - - i i
0654 ٔ - - ʼ/e/ye - - - - -
0622 آ ʼā ā ā/ʼā - - ʾā ʾā ʔɒ
0623 أ ʼa - - - - - - -
0625 إ ʼi - - - - - - -
0624 ؤ - - - - - - - -
0626 ئ - - - - - - - -
0621 ء ʼ ʼ - ʾ - ʾ - ʔ

Panjab Region Romanization (Urdu/Punjabi) (89 letters)

Urdu and Punjabi script letters and their romanization

Standards Used:

  • ALA-LC: American Library Association-Library of Congress Urdu romanization
  • BGN/PCGN: United States Board on Geographic Names / Permanent Committee on Geographical Names for British Official Use Urdu romanization (2018)
  • UNGEGN Urdu: United Nations Group of Experts on Geographical Names Urdu romanization (1972)
  • DIN: Deutsche Institut für Normung Urdu romanization (DIN 31635)
  • DMG: Deutsche Morgenländische Gesellschaft Urdu romanization
  • EI3: Encyclopaedia of Islam, 3rd edition romanization for Urdu and Punjabi
  • UNGEGN Punjabi: United Nations Group of Experts on Geographical Names Punjabi romanization
  • CLE: Centre for Language Engineering Urdu IPA
  • Wikipedia Urdu: Wikipedia Urdu language IPA
  • Wikipedia Punjabi: Wikipedia Punjabi language IPA

Note: For complete URL references and detailed documentation, see Panjab Region Standards in arabic-roman-standards.md.

Unicode Character ALA-LC BGN/PCGN UNGEGN Urdu DIN DMG EI3 UNGEGN Punjabi CLE WP Urdu WP Punjabi
0621 ء - ʼ - ʾ ʾ ʾ - ə ʔ ʔ
0622 آ ʼā ā - - ʾā ʾā - ɑ ɑ
0623 أ - - - - ʾa - - - - -
0624 ؤ - - - - ʾu - - - - -
0625 إ - - - - ʾi - - - - -
0626 ئ - ʼ - - ʾi - - - - -
0627 ا ā ā ā ā ā ā ā ɑ ɑ
0628 ب b b b b b b b b b b
062A ت t t t t t t t
062B ث s th s s s
062C ج j j j j ǧ j j d͡ʒ/ʤ d͡ʒ/ʤ d͡ʒ/ʤ
062D ح h h ɦ ɦ
062E خ kh kh ḳh kh kh x x x
062F د d d d d d d d
0630 ذ z dh dh z z z
0631 ر r r r r r r r r r ɾ
0632 ز z z z z z z z z z z
0633 س s s s s s s s s s s
0634 ش sh sh sh š š sh sh ʃ ʃ ʃ
0635 ص ş s s s s
0636 ض z ż z z z
0637 ط ţ t
0638 ظ z z z z
0639 ع ʻ ʻ ʻ ʿ ʿ ʿ ʿ ʔ ʔ ʔ
063A غ gh gh gḥ ġ ġ gh gh ɣ ɣ ɣ
0641 ف f f f f f f f f f f
0642 ق q q q q q q q q q q
0643 ك - - - - k - - - - -
0644 ل l l l l l l l l l l
0645 م m m m m m m m m m m
0646 ن n n n n n/ṇ n n n n
0647 ه - h - h - h - - - -
0648 و v/ū/o/au w/o/ū v/ẉ v/ū/o w/ū/o/ō w v v/o ʋ
0649 ى - y y y/ī - - - - - -
064A ي - - - y/ī y/ī - - - - -
064E ◌َ a a a - a - - ə ə ə
064F ◌ُ u u u - u - - ʊ ʊ ʊ
0650 ◌ِ i i i - i - - ɪ/ɛ ɪ ɪ
0670 ◌ٰ - á - - - - - - - -
0679 ٹ ʈ ʈ ʈ
067E پ p p p p p p p p p p
067F ٿ - - - - - - - - -
0686 چ c ch ch c č ch ch t͡ʃ/ʧ t͡ʃ/ʧ t͡ʃ/ʧ
0688 ڈ ɖ ɖ ɖ
0691 ڑ ɽ ɽ ɽ
0698 ژ zh zh ž ž zh zh ʒ ʒ ʒ
06A9 ک k k k k k k k k k k
06AF گ g g g g g g g ɡ ɡ ɡ
06BA ں ñ n - ŋ ŋ
06BB ڻ - - - - - - - - ɳ -
06BE ھ h h h h h h h h h h
06C1 ہ h h h h h - h h ɦ ɦ
06CC ی y/ī/e/á/ai y - y/ī y/ī y y j/i j
06D2 ے e e e/ai e e/ē ē ē e e e
0768 ݨ - - - - - - - - - ɳ
076A ݪ - - - - - - - - - ɭ
0627+06BA اں - - - - - - - ɑ̃ - -
0628+06BE بھ bh bh - - - b͟h -
062A+06BE تھ th th - - - t͟h - t̪ʰ t̪ʰ t̪ʰ
062C+06BE جھ jh jh - - - j͟h - d͡ʒʰ/ʤʰ d͡ʒʱ/ʤʱ d͡ʒʱ/ʤʱ
062F+06BE دھ dh dh - - - d͟h - d̪ʰ d̪ʱ d̪ʱ
0631+06BE رھ - - - - - - - - -
0644+06BE لھ - - - - - l͟h - -
0645+06BE مھ - - - - - - - -
0646+06AF نگ - - - - - - - ŋ - -
0646+06BE نھ - - - - - n͟h - -
0648+064E ◌َو - - - - - - - ɔ - -
0648+064F ◌ُو - - - - - - - u - -
0648+06BA وں - - - - - - - õ - -
0648+06BE وھ - - - - - - - - -
064E+0648 ◌َو - au - - - - - - - -
064E+06D2 ◌َے - ai - - - - - - - -
064F+0648 ◌ُو - ū - - - - - - - -
0650+0649 ◌ِى - ī - - - - - - - -
0679+06BE ٹھ ṭh ṭh - - - - - ʈʰ ʈʰ ʈʰ
067E+06BE پھ ph ph - - - p͟h -
0686+06BE چھ ch chh - - - c͟h͟h - t͡ʃʰ/ʧʰ t͡ʃʰ/ʧʰ t͡ʃʰ/ʧʰ
0688+06BE ڈھ ḍh ḍh - - - ď͟h - ɖʰ ɖʱ ɖʱ
0691+06BE ڑھ ṛh rh - - - - - ɽʰ ɽʱ ɽʱ
06A9+06BE کھ kh kh - - - k͟h -
06AF+06BE گھ gh gh - - - g͟h - ɡʰ ɡʱ ɡʱ
06CC+06BA یں - - - - - - - - -
06CC+06BE یھ - - - - - - - -
06D2+064E ◌َے - - - - - - - æ - -
0648+064E+06BA وَں - - - - - - - ɔ̃ - -
0648+064F+06BA وُں - - - - - - - ũ - -
06CC+064E+06BA يَں - - - - - - - æ̃ - -
06CC+0650+06BA يِں - - - - - - - ĩ - -

Ottoman Turkish Romanization (50 characters)

Ottoman Turkish script letters and their romanization according to DMG Turkish, ALA-LC Ottoman, and EI3 Ottoman standards

Standards Used:

  • DMG Turkish: Deutsche Morgenländische Gesellschaft adaptation for Ottoman Turkish
  • ALA-LC Ottoman: American Library Association - Library of Congress Ottoman Turkish romanization
  • EI3 Ottoman: Encyclopedia of Islam 3 Ottoman Turkish transliteration guidelines

Note: For complete URL references and detailed documentation, see Ottoman Turkish Standards in arabic-roman-standards.md.

Note:

  • Characters marked with - represent simplified Turkish adaptations without specific DMG diacritics
  • Ottoman Turkish used Arabic script from 1299-1928
  • Turkish-specific adaptations: emphatic consonants simplified, vowel harmony applied
  • DMG Turkish: Verified against DMG_denkschrift.txt (Turkish/Ottoman section)
  • ALA-LC Ottoman: Verified against ALA-LC_Ottoman.pdf
Unicode Character DMG ALA-LC EI3
0627 ا â - ʾ/ā
0628 ب b b b
067E پ p p p
062A ت t t t
0629 ة t h at
062B ث s s th
062C ج ǧ c c
0686 چ č ç ç
062D ح h
062E خ kh
062F د d d d
0630 ذ z z dh
0631 ر r r r
0632 ز z z z
0698 ژ ž j zh
0633 س s s s
0634 ش š ş ş
0635 ص s
0636 ض d ż
0637 ط t/d
0638 ظ z
0639 ع ʿ ʻ ʿ
063A غ ġ ġ gh
0641 ف f f f
0642 ق q q
0643 ك k/g k k/g/ğ/ñ
06A9 ک k/g k k/g/ğ/ñ
06AD ڭ j̈/ŋ ñ ñ
06AF گ g g g
0644 ل l l l
0645 م m m m
0646 ن n n n
0647 ه h h h/e/at
0648 و v v v/ū/w
064A ي y - ī/iyye
0649 ى y y ī/iyye
06CC ی y y ī/iyye
064E َ a - -
064F ُ u/o - -
0650 ِ e/i - -
0622 آ ʾâ - ʾā
0623 أ ʾa ʼ ʾa
0625 إ ʾi - ʾi
0624 ؤ ʾu ʼ ʾu
0626 ئ ʾi ʼ ʾi
0621 ء ʾ ʼ ʾ
064B ً an - -
064C ٌ un - -
064D ٍ in - -

Notes:

  1. ك/ک (Kaf): Source document indicates "k, g, j̈, bzw. ŋ" (beziehungsweise = respectively/or). The romanization varies by context and phonetic environment in Turkish.

  2. ڭ (Nef/Sagir Nün): DMG Note 8 states: "Im Osttürkischen, wo das Sagir Nün durch Nün-Kaf wiedergegeben wird, da umschreibe man es auch durch ng." (In Eastern Turkish, where the Sagir Nün is represented by Nün-Kaf, it should be transliterated as "ng"). This is the 28th letter of the Ottoman alphabet, specifically representing the nasal ng sound (IPA: [ŋ]).


Uyghur/Kazakh/Kyrgyz Romanization (AAR3R+AAR3P) (54 characters)

Turkic language Arabic script romanization: Uyghur, Kazakh, and Kyrgyz

Standards Used:

  • BGN/PCGN: Board on Geographic Names/Permanent Committee on Geographical Names Uyghur 2024
  • Wikipedia Uyghur: Wikipedia Uyghur language IPA
  • Wikipedia Kazakh: Wikipedia Kazakh alphabets IPA correspondence
  • Wikipedia Kyrgyz: Wikipedia Kyrgyz alphabets IPA correspondence

Note: For complete URL references and detailed documentation, see Uyghur/Kazakh/Kyrgyz Standards in arabic-roman-standards.md.

Note:

  • Kazakh is transitioning from Cyrillic to Latin script through 2031. Historical Arabic script correspondence is documented for reference only.
  • Kyrgyz currently uses Cyrillic script officially. Historical Arabic script correspondence is documented for reference only.
Unicode Character BGN/PCGN WP Uyghur WP Kazakh WP Kyrgyz
0627 ا a ɑ ɑ ɑ
06D5 ە e ɛ e e
0675 ٵ - - æ -
0628 ب b b b b
067E پ p p p p
062A ت t t t t
062C ج j d͡ʒ/ʤ ʑ dʒ/ʤ
0686 چ ch t͡ʃ/ʧ tɕ/ʨ tʃ/ʧ
062D ح - -
062E خ x - -
062F د d d d d
0631 ر r r ɾ r
0632 ز z z z z
0698 ژ zh ʒ - -
0633 س s s s s
0634 ش sh ʃ ɕ ʃ
0639 ع - - ʁ ʁ
063A غ gh ʁ - -
0641 ف f f f f
06CB ۋ w v w -
0642 ق q q q q
0643 ك k k k k
06AD ڭ ng ŋ ɴ ŋ
06AF گ g ɡ ɡ g
0644 ل l l l l
0645 م m m m m
0646 ن n n n n
0647 ه - - - -
06BE ھ h h h -
0648 و o o o o
06C5 ۅ - - - ø
06C6 ۆ ö ø v v
06C7 ۇ u u ʊ u
06C8 ۈ ü y - -
06C9 ۉ - - - y
0676 ٶ - - ø -
0677 ٷ - - ʏ -
064A ي y j j j
0649 ى i i ə ɯ
06D0 ې ë e - -
0678 ٸ - - ɪ -
0674 ٴ - - - -
0626 ئ - ʔ - i
0626+0627 ئا a - - -
0626+06D0 ئې ë - - -
0626+06D5 ئە e - - -
0626+0649 ئى i - - -
0626+0648 ئو o - - -
0626+06C6 ئۆ ö - - -
0626+06C7 ئۇ u - - -
0626+06C8 ئۈ ü - - -
062A+0633 تس - - - ts/ʦ
0634+0686 شچ - - - ʃtʃ/ʃʧ

Kashmiri/Saraiki/Balti Romanization (AAR4R)

Kashmiri, Saraiki, and Balti script letters and their romanization (Nastaliq script style)

Standards Used:

  • ALA-LC: American Library Association-Library of Congress Kashmiri romanization
  • Wikipedia Kashmiri: Wikipedia Kashmiri language IPA
  • Wikipedia Saraiki: Wikipedia Saraiki alphabet IPA
  • Wikipedia Balti: Wikipedia Balti language IPA

Note: For complete URL references and detailed documentation, see Extended Arabic Script Standards in arabic-roman-standards.md.

Unicode Character ALA-LC WP Kashmiri WP Saraiki WP Balti WP Kashmiri IPA WP Balti IPA
0620 ؠ ʹ ʲ - - -
0622 آ ā - - - - -
0623 أ - - - - -
0625 إ - - - - -
0627 ا a/ā/i/u/o/ō/ọ/e/ē a ā/a/e/o ɑ/ə/e/o
0627+065F اٟ ūʼ - - - - -
0628 ب b b b b b b
062A ت t t t t t
062A+06BE تھ th th t̪ʰ - - -
062B ث s s s s s
062C ج j j d͡ʒ/ʤ d͡ʒ/ʤ j d͡ʒ/ʤ
062D ح h h ɦ h h
062E خ k͟h kh x/kʰ x x x
062F د d d d d d
0630 ذ z z z z z
0631 ر r r r r r ɾ
0632 ز z z z z z z
0633 س s s s s s s
0634 ش ś sh/š ʃ ʃ š ʃ
0635 ص s s s s s
0636 ض z z z z z
0637 ط t t t t
0638 ظ z z z z z
0639 ع ʻ ā/a/e/o ɑ/ə/e/o
063A غ g͟h g/ğ ɣ/ɡ ɣ ǧ ʁ/ɢ
0641 ف f f f/pʰ f f pʰ/f
0642 ق q q q/k q q q
0643 ك k k k - - -
0643+06BE كھ - kh - - -
0644 ل l l l l l l/ɭ/ɫ
0645 م m m m m m m
0646 ن n n n/◌̃ n n n
0647 ه h - - - - -
0648 و v/ū/o/ō v/w ʋ v w/u w/u
0649 ى ī/ẏ - - - - -
064A ي y - - - - -
064E َ a - - - - -
064F ُ u - - - - -
0650 ِ i - - - - -
0653 ٓ ā - - - - -
0654 ٔ - - - - -
0655 ٕ - - - - -
0672 ٲ ạ̄ - - - - -
0679 ٹ ʈ ʈ ʈ ʈ
0679+06BE ٹھ ṭh ṭh ʈʰ - - -
067B ٻ - - - ɓ - -
067E پ p p p p p p
067E+06BE پھ ph ph - - -
0683 ڃ - - - - ž ʒ
0684 ڄ - - - ʄ - -
0686 چ c ch/č t͡ʃ/ʧ t͡ʃ/ʧ č t͡ʃ/ʧ
0686+06BE چھ ch chh/čh t͡ʃʰ/ʧʰ - - -
0687 ڇ - - - - č̣ ʈ͡ʂ/ꭧ
0688 ڈ ɖ ɖ ɖ
0691 ڑ ɽ ɽ ɽ
0697 ڗ - - - - đ/dz d͡z/ʣ
0698 ژ ts ts t͡s/ʦ ʒ c/ts t͡s/ʦ
0698+06BE ژھ tsh tsh t͡sʰ/ʦʰ - - -
06A9 ک k - - k k k
06A9+0654 کٔ - - - - ǩ/ṡ ɕ
06A9+06BE کھ kha - - - - -
06AF گ g g ɡ g g ɡ
06B3 ڳ - - - ɠ - -
06BA ں - ñ ◌̃ ◌̃ - -
06BE ھ - - - ◌ʰ h ʰ/ʱ
06C1 ہ - h h ɦ h h
06C4 ۄ - - - - -
06C6 ۆ o - - - - -
06CC ی y/ī/ē y j j y/i j/i
06D2 ے y/ē y j e e/ay e
0759 ݙ - - - - -
075C ݜ - - - - ʂ ʂ
0768 ݨ - - - ɳ ŋ/ng ŋ
0769 ݩ - - - - ň/ny ɲ

Arabic Extended Romanization (AAR5R)

Pashto, Sindhi, Kurdish (Sorani), and Balochi script letters and their romanization (Naskh script style)

Standards Used:

  • ALA-LC Pashto: American Library Association-Library of Congress Pashto romanization
  • DIN Pashto: Deutsches Institut für Normung 31635 Pashto romanization
  • ALA-LC Sindhi: American Library Association-Library of Congress Sindhi romanization
  • SLA: Sindhi Language Authority Sindhi IPA
  • Wikipedia Sindhi: Wikipedia Sindhi alphabet IPA
  • DIN Kurdish: Deutsches Institut für Normung 31635 Kurdish Sorani romanization
  • ALA-LC Kurdish: American Library Association-Library of Congress Kurdish Sorani romanization
  • BGN/PCGN: Board on Geographic Names/Permanent Committee on Geographical Names Baluchi 2008
  • Wikipedia Pashto: Wikipedia Pashto phonology IPA
  • Wikipedia Kurdish: Wikipedia Kurdish IPA
  • Wikipedia Balochi: Wikipedia Balochi alphabets IPA
  • BAS: Balochi Academy Sarbaz standard romanization (2017)
  • BAS IPA: Balochi Academy Sarbaz IPA phonetic transcription

Note: For complete URL references and detailed documentation, see Extended Arabic Script Standards in arabic-roman-standards.md.

Unicode Character ALA-LC Pashto DIN Pashto WP Pashto ALA-LC Sindhi SLA WP Sindhi DIN Kurdish ALA-LC Kurdish WP Kurdish BGN/PCGN BAS BAS IPA WP Balochi
0621 ء ʼ ʾ ʔ ʔ - - - ʼ - - -
0621+064E ءَ - - - - - - - - - ā - - -
0621+0650 ءِ - - - - - - - - - ay - - -
0622 آ - - - - - - - - - ā à ɑ ɑ
0624 ؤ - - - - - - - - - - - - ɑuː
0626 ئ - ei - - - - - - - - ae ae/ɛ ɛ
0626+0647 ئه - - - - - - - e - - - - -
0626+0650 ئِ - - - - - - - i - - - - -
0626+06CC ئی - - - - - - - - - - ai ɑiː ɑiː
0626+06CE ئێ - - - - - - - ê - - - - -
0627 ا a/u/i/ā/ū/ī/o/e/aw/ay/ạ ā ɑ a ʔ/aː a a ɑː ā à ɑ a
0627+0624 اؤ - - - - - - - - - - au ɑuː -
0627+06CC ای - - - - - - - - - - i i -
0628 ب b b b b b b b b b b b b b
0628+06BE بھ - - - - - - - - - bh - - -
0629 ة h - - - - - - - - - - - -
062A ت t t t t t t t/tʼ t t t t
062A+06BE تھ - - - - - - - - - thʼ - - -
062B ث s ś s - - - t͟h ť -
062C ج j ǧ d͡ʒ/ʤ j ɟ d͡ʑ/ʥ c c d͡ʒ/ʤ j j d͡ʒ/ʤ d͡ʒ/ʤ
062C+0647 جه - - - jh ɟʰ - - - - - - - -
062C+0647+06C1 جهہ - - - - - d͡ʑʰ/ʥʰ - - - - - - -
062C+06BE جھ - - - - - - - - - jh - - -
062D ح h ħ h ḧ/hʼ ħ - - -
062D+06BE حھ - - - - - - - - - dhʼ - - -
062E خ kh x k͟h x x x x x kh - - -
062F د d d d d d d d d d d d
0630 ذ z ż z - - - d͟h ď ɗ -
0631 ر r r r r r r r r ɾ r r ɾ ɾ
0631+06BE رھ - - - - - - - - - ṛh - - -
0632 ز z z z z z z z z z z z z z
0633 س s s s s s s s s s s s s s
0634 ش sh š ʃ sh ʃ ʂ ş ș/ş ʃ sh š ʃ ʃ
0635 ص s ʂ s - - ş - - -
0636 ض z ʑ z - - - - -
0637 ط t ŧ t - - ţ - - -
0638 ظ z ʑ z - - - - - -
0639 ع ʻ ʿ ʔ ʻ ʔ ɑː/oː/eː/ʔ ʿ ʻ - ʻ - - -
0639+0647 عه - - - - - - - ʻe/eʼ - - - - -
063A غ gh ġ ɣ g͟h ɣ ɣ ɣ gh - - -
0641 ف f f f f f f f f f f - - f
0642 ق q q q q q q q q q q - - -
0643 ك k - - - - - - k - k - - -
0644 ل l l l l l l l l l l l l l
0644+0647+06C1 لهہ - - - - - - - - - - - -
0645 م m m m m m m m m m m m m m
0645+0647+06C1 مهہ - - - - - - - - - - - -
0646 ن n n n n n n/◌̃ n n n n n n n
0646+0647+06C1 نهہ - - - - - - - - - - - -
0646+065A نٚ - - - - - - - - ŋ - - - -
0647 ه h h h h ɦ h - e - h - - -
0648 و w/ū/o/aw/u w/ū/ō/au w v/o w ʋ/ʊ/oː/ɔː/uː u/w u/w w w/o w w w
0648+0648 وو - - - - - - û û - u u -
0649 ى y/á/ạy y/ī/ē/ai/ei - - - - î/y - y - - -
064A ي y y/ī/ē/ai/ei j y/e j - î/y î/y - - - - -
064B ً an - - - - - - - - - - - -
064E َ - - - a - - - - - a a ʌ a
064E+0627 َا - - - ā - - - - - - - - -
064E+0648+0652 َوْ - - - au - - - - - - - - -
064E+06CC َی - - - ā - - - - - - - - -
064E+06CC+0652 َیْ - - - ai - - - - - - - - -
064E+06D2 َے - - - ā - - - - - - - - -
064F ُ - - - u - - - - - u o o o
064F+0648 ُو - - - ū - - - - - - - - -
0650 ِ - - - i - - - i - i e e e
0650+06CC ِی - - - ī - - - - - - - - -
0651 ّ (doubled) - - - - - - - - - - - -
0670 ٰ - - - - - - - - - á - - -
0679 ٹ - - - - - - - - ť ʈ
0679+06BE ٹھ - - - - - - - - - ṭh - - -
067A ٺ - - - ṭh ʈh ʈʰ - - - - - - -
067B ٻ - - - ɓ ɓ - - - - - - -
067C ټ ʈ - - - - - - - - -
067D ٽ - - - ʈ ʈ - - - - - - -
067E پ p p p p p p p p/pʼ p p p p p
067E+06BE پھ - - - - - - - - - ph - - -
067F ٿ - - - th - - - t͟h - - -
0680 ڀ - - - bh - - - - - - -
0681 ځ ż ć d͡z/ʣ - - - - - - - - - -
0683 ڃ - - - ñ ɲ ɲ - - - - - - -
0684 ڄ - - - ʄ ʄ - - - - - - -
0685 څ c t͡s/ʦ - - - - - - - - - -
0686 چ ch č t͡ʃ/ʧ c c t͡ɕ/ʨ ç ç t͡ʃ/ʧ ch c t͡ʃ/ʧ t͡ʃ/ʧ
0686+06BE چھ - - - - - - - - - chh - - -
0687 ڇ - - - ch t͡ɕʰ/ʨʰ - - - - - - -
0688 ڈ - - - - - - - - ď ɗ ɖ
0688+06BE ڈھ - - - - - - - - - ḍh - - -
0689 ډ ɖ - - - - - - - - -
068A ڊ - - - ɖ - - - - - - -
068C ڌ - - - dh - - - - - - -
068D ڍ - - - ḍh ḍh ɖʱ - - - - - - -
068F ڏ - - - ɗ ɗ - - - - - - -
0691 ڑ - ŕ - - - - - - - - - -
0693 ړ ŕ ɽ - - - - - - - - -
0695 ڕ - - - - - - r - - - -
0696 ږ ẓh ǵ ʐ - - - - - - - - - -
0698 ژ zh ž ʒ - - ʒ j j ʒ zh ž ʒ ʒ
0699 ڙ - - - ɽ ɽ - - - - - - -
0699+0647+06C1 ڙهہ - - - - - ɽʰ - - - - - - -
069A ښ ṣh ʂ - - - - - - - - - -
06A4 ڤ - - - - - - v v v - - - -
06A6 ڦ - - - ph - - - - - - -
06A9 ک - k k kh k k/kʼ k k k k k
06A9+06BE کھ - - - - - - - - - khʼ - - -
06AA ڪ - - - k k k - - - - - - -
06AB ګ g g ɡ - - - - - - - - - -
06AD ڭ - - ŋ - - - - - - - - - -
06AF گ g g ɡ g g ɡ g g ɡ g g ɡ ɡ
06AF+0647 گه - - - gh - - - - - - - -
06AF+0647+06C1 گهہ - - - - - ɡʱ - - - - - - -
06AF+06BE گھ - - - - - - - - - ghʼ - - -
06B1 ڱ - - - ŋ ŋ - - - - - - -
06B3 ڳ - - - ɠ ɠ - - - - - - -
06B5 ڵ - - - - - - ł ɫ - - - -
06BA ں - - - - - - - - - ñ - - -
06BB ڻ - - - ɳ ɳ - - - - - - -
06BB+0647+06C1 ڻهہ - - - - - ɳʰ - - - - - - -
06BC ڼ ń ɳ - - - - - - - - - -
06BE ھ h h h - - h h h h h h h h
06C1 ہ - - - - - ə/əʰ e - - h h h h
06C6 ۆ - - - - - - o o - - - -
06CC ی y/ī/e/ay y/ī/ē/ai/ei j - - j/iː î/y î/y j - y j j
06CD ۍ ạy ei əi - - - - - - - - - -
06CE ێ - - - - - - ê ê - - - -
06CF ۏ - - - - - - - - - - ò ɯ
06D0 ې e ē e - - - - - - - - - -
06D2 ے e ē - e - - - - - e è ɪ
06D3 ۓ ạy ei - - - - - - - - - - -
06D5 ە - - - - - - - - ɛ - - - -
06FD ۽ - - - - - ãĩ̯ - - - - - - -
06FE ۾ - - - - - mẽ - - - - - - -
0754 ݔ - - - - - - - - - - è ɪ -

Standards Abbreviations

  • BGN/PCGN: Board on Geographic Names/Permanent Committee on Geographical Names
  • UNGEGN: United Nations Group of Experts on Geographical Names (shows 1972/2018 when versions differ)
  • ALA-LC: American Library Association-Library of Congress
  • DIN: Deutsches Institut für Normung
  • ISO: International Organization for Standardization (ISO 233:1984)
  • ISO 233-2: ISO 233-2:1993 Simplified Arabic romanization
  • Wehr: Hans Wehr system (Deutsche Morgenländische Gesellschaft)
  • Brill: Brill Simple Arabic transliteration
  • EI3: Encyclopedia of Islam Third Edition romanization system
  • Hunterian: Hunterian transliteration system (for Urdu)
  • Local Standard: Regional or national romanization standards

Notes

Data Sources

  • Arabic Core (AAR1R): Based on adobe-arabic-1-roman.txt
    • IPA values verified against Ohio State University Egyptian Arabic IPA Guide
  • Urdu/Farsi/Punjabi (AAR2R): Based on adobe-arabic-2-roman.txt with linguistic research
    • IPA values verified against Ohio State University Persian IPA Guide
  • Uyghur/Kazakh/Kyrgyz (AAR3R): Based on adobe-arabic-3-roman.txt with Turkic language standards
  • Extended (AAR5R): Based on adobe-arabic-5-roman.txt with specialized language research

Data Verification

  • Verified data: All romanization mappings verified against official source documents
  • UNGEGN with / syntax: Shows differences between 1972 and 2018 versions (format: 1972/2018)
  • Alternative romanizations: Slash notation (/) indicates multiple valid romanization options
  • Character codes and names: All verified from official Unicode specifications and Adobe module files

Character Selection

  • Only Arabic script letters are included (no digits, punctuation, or diacritics)
  • Focus on characters that have meaningful romanization equivalents
  • Shared characters between modules are noted in original module files

Romanization Standards

  • Each table uses standards most relevant to its language family
  • Arabic Core: International geographic and academic standards
  • AAR2R: Library and linguistic standards for Indo-Iranian languages
  • AAR3R: Geographic standards for Turkic languages
  • Extended: Specialized standards for individual languages (Pashto, Sindhi, etc.)