Commit 5c1cbad

Thin ن (#756)
* UnicodeData.txt line from L2/23-121
* lb=AL
* Arabic
* ArabicShaping.txt from the proposal
* A new Joining_Group
* Regenerate UCD
* GenerateEnums
* This is silly
1 parent c64da22 commit 5c1cbad

22 files changed: 74 additions, 41 deletions

unicodetools/data/ucd/dev/ArabicShaping.txt

Lines changed: 1 addition & 0 deletions
@@ -851,6 +851,7 @@ A873; PHAGS-PA CANDRABINDU; U; No_Joining_Group
 10EC2; DAL WITH VERTICAL 2 DOTS BELOW; R; DAL
 10EC3; TAH WITH VERTICAL 2 DOTS BELOW; D; TAH
 10EC4; KAF WITH VERTICAL 2 DOTS BELOW; D; KAF
+10EC6; THIN NOON; D; THIN NOON

 # Sogdian Characters

unicodetools/data/ucd/dev/DerivedAge.txt

Lines changed: 3 additions & 3 deletions
@@ -1,5 +1,5 @@
 # DerivedAge-17.0.0.txt
-# Date: 2024-11-14, 14:54:38 GMT
+# Date: 2024-11-14, 15:12:52 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2081,10 +2081,10 @@ FBC3..FBD2 ; 17.0 # [16] ARABIC LIGATURE JALLA WA-ALAA..ARABIC LIGATURE ALAY
 FD90..FD91 ; 17.0 # [2] ARABIC LIGATURE RAHMATU ALLAAHI ALAYH..ARABIC LIGATURE RAHMATU ALLAAHI ALAYHAA
 FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIGATURE KARRAMA ALLAAHU WAJHAH
 10940..1095C ; 17.0 # [29] SIDETIC LETTER N01..SIDETIC LETTER N29
-10EC5 ; 17.0 # ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC5..10EC6 ; 17.0 # [2] ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW..ARABIC LETTER THIN NOON
 10ED1..10ED8 ; 17.0 # [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 10EFB ; 17.0 # ARABIC SMALL LOW NOON

-# Total code points: 103
+# Total code points: 104

 # EOF
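
The `10EC5..10EC6` line above follows the layout shared by the derived UCD files: a code point or `start..end` range, a `;`, the property value, and a `#` comment that carries a `[n]` count for ranges. Below is a minimal Python sketch of reading such lines. `parse_range_line` and `range_size` are hypothetical helpers written for illustration, not part of unicodetools, and the sketch assumes well-formed input.

```python
def parse_range_line(line):
    """Split 'start[..end] ; value # comment' into its parts.

    Returns None for blank lines and comment-only lines.
    """
    data, _, comment = line.partition("#")
    if not data.strip():
        return None
    code_points, _, value = data.partition(";")
    first, _, last = code_points.strip().partition("..")
    start = int(first, 16)
    end = int(last, 16) if last else start
    return start, end, value.strip(), comment.strip()


def range_size(start, end):
    """Number of code points in an inclusive range."""
    return end - start + 1


# The removed and added DerivedAge.txt lines from the hunk above.
old = parse_range_line("10EC5 ; 17.0 # ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW")
new = parse_range_line(
    "10EC5..10EC6 ; 17.0 # [2] ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW"
    "..ARABIC LETTER THIN NOON"
)

# How many code points the edit adds to the Age=17.0 set.
growth = range_size(new[0], new[1]) - range_size(old[0], old[1])
```

Here `growth` is the difference between the two range sizes, which is the quantity the `# Total code points` line tracks when it goes from 103 to 104.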

unicodetools/data/ucd/dev/DerivedCoreProperties.txt

Lines changed: 13 additions & 7 deletions
@@ -1,5 +1,5 @@
 # DerivedCoreProperties-17.0.0.txt
-# Date: 2024-11-14, 14:55:09 GMT
+# Date: 2024-11-14, 15:13:20 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1053,6 +1053,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
 10EB0..10EB1 ; Alphabetic # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; Alphabetic # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; Alphabetic # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; Alphabetic # Lo ARABIC LETTER THIN NOON
 10EFB..10EFC ; Alphabetic # Mn [2] ARABIC SMALL LOW NOON..ARABIC COMBINING ALEF OVERLAY
 10F00..10F1C ; Alphabetic # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F27 ; Alphabetic # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
@@ -1441,7 +1442,7 @@ FFDA..FFDC ; Alphabetic # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANG
 30000..3134A ; Alphabetic # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
 31350..323AF ; Alphabetic # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF

-# Total code points: 142799
+# Total code points: 142800

 # ================================================

@@ -6742,6 +6743,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
 10EB0..10EB1 ; ID_Start # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; ID_Start # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; ID_Start # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; ID_Start # Lo ARABIC LETTER THIN NOON
 10F00..10F1C ; ID_Start # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F27 ; ID_Start # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
 10F30..10F45 ; ID_Start # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -6975,7 +6977,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
 30000..3134A ; ID_Start # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
 31350..323AF ; ID_Start # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF

-# Total code points: 141308
+# Total code points: 141309

 # ================================================

@@ -7909,6 +7911,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
 10EB0..10EB1 ; ID_Continue # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; ID_Continue # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; ID_Continue # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; ID_Continue # Lo ARABIC LETTER THIN NOON
 10EFB..10EFF ; ID_Continue # Mn [5] ARABIC SMALL LOW NOON..ARABIC SMALL LOW WORD MADDA
 10F00..10F1C ; ID_Continue # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F27 ; ID_Continue # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
@@ -8384,7 +8387,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
 31350..323AF ; ID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
 E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

-# Total code points: 144610
+# Total code points: 144611

 # ================================================

@@ -8929,6 +8932,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
 10EB0..10EB1 ; XID_Start # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; XID_Start # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; XID_Start # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; XID_Start # Lo ARABIC LETTER THIN NOON
 10F00..10F1C ; XID_Start # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F27 ; XID_Start # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
 10F30..10F45 ; XID_Start # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -9162,7 +9166,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
 30000..3134A ; XID_Start # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
 31350..323AF ; XID_Start # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF

-# Total code points: 141285
+# Total code points: 141286

 # ================================================

@@ -10097,6 +10101,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
 10EB0..10EB1 ; XID_Continue # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; XID_Continue # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; XID_Continue # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; XID_Continue # Lo ARABIC LETTER THIN NOON
 10EFB..10EFF ; XID_Continue # Mn [5] ARABIC SMALL LOW NOON..ARABIC SMALL LOW WORD MADDA
 10F00..10F1C ; XID_Continue # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F27 ; XID_Continue # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
@@ -10572,7 +10577,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
 31350..323AF ; XID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
 E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

-# Total code points: 144591
+# Total code points: 144592

 # ================================================

@@ -12347,6 +12352,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
 10EB0..10EB1 ; Grapheme_Base # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; Grapheme_Base # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; Grapheme_Base # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; Grapheme_Base # Lo ARABIC LETTER THIN NOON
 10ED1..10ED8 ; Grapheme_Base # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 10F00..10F1C ; Grapheme_Base # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F1D..10F26 ; Grapheme_Base # No [10] OLD SOGDIAN NUMBER ONE..OLD SOGDIAN FRACTION ONE HALF
@@ -12830,7 +12836,7 @@ FFFC..FFFD ; Grapheme_Base # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEME
 30000..3134A ; Grapheme_Base # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
 31350..323AF ; Grapheme_Base # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF

-# Total code points: 152803
+# Total code points: 152804

 # ================================================

unicodetools/data/ucd/dev/EastAsianWidth.txt

Lines changed: 2 additions & 1 deletion
@@ -1,5 +1,5 @@
 # EastAsianWidth-17.0.0.txt
-# Date: 2024-11-14, 14:55:18 GMT
+# Date: 2024-11-14, 15:13:27 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1966,6 +1966,7 @@ FFFD ; A # So REPLACEMENT CHARACTER
 10EB0..10EB1 ; N # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; N # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; N # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; N # Lo ARABIC LETTER THIN NOON
 10ED1..10ED8 ; N # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 10EFB..10EFF ; N # Mn [5] ARABIC SMALL LOW NOON..ARABIC SMALL LOW WORD MADDA
 10F00..10F1C ; N # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL

unicodetools/data/ucd/dev/LineBreak.txt

Lines changed: 2 additions & 1 deletion
@@ -1,5 +1,5 @@
 # LineBreak-17.0.0.txt
-# Date: 2024-11-14, 14:55:19 GMT
+# Date: 2024-11-14, 15:13:28 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2823,6 +2823,7 @@ FFFD ; AI # So REPLACEMENT CHARACTER
 10EB0..10EB1 ; AL # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; AL # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; AL # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; AL # Lo ARABIC LETTER THIN NOON
 10ED1..10ED8 ; AL # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 10EFB..10EFF ; CM # Mn [5] ARABIC SMALL LOW NOON..ARABIC SMALL LOW WORD MADDA
 10F00..10F1C ; AL # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL

unicodetools/data/ucd/dev/PropertyValueAliases.txt

Lines changed: 2 additions & 1 deletion
@@ -1,5 +1,5 @@
 # PropertyValueAliases-17.0.0.txt
-# Date: 2024-10-16, 17:27:56 GMT
+# Date: 2024-11-14, 15:13:47 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1090,6 +1090,7 @@ jg ; Taw ; Taw
 jg ; Teh_Marbuta ; Teh_Marbuta
 jg ; Teh_Marbuta_Goal ; Teh_Marbuta_Goal ; Hamza_On_Heh_Goal
 jg ; Teth ; Teth
+jg ; Thin_Noon ; Thin_Noon
 jg ; Thin_Yeh ; Thin_Yeh
 jg ; Vertical_Tail ; Vertical_Tail
 jg ; Waw ; Waw
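
The new `jg` alias `Thin_Noon` and the joining group `THIN NOON` in the ArabicShaping.txt entry for U+10EC6 name the same Joining_Group value. They differ only in case and in space versus underscore. The Python sketch below shows one way to check that correspondence. `parse_fields` and `loose_key` are hypothetical helpers, and `loose_key` is a simplified take on the loose matching rule for property values in UAX #44: it ignores case, spaces, underscores, and hyphens, and leaves out the rule's handling of an initial `is` prefix.

```python
def parse_fields(line):
    """Drop any '#' comment and split a UCD data line on ';'."""
    data = line.split("#", 1)[0]
    return [field.strip() for field in data.split(";")]


def loose_key(value):
    """Simplified loose matching: ignore case, spaces, underscores, hyphens."""
    return "".join(ch for ch in value if ch not in " _-").casefold()


# ArabicShaping.txt fields: code point; schematic name; joining type; joining group.
code, name, joining_type, joining_group = parse_fields(
    "10EC6; THIN NOON; D; THIN NOON"
)

# PropertyValueAliases.txt fields: property alias; short value name; long value name.
prop, short_name, long_name = parse_fields("jg ; Thin_Noon ; Thin_Noon")

# True when the ArabicShaping.txt value and the alias name the same group.
matches = loose_key(joining_group) == loose_key(long_name)
```

The unpacking assumes exactly three alias fields, which holds for the `Thin_Noon` line. Lines such as `Teh_Marbuta_Goal` carry an extra alias and would need a variable-length split.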

unicodetools/data/ucd/dev/Scripts.txt

Lines changed: 3 additions & 2 deletions
@@ -1,5 +1,5 @@
 # Scripts-17.0.0.txt
-# Date: 2024-11-14, 14:56:03 GMT
+# Date: 2024-11-14, 15:14:07 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -889,6 +889,7 @@ FE76..FEFC ; Arabic # Lo [135] ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE LA
 10E60..10E7E ; Arabic # No [31] RUMI DIGIT ONE..RUMI FRACTION TWO THIRDS
 10EC2..10EC4 ; Arabic # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; Arabic # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; Arabic # Lo ARABIC LETTER THIN NOON
 10ED1..10ED8 ; Arabic # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 10EFB..10EFF ; Arabic # Mn [5] ARABIC SMALL LOW NOON..ARABIC SMALL LOW WORD MADDA
 1EE00..1EE03 ; Arabic # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
@@ -926,7 +927,7 @@ FE76..FEFC ; Arabic # Lo [135] ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE LA
 1EEAB..1EEBB ; Arabic # Lo [17] ARABIC MATHEMATICAL DOUBLE-STRUCK LAM..ARABIC MATHEMATICAL DOUBLE-STRUCK GHAIN
 1EEF0..1EEF1 ; Arabic # Sm [2] ARABIC MATHEMATICAL OPERATOR MEEM WITH HAH WITH TATWEEL..ARABIC MATHEMATICAL OPERATOR HAH WITH DAL

-# Total code points: 1409
+# Total code points: 1410

 # ================================================

unicodetools/data/ucd/dev/UnicodeData.txt

Lines changed: 1 addition & 0 deletions
@@ -19635,6 +19635,7 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
 10EC3;ARABIC LETTER TAH WITH TWO DOTS VERTICALLY BELOW;Lo;0;AL;;;;;N;;;;;
 10EC4;ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW;Lo;0;AL;;;;;N;;;;;
 10EC5;ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW;Lm;0;AL;;;;;N;;;;;
+10EC6;ARABIC LETTER THIN NOON;Lo;0;AL;;;;;N;;;;;
 10ED1;ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM;So;0;ON;;;;;N;;;;;
 10ED2;ARABIC LIGATURE ALAYHIM AS-SALAATU WAS-SALAAM;So;0;ON;;;;;N;;;;;
 10ED3;ARABIC LIGATURE ALAYHIMAA AS-SALAATU WAS-SALAAM;So;0;ON;;;;;N;;;;;
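
Each UnicodeData.txt line, including the added U+10EC6 entry, is a record of 15 semicolon-separated fields. The first five are the code point, Name, General_Category, Canonical_Combining_Class, and Bidi_Class. The Python sketch below reads those five. `UnicodeDataRecord` and `parse_unicode_data_line` are hypothetical names used only for illustration, and the remaining ten fields are ignored.

```python
from typing import NamedTuple


class UnicodeDataRecord(NamedTuple):
    code_point: int
    name: str
    general_category: str
    canonical_combining_class: int
    bidi_class: str


def parse_unicode_data_line(line):
    """Read the first five of the 15 fields of a UnicodeData.txt line."""
    fields = line.rstrip("\n").split(";")
    if len(fields) != 15:
        raise ValueError(f"expected 15 fields, got {len(fields)}")
    return UnicodeDataRecord(
        code_point=int(fields[0], 16),
        name=fields[1],
        general_category=fields[2],
        canonical_combining_class=int(fields[3]),
        bidi_class=fields[4],
    )


# The line added by this commit.
thin_noon = parse_unicode_data_line(
    "10EC6;ARABIC LETTER THIN NOON;Lo;0;AL;;;;;N;;;;;"
)
```

The `AL` in this record is Bidi_Class=Arabic_Letter. The `lb=AL` item in the commit message refers to Line_Break=Alphabetic in LineBreak.txt, which shares the abbreviation but is a different property value.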

unicodetools/data/ucd/dev/VerticalOrientation.txt

Lines changed: 2 additions & 1 deletion
@@ -1,5 +1,5 @@
 # VerticalOrientation-17.0.0.txt
-# Date: 2024-11-14, 14:56:06 GMT
+# Date: 2024-11-14, 15:14:10 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1787,6 +1787,7 @@ FFFC..FFFD ; U # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARA
 10EB0..10EB1 ; R # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; R # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; R # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; R # Lo ARABIC LETTER THIN NOON
 10ED1..10ED8 ; R # So [8] ARABIC LIGATURE ALAYHAA AS-SALAATU WAS-SALAAM..ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 10EFB..10EFF ; R # Mn [5] ARABIC SMALL LOW NOON..ARABIC SMALL LOW WORD MADDA
 10F00..10F1C ; R # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL

unicodetools/data/ucd/dev/auxiliary/SentenceBreakProperty.txt

Lines changed: 3 additions & 2 deletions
@@ -1,5 +1,5 @@
 # SentenceBreakProperty-17.0.0.txt
-# Date: 2024-11-14, 14:56:04 GMT
+# Date: 2024-11-14, 15:14:08 GMT
 # © 2024 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2397,6 +2397,7 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
 10EB0..10EB1 ; OLetter # Lo [2] YEZIDI LETTER LAM WITH DOT ABOVE..YEZIDI LETTER YOT WITH CIRCUMFLEX ABOVE
 10EC2..10EC4 ; OLetter # Lo [3] ARABIC LETTER DAL WITH TWO DOTS VERTICALLY BELOW..ARABIC LETTER KAF WITH TWO DOTS VERTICALLY BELOW
 10EC5 ; OLetter # Lm ARABIC SMALL YEH BARREE WITH TWO DOTS BELOW
+10EC6 ; OLetter # Lo ARABIC LETTER THIN NOON
 10F00..10F1C ; OLetter # Lo [29] OLD SOGDIAN LETTER ALEPH..OLD SOGDIAN LETTER FINAL TAW WITH VERTICAL TAIL
 10F27 ; OLetter # Lo OLD SOGDIAN LIGATURE AYIN-DALETH
 10F30..10F45 ; OLetter # Lo [22] SOGDIAN LETTER ALEPH..SOGDIAN INDEPENDENT SHIN
@@ -2593,7 +2594,7 @@ FFDA..FFDC ; OLetter # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
 30000..3134A ; OLetter # Lo [4939] CJK UNIFIED IDEOGRAPH-30000..CJK UNIFIED IDEOGRAPH-3134A
 31350..323AF ; OLetter # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF

-# Total code points: 136943
+# Total code points: 136944

 # ================================================
