Commit 14496bf

UCD 17 updates from KenW jan23
1 parent a4fac43 commit 14496bf

4 files changed, +31 -19 lines changed

unicodetools/data/ucd/dev/NamesList.txt

Lines changed: 11 additions & 7 deletions
@@ -1,17 +1,17 @@
 ; charset=UTF-8
 @@@ The Unicode Standard 17.0.0
 @@@+ NamesList-17.0.0.txt
-@+ Generation Date: 2024-11-14, 11:05:33 GMT
+@+ Generation Date: 2025-01-23, 16:40:26 GMT
 Unicode 17.0.0 names list.
-Repertoire synched with UnicodeData-17.0.0d1.txt.
-Synch with 7th edition CD; post UTC181 annotation updates for 17.0.
+Repertoire synched with UnicodeData-17.0.0d2.txt.
+Synch with 7th edition CD; post UTC182 annotation updates for 17.0.
 This file is semi-automatically derived from UnicodeData.txt and
 a set of manually created annotations using a script to select
 or suppress information from the data file. The rules used
 for this process are aimed at readability for the human reader,
 at the expense of some details; therefore, this file should not
 be parsed for machine-readable information.
-@+ © 2024 Unicode®, Inc.
+@+ © 2025 Unicode®, Inc.
 For terms of use and license, see https://www.unicode.org/terms_of_use.html
 @@ 0000 C0 Controls and Basic Latin (Basic Latin) 007F
 @@+
@@ -13191,6 +13191,7 @@
 x (heavy single turned comma quotation mark ornament - 275B)
 ~ 2018 FE00 non-fullwidth form
 ~ 2018 FE01 right-justified fullwidth form
+~ 2018 FE02 Sibe form
 2019 RIGHT SINGLE QUOTATION MARK
 = single comma quotation mark
 * this is the preferred character to use for apostrophe
@@ -13199,6 +13200,7 @@
 x (heavy single comma quotation mark ornament - 275C)
 ~ 2019 FE00 non-fullwidth form
 ~ 2019 FE01 left-justified fullwidth form
+~ 2019 FE02 Sibe form
 201A SINGLE LOW-9 QUOTATION MARK
 = low single comma quotation mark
 * used as opening single quotation mark in some languages
@@ -13214,6 +13216,7 @@
 x (reversed double prime quotation mark - 301D)
 ~ 201C FE00 non-fullwidth form
 ~ 201C FE01 right-justified fullwidth form
+~ 201C FE02 Sibe form
 201D RIGHT DOUBLE QUOTATION MARK
 = double comma quotation mark
 x (quotation mark - 0022)
@@ -13222,6 +13225,7 @@
 x (double prime quotation mark - 301E)
 ~ 201D FE00 non-fullwidth form
 ~ 201D FE01 left-justified fullwidth form
+~ 201D FE02 Sibe form
 201E DOUBLE LOW-9 QUOTATION MARK
 = low double comma quotation mark
 * used as opening double quotation mark in some languages
@@ -28149,7 +28153,7 @@ FBC7 ARABIC LIGATURE RAHMATU ALLAAHI ALAYHIMAA
 FBC8 ARABIC LIGATURE RAHIMAHUM ALLAAHU TAAALAA
 FBC9 ARABIC LIGATURE RAHIMAHUMAA ALLAAH
 FBCA ARABIC LIGATURE RAHIMAHUMAA ALLAAHU TAAALAA
-FBCB ARABIC LIGATURE RADI ALLAHU TAAALAA ANHUM
+FBCB ARABIC LIGATURE RADI ALLAAHU TAAALAA ANHUM
 FBCC ARABIC LIGATURE HAFIZAHU ALLAAH
 FBCD ARABIC LIGATURE HAFIZAHU ALLAAHU TAAALAA
 FBCE ARABIC LIGATURE HAFIZAHUM ALLAAHU TAAALAA
@@ -33806,8 +33810,8 @@ FFFF <not a character>
 10ED2 ARABIC LIGATURE ALAYHIM AS-SALAATU WAS-SALAAM
 10ED3 ARABIC LIGATURE ALAYHIMAA AS-SALAATU WAS-SALAAM
 10ED4 ARABIC LIGATURE QADDASA ALLAAHU SIRRAH
-10ED5 ARABIC LIGATURE QUDDISA SIRRAHUM
-10ED6 ARABIC LIGATURE QUDDISA SIRRAHUMAA
+10ED5 ARABIC LIGATURE QUDDISA SIRRUHUM
+10ED6 ARABIC LIGATURE QUDDISA SIRRUHUMAA
 10ED7 ARABIC LIGATURE QUDDISAT ASRAARUHUM
 10ED8 ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH
 @ Tanween mark used in Old Sindhi

unicodetools/data/ucd/dev/StandardizedVariants.txt

Lines changed: 6 additions & 2 deletions
@@ -1,6 +1,6 @@
 # StandardizedVariants-17.0.0.txt
-# Date: 2024-11-11, 22:24:00 GMT [KW]
-# © 2024 Unicode®, Inc.
+# Date: 2025-01-23, 00:00:00 GMT [KW]
+# © 2025 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
 #
@@ -146,12 +146,16 @@ FF10 FE00; short diagonal stroke form; # FULLWIDTH DIGIT ZERO
 
 2018 FE00; non-fullwidth form; # LEFT SINGLE QUOTATION MARK
 2018 FE01; right-justified fullwidth form; # LEFT SINGLE QUOTATION MARK
+2018 FE02; Sibe form; # LEFT SINGLE QUOTATION MARK
 2019 FE00; non-fullwidth form; # RIGHT SINGLE QUOTATION MARK
 2019 FE01; left-justified fullwidth form; # RIGHT SINGLE QUOTATION MARK
+2019 FE02; Sibe form; # RIGHT SINGLE QUOTATION MARK
 201C FE00; non-fullwidth form; # LEFT DOUBLE QUOTATION MARK
 201C FE01; right-justified fullwidth form; # LEFT DOUBLE QUOTATION MARK
+201C FE02; Sibe form; # LEFT DOUBLE QUOTATION MARK
 201D FE00; non-fullwidth form; # RIGHT DOUBLE QUOTATION MARK
 201D FE01; left-justified fullwidth form; # RIGHT DOUBLE QUOTATION MARK
+201D FE02; Sibe form; # RIGHT DOUBLE QUOTATION MARK
 3001 FE00; corner-justified form; # IDEOGRAPHIC COMMA
 3001 FE01; centered form; # IDEOGRAPHIC COMMA
 3002 FE00; corner-justified form; # IDEOGRAPHIC FULL STOP
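
To check the four added entries programmatically, the following is a minimal Python sketch that parses lines in the layout visible in this diff: a base code point and a variation selector, a description, a third field that is empty in these entries, and a trailing `#` comment. The names `VariationSequence` and `parse_variant_line` are hypothetical and not part of unicodetools; the sample text is copied from the added lines above.

```python
from typing import NamedTuple, Optional


class VariationSequence(NamedTuple):
    base: int         # base character code point
    selector: int     # variation selector code point
    description: str  # e.g. "Sibe form"
    third_field: str  # empty for every entry touched by this commit


def parse_variant_line(line: str) -> Optional[VariationSequence]:
    """Parse one data line; return None for blank or comment-only lines."""
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    fields = [f.strip() for f in data.split(";")]
    base_hex, selector_hex = fields[0].split()
    third = fields[2] if len(fields) > 2 else ""
    return VariationSequence(
        int(base_hex, 16), int(selector_hex, 16), fields[1], third
    )


# Sample: the four lines added by this commit.
sample = """\
2018 FE02; Sibe form; # LEFT SINGLE QUOTATION MARK
2019 FE02; Sibe form; # RIGHT SINGLE QUOTATION MARK
201C FE02; Sibe form; # LEFT DOUBLE QUOTATION MARK
201D FE02; Sibe form; # RIGHT DOUBLE QUOTATION MARK
"""

sequences = [s for s in map(parse_variant_line, sample.splitlines()) if s]
for seq in sequences:
    print(f"U+{seq.base:04X} U+{seq.selector:04X}: {seq.description}")
```

Running the same parser over the full file and filtering on `selector == 0xFE02` would be one way to confirm that the NamesList.txt annotations and StandardizedVariants.txt agree.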

unicodetools/data/ucd/dev/UnicodeData.txt

Lines changed: 3 additions & 3 deletions
@@ -15972,7 +15972,7 @@ FBC7;ARABIC LIGATURE RAHMATU ALLAAHI ALAYHIMAA;So;0;ON;;;;;N;;;;;
 FBC8;ARABIC LIGATURE RAHIMAHUM ALLAAHU TAAALAA;So;0;ON;;;;;N;;;;;
 FBC9;ARABIC LIGATURE RAHIMAHUMAA ALLAAH;So;0;ON;;;;;N;;;;;
 FBCA;ARABIC LIGATURE RAHIMAHUMAA ALLAAHU TAAALAA;So;0;ON;;;;;N;;;;;
-FBCB;ARABIC LIGATURE RADI ALLAHU TAAALAA ANHUM;So;0;ON;;;;;N;;;;;
+FBCB;ARABIC LIGATURE RADI ALLAAHU TAAALAA ANHUM;So;0;ON;;;;;N;;;;;
 FBCC;ARABIC LIGATURE HAFIZAHU ALLAAH;So;0;ON;;;;;N;;;;;
 FBCD;ARABIC LIGATURE HAFIZAHU ALLAAHU TAAALAA;So;0;ON;;;;;N;;;;;
 FBCE;ARABIC LIGATURE HAFIZAHUM ALLAAHU TAAALAA;So;0;ON;;;;;N;;;;;
@@ -19642,8 +19642,8 @@ FFFD;REPLACEMENT CHARACTER;So;0;ON;;;;;N;;;;;
 10ED2;ARABIC LIGATURE ALAYHIM AS-SALAATU WAS-SALAAM;So;0;ON;;;;;N;;;;;
 10ED3;ARABIC LIGATURE ALAYHIMAA AS-SALAATU WAS-SALAAM;So;0;ON;;;;;N;;;;;
 10ED4;ARABIC LIGATURE QADDASA ALLAAHU SIRRAH;So;0;ON;;;;;N;;;;;
-10ED5;ARABIC LIGATURE QUDDISA SIRRAHUM;So;0;ON;;;;;N;;;;;
-10ED6;ARABIC LIGATURE QUDDISA SIRRAHUMAA;So;0;ON;;;;;N;;;;;
+10ED5;ARABIC LIGATURE QUDDISA SIRRUHUM;So;0;ON;;;;;N;;;;;
+10ED6;ARABIC LIGATURE QUDDISA SIRRUHUMAA;So;0;ON;;;;;N;;;;;
 10ED7;ARABIC LIGATURE QUDDISAT ASRAARUHUM;So;0;ON;;;;;N;;;;;
 10ED8;ARABIC LIGATURE NAWWARA ALLAAHU MARQADAH;So;0;ON;;;;;N;;;;;
 10EFA;ARABIC DOUBLE VERTICAL BAR BELOW;Mn;220;NSM;;;;;N;;;;;
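
The three name changes in this file can be confirmed mechanically. The sketch below is an illustration only, assuming the semicolon-separated layout visible in the diff, where the first field is the code point in hexadecimal and the second is the character name. `names_by_code_point` is a hypothetical helper, and the two input lists are copied from the removed and added lines above.

```python
from typing import Dict, Iterable


def names_by_code_point(lines: Iterable[str]) -> Dict[int, str]:
    """Map code point -> name for semicolon-separated UnicodeData-style lines."""
    names = {}
    for line in lines:
        fields = line.split(";")
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        names[int(fields[0], 16)] = fields[1]
    return names


# Removed lines (old names) from this commit.
old_lines = [
    "FBCB;ARABIC LIGATURE RADI ALLAHU TAAALAA ANHUM;So;0;ON;;;;;N;;;;;",
    "10ED5;ARABIC LIGATURE QUDDISA SIRRAHUM;So;0;ON;;;;;N;;;;;",
    "10ED6;ARABIC LIGATURE QUDDISA SIRRAHUMAA;So;0;ON;;;;;N;;;;;",
]
# Added lines (new names) from this commit.
new_lines = [
    "FBCB;ARABIC LIGATURE RADI ALLAAHU TAAALAA ANHUM;So;0;ON;;;;;N;;;;;",
    "10ED5;ARABIC LIGATURE QUDDISA SIRRUHUM;So;0;ON;;;;;N;;;;;",
    "10ED6;ARABIC LIGATURE QUDDISA SIRRUHUMAA;So;0;ON;;;;;N;;;;;",
]

old_names = names_by_code_point(old_lines)
new_names = names_by_code_point(new_lines)

# Code points present in both versions whose name differs.
renames = {
    cp: (old_names[cp], name)
    for cp, name in new_names.items()
    if cp in old_names and old_names[cp] != name
}

for cp in sorted(renames):
    before, after = renames[cp]
    print(f"U+{cp:04X}: {before} -> {after}")
```

Only the name field differs in each pair; the remaining fields are unchanged in this commit.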

unicodetools/data/ucd/dev/VerticalOrientation.txt

Lines changed: 11 additions & 7 deletions
@@ -1,6 +1,6 @@
 # VerticalOrientation-17.0.0.txt
-# Date: 2024-11-16, 02:53:48 GMT
-# © 2024 Unicode®, Inc.
+# Date: 2025-01-23, 17:11:05 GMT [KW]
+# © 2025 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
 #
@@ -1295,7 +1295,11 @@
 3190..3191 ; U # So [2] IDEOGRAPHIC ANNOTATION LINKING MARK..IDEOGRAPHIC ANNOTATION REVERSE MARK
 3192..3195 ; U # No [4] IDEOGRAPHIC ANNOTATION ONE MARK..IDEOGRAPHIC ANNOTATION FOUR MARK
 3196..319F ; U # So [10] IDEOGRAPHIC ANNOTATION TOP MARK..IDEOGRAPHIC ANNOTATION MAN MARK
-31A0..31BF ; U # Lo [32] BOPOMOFO LETTER BU..BOPOMOFO LETTER AH
+31A0..31B3 ; U # Lo [20] BOPOMOFO LETTER BU..BOPOMOFO LETTER INNN
+31B4..31B7 ; Tu # Lo [4] BOPOMOFO FINAL LETTER P..BOPOMOFO FINAL LETTER H
+31B8..31BA ; U # Lo [3] BOPOMOFO LETTER GH..BOPOMOFO LETTER ZY
+31BB ; Tu # Lo BOPOMOFO FINAL LETTER G
+31BC..31BF ; U # Lo [4] BOPOMOFO LETTER GW..BOPOMOFO LETTER AH
 31C0..31E5 ; U # So [38] CJK STROKE T..CJK STROKE SZP
 31E6..31EE ; U # Cn [9] <reserved-31E6>..<reserved-31EE>
 31EF ; U # So IDEOGRAPHIC DESCRIPTION CHARACTER SUBTRACTION
@@ -2235,13 +2239,13 @@ FFFC..FFFD ; U # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHARA
 1B100..1B122 ; U # Lo [35] HENTAIGANA LETTER RE-3..KATAKANA LETTER ARCHAIC WU
 1B123..1B12F ; U # Cn [13] <reserved-1B123>..<reserved-1B12F>
 1B130..1B131 ; U # Cn [2] <reserved-1B130>..<reserved-1B131>
-1B132 ; U # Lo HIRAGANA LETTER SMALL KO
+1B132 ; Tu # Lo HIRAGANA LETTER SMALL KO
 1B133..1B14F ; U # Cn [29] <reserved-1B133>..<reserved-1B14F>
-1B150..1B152 ; U # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
+1B150..1B152 ; Tu # Lo [3] HIRAGANA LETTER SMALL WI..HIRAGANA LETTER SMALL WO
 1B153..1B154 ; U # Cn [2] <reserved-1B153>..<reserved-1B154>
-1B155 ; U # Lo KATAKANA LETTER SMALL KO
+1B155 ; Tu # Lo KATAKANA LETTER SMALL KO
 1B156..1B163 ; U # Cn [14] <reserved-1B156>..<reserved-1B163>
-1B164..1B167 ; U # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
+1B164..1B167 ; Tu # Lo [4] KATAKANA LETTER SMALL WI..KATAKANA LETTER SMALL N
 1B168..1B16F ; U # Cn [8] <reserved-1B168>..<reserved-1B16F>
 1B170..1B2FB ; U # Lo [396] NUSHU CHARACTER-1B170..NUSHU CHARACTER-1B2FB
 1B2FC..1B2FF ; U # Cn [4] <reserved-1B2FC>..<reserved-1B2FF>
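
The split of the single Bopomofo range into five ranges is easier to verify with a lookup than by eye. Below is a minimal Python sketch, assuming the `range ; value # comment` layout visible in this diff. `parse_vo_line` and `vertical_orientation` are hypothetical helpers, not unicodetools code, and the sample covers only a few of the added lines, so code points outside the sample return `None` rather than any file default.

```python
from typing import List, Optional, Tuple

Range = Tuple[int, int, str]  # (first code point, last code point, value)


def parse_vo_line(line: str) -> Optional[Range]:
    """Parse 'XXXX..YYYY ; V # comment' or 'XXXX ; V # comment'."""
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    span, value = (part.strip() for part in data.split(";"))
    first, _, last = span.partition("..")
    # A single code point has no '..', so reuse `first` as the end.
    return int(first, 16), int(last or first, 16), value


def vertical_orientation(cp: int, ranges: List[Range]) -> Optional[str]:
    """Linear lookup; returns None when cp is not covered by `ranges`."""
    for first, last, value in ranges:
        if first <= cp <= last:
            return value
    return None


# Sample: lines added by this commit.
sample = """\
31A0..31B3 ; U # Lo [20] BOPOMOFO LETTER BU..BOPOMOFO LETTER INNN
31B4..31B7 ; Tu # Lo [4] BOPOMOFO FINAL LETTER P..BOPOMOFO FINAL LETTER H
31B8..31BA ; U # Lo [3] BOPOMOFO LETTER GH..BOPOMOFO LETTER ZY
31BB ; Tu # Lo BOPOMOFO FINAL LETTER G
31BC..31BF ; U # Lo [4] BOPOMOFO LETTER GW..BOPOMOFO LETTER AH
1B132 ; Tu # Lo HIRAGANA LETTER SMALL KO
"""

ranges = [r for r in map(parse_vo_line, sample.splitlines()) if r]

for cp in (0x31B3, 0x31B4, 0x31BB, 0x31BF, 0x1B132):
    print(f"U+{cp:04X}: {vertical_orientation(cp, ranges)}")
```

The five Bopomofo ranges together cover the same 32 code points (20 + 4 + 3 + 1 + 4) as the original `31A0..31BF` line, so the commit changes only the values for the final letters.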
