Closed
14 changes: 12 additions & 2 deletions unicodetools/data/ucd/dev/DerivedAge.txt
@@ -1,5 +1,5 @@
# DerivedAge-17.0.0.txt
# Date: 2025-07-30, 23:54:38 GMT
# DerivedAge-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2116,4 +2116,14 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG

# Total code points: 4803

# ================================================

# Age=V18_0

# Newly assigned in Unicode 18.0.0 (September, 2026)

20C3 ; 18.0 # UAE DIRHAM SIGN
Member

DerivedAge (like the Derived files generally) is never hand-edited, only regenerated as described in https://github.com/unicode-org/unicodetools/blob/main/docs/pipeline.md#regenerate-ucd.

This means that it will falsely say that this file is targeted for 17.0 (because we don’t have an 18.0 yet). That is OK; it will get regenerated to 18 once we have an 18 and the merge/regenerate commands get run on that PR.

The CI is supposed to put up a warning saying that this does not actually target 17 (otherwise some people in the public get false hopes looking at the diffs), but it does not know about 18, only about provisional assignment. I will fix the CI.


# Total code points: 1

# EOF
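
The `# Total code points` trailer in a section like the one above can be cross-checked by summing the sizes of the listed ranges. The following is a minimal sketch, assuming the usual `XXXX` or `XXXX..YYYY` prefix followed by `;`; the function name `count_code_points` is hypothetical and is not part of unicodetools. It only counts listed ranges, so it does not account for trailers of the form "applies to N code points not listed here".

```python
import re

# Matches "XXXX" or "XXXX..YYYY" at the start of a UCD data line, up to the ";".
RANGE_RE = re.compile(r"^([0-9A-F]{4,6})(?:\.\.([0-9A-F]{4,6}))?\s*;")


def count_code_points(lines):
    """Sum the sizes of all listed code point ranges, skipping comments and blanks."""
    total = 0
    for line in lines:
        match = RANGE_RE.match(line.strip())
        if match is None:
            continue  # comment, blank line, or anything that is not a data line
        first = int(match.group(1), 16)
        last = int(match.group(2) or match.group(1), 16)
        total += last - first + 1
    return total


# Sample lines taken from the DerivedAge.txt hunk above.
sample = [
    "# Age=V18_0",
    "",
    "20C3 ; 18.0 # UAE DIRHAM SIGN",
]
print(count_code_points(sample))
```

For the Age=V18_0 section this agrees with the `# Total code points: 1` trailer, since U+20C3 is the only entry.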
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/DerivedCoreProperties.txt
@@ -1,5 +1,5 @@
# DerivedCoreProperties-17.0.0.txt
# Date: 2025-07-30, 23:55:08 GMT
# DerivedCoreProperties-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -11787,6 +11787,7 @@ E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELE
208E ; Grapheme_Base # Pe SUBSCRIPT RIGHT PARENTHESIS
2090..209C ; Grapheme_Base # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
20A0..20C1 ; Grapheme_Base # Sc [34] EURO-CURRENCY SIGN..SAUDI RIYAL SIGN
20C3 ; Grapheme_Base # Sc UAE DIRHAM SIGN
2100..2101 ; Grapheme_Base # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
2102 ; Grapheme_Base # L& DOUBLE-STRUCK CAPITAL C
2103..2106 ; Grapheme_Base # So [4] DEGREE CELSIUS..CADA UNA
@@ -13596,6 +13597,6 @@ FF9E..FF9F ; InCB; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HA
E0020..E007F ; InCB; Extend # Cf [96] TAG SPACE..CANCEL TAG
E0100..E01EF ; InCB; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256

# Total code points: 2217
# Total code points: 2218

# EOF
4 changes: 2 additions & 2 deletions unicodetools/data/ucd/dev/DerivedNormalizationProps.txt
Member

Please revert files that only have timestamp changes.
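
One way to spot such files before pushing is to compare the old and new contents while ignoring the header lines that change on every regeneration. This is a minimal sketch, not part of the unicodetools pipeline: the helper name `only_header_changed` is hypothetical, and it assumes that the versioned file name line and the `# Date:` line are the only volatile lines.

```python
import re

# Header lines assumed to change on every regeneration:
# "# <Name>-<major>.<minor>.<update>.txt" and "# Date: ...".
VOLATILE_HEADER = re.compile(r"^# (?:\S+-\d+\.\d+\.\d+\.txt|Date: .*)$")


def only_header_changed(old_text: str, new_text: str) -> bool:
    """Return True if the two versions differ only in volatile header lines."""
    def stable_lines(text: str) -> list[str]:
        return [ln for ln in text.splitlines() if not VOLATILE_HEADER.match(ln)]

    return stable_lines(old_text) == stable_lines(new_text)


# Header lines taken from the DerivedNormalizationProps.txt hunk below.
old = (
    "# DerivedNormalizationProps-17.0.0.txt\n"
    "# Date: 2025-01-27, 18:09:14 GMT\n"
    "# © 2025 Unicode®, Inc.\n"
)
new = (
    "# DerivedNormalizationProps-18.0.0.txt\n"
    "# Date: 2025-08-05, 21:48:38 GMT\n"
    "# © 2025 Unicode®, Inc.\n"
)
print(only_header_changed(old, new))
```

A file for which this returns true carries no property changes and is a candidate for reverting.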

@@ -1,5 +1,5 @@
# DerivedNormalizationProps-17.0.0.txt
# Date: 2025-01-27, 18:09:14 GMT
# DerivedNormalizationProps-18.0.0.txt
# Date: 2025-08-05, 21:48:38 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
5 changes: 3 additions & 2 deletions unicodetools/data/ucd/dev/EastAsianWidth.txt
@@ -1,5 +1,5 @@
# EastAsianWidth-17.0.0.txt
# Date: 2025-07-24, 00:12:54 GMT
# EastAsianWidth-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -978,6 +978,7 @@
20AA..20AB ; N # Sc [2] NEW SHEQEL SIGN..DONG SIGN
20AC ; A # Sc EURO SIGN
20AD..20C1 ; N # Sc [21] KIP SIGN..SAUDI RIYAL SIGN
20C3 ; N # Sc UAE DIRHAM SIGN
20D0..20DC ; N # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
20DD..20E0 ; N # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
20E1 ; N # Mn COMBINING LEFT RIGHT ARROW ABOVE
1 change: 1 addition & 0 deletions unicodetools/data/ucd/dev/Index.txt
@@ -1603,6 +1603,7 @@ direct sum 2295
Directional Format Characters 202A
DIRECTIONAL FORMATTING, POP 202C
DIRECTIONAL ISOLATE, POP 2069
DIRHAM SIGN, UAE 20C3
Member

@Ken-Whistler usually adds things into the Index late in a release. Ken, do you want this here in the pending-for-18 pull request?

DISCONTINUOUS UNDERLINE SYMBOL 2382
discretionary hyphen 00AD
disjunction 2228
8 changes: 5 additions & 3 deletions unicodetools/data/ucd/dev/LineBreak.txt
@@ -1,5 +1,5 @@
# LineBreak-17.0.0.txt
# Date: 2025-07-29, 13:52:18 GMT
# LineBreak-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -971,7 +971,9 @@
20BF ; PR # Sc BITCOIN SIGN
20C0 ; PO # Sc SOM SIGN
20C1 ; PR # Sc SAUDI RIYAL SIGN
20C2..20CF ; PR # Cn [14] <reserved-20C2>..<reserved-20CF>
20C2 ; PR # Cn <reserved-20C2>
20C3 ; AL # Sc UAE DIRHAM SIGN
Member

Hmm, I think I had written something about PO vs. PR in discussion of the Riyal, let me see if I can find that…

Member

Ah yes, in https://github.com/unicode-org/sah/issues/619, published in L2/25-087:

In particular, Line_Break=PR, not PO. Note that the difference between PR and PO is nowadays only relevant in an East Asian context; Section 3 of document L2/05-292, adopted by decision 105-C37, has useful background here. The point of picking lb=PR or lb=PO is to allow a break, as in お白湯は÷¥500頂戴しております。 or 上記税込価格にサービス料として10%÷を頂戴いたします. Since the currency sign is meant to be prefix in LTR text, it would likely be prefix in CJK text as currency signs usually are (with exceptions such as ¢), so that we would expect お白湯は÷SAR sign13頂戴しております。 and the character should be lb=PR like ¥.

So the question is whether it would be prefix in a CJK context.

Contributor Author

Comparing Saudi (PR) and UAE websites, it looks like PR is the appropriate value.
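
The practical effect of choosing PR, PO, or AL can be illustrated with a small pair table. This is a simplified sketch written from memory of the UAX #14 rules LB23, LB23a, LB24, LB25, and LB28, restricted to five classes and ignoring every other rule and context; the table `NO_BREAK` and the function `break_allowed` are hypothetical names, and the specification remains the authority.

```python
# Pairs (before, after) between which a break is forbidden, for the classes
# ID, PR, PO, NU, and AL only. Any pair not listed falls through to the
# default "break allowed" rule (LB31).
NO_BREAK = {
    ("AL", "NU"), ("NU", "AL"),                  # LB23
    ("PR", "ID"), ("ID", "PO"),                  # LB23a
    ("PR", "AL"), ("PO", "AL"),                  # LB24
    ("AL", "PR"), ("AL", "PO"),                  # LB24
    ("PR", "NU"), ("PO", "NU"),                  # LB25
    ("NU", "PR"), ("NU", "PO"), ("NU", "NU"),    # LB25
    ("AL", "AL"),                                # LB28
}


def break_allowed(before: str, after: str) -> bool:
    """Return True if a break is allowed between two adjacent line break classes."""
    return (before, after) not in NO_BREAK


# Compare the candidate classes for a currency sign next to an ideograph (ID).
for sign_class in ("PR", "PO", "AL"):
    print(
        sign_class,
        "break before sign after ID:", break_allowed("ID", sign_class),
        "| break after sign before ID:", break_allowed(sign_class, "ID"),
    )
```

Within this subset, PR allows a break between a preceding ideograph and the sign but keeps the sign attached to a following ideograph, PO does the opposite, and AL allows a break on both sides. All three keep the sign attached to adjacent digits.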

20C4..20CF ; PR # Cn [12] <reserved-20C4>..<reserved-20CF>
20D0..20DC ; CM # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
20DD..20E0 ; CM # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
20E1 ; CM # Mn COMBINING LEFT RIGHT ARROW ABOVE
2 changes: 2 additions & 0 deletions unicodetools/data/ucd/dev/NamesList.txt
@@ -13711,6 +13711,8 @@
* Kyrgyzstan
20C1 SAUDI RIYAL SIGN
* Saudi Arabia
20C3 UAE DIRHAM SIGN
* United Arab Emirates
Comment on lines +13714 to +13715
Member

Ken generates the NamesList from other sources, and likes to add change comments at the top. I assume that he would prefer not to add this in this kind of PR.

Member

Yes, NamesList should not be touched by these PRs.

Contributor

I agree with Robin. Don't touch NamesList.txt directly. (And don't deal with Index.txt, either -- updating that is outside the context of establishing the properties.)

@@ 20D0 Combining Diacritical Marks for Symbols 20FF
@ Combining diacritical marks for symbols
20D0 COMBINING LEFT HARPOON ABOVE
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/Scripts.txt
@@ -1,5 +1,5 @@
# Scripts-17.0.0.txt
# Date: 2025-07-24, 13:28:55 GMT
# Scripts-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -155,6 +155,7 @@
208D ; Common # Ps SUBSCRIPT LEFT PARENTHESIS
208E ; Common # Pe SUBSCRIPT RIGHT PARENTHESIS
20A0..20C1 ; Common # Sc [34] EURO-CURRENCY SIGN..SAUDI RIYAL SIGN
20C3 ; Common # Sc UAE DIRHAM SIGN
2100..2101 ; Common # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
2102 ; Common # L& DOUBLE-STRUCK CAPITAL C
2103..2106 ; Common # So [4] DEGREE CELSIUS..CADA UNA
@@ -638,7 +639,7 @@ FFFC..FFFD ; Common # So [2] OBJECT REPLACEMENT CHARACTER..REPLACEMENT CHAR
E0001 ; Common # Cf LANGUAGE TAG
E0020..E007F ; Common # Cf [96] TAG SPACE..CANCEL TAG

# Total code points: 9123
# Total code points: 9124

# ================================================

1 change: 1 addition & 0 deletions unicodetools/data/ucd/dev/UnicodeData.txt
@@ -7576,6 +7576,7 @@
20BF;BITCOIN SIGN;Sc;0;ET;;;;;N;;;;;
20C0;SOM SIGN;Sc;0;ET;;;;;N;;;;;
20C1;SAUDI RIYAL SIGN;Sc;0;ET;;;;;N;;;;;
20C3;UAE DIRHAM SIGN;Sc;0;ET;;;;;N;;;;;
20D0;COMBINING LEFT HARPOON ABOVE;Mn;230;NSM;;;;;N;NON-SPACING LEFT HARPOON ABOVE;;;;
20D1;COMBINING RIGHT HARPOON ABOVE;Mn;230;NSM;;;;;N;NON-SPACING RIGHT HARPOON ABOVE;;;;
20D2;COMBINING LONG VERTICAL LINE OVERLAY;Mn;1;NSM;;;;;N;NON-SPACING LONG VERTICAL BAR OVERLAY;;;;
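
Each UnicodeData.txt row above is a semicolon-separated record of 15 fields, as described in UAX #44. The sketch below splits the new U+20C3 row into named fields. The dictionary keys are descriptive labels chosen for this example rather than official identifiers, and `parse_unicode_data_line` is a hypothetical helper.

```python
# Descriptive labels for the 15 UnicodeData.txt fields, in file order.
FIELD_NAMES = [
    "code_point",
    "name",
    "general_category",
    "canonical_combining_class",
    "bidi_class",
    "decomposition",
    "numeric_decimal",
    "numeric_digit",
    "numeric_value",
    "bidi_mirrored",
    "unicode_1_name",
    "iso_comment",
    "simple_uppercase_mapping",
    "simple_lowercase_mapping",
    "simple_titlecase_mapping",
]


def parse_unicode_data_line(line: str) -> dict:
    """Split one UnicodeData.txt row into a dict keyed by field label."""
    fields = line.rstrip("\n").split(";")
    if len(fields) != len(FIELD_NAMES):
        raise ValueError(f"expected {len(FIELD_NAMES)} fields, got {len(fields)}")
    return dict(zip(FIELD_NAMES, fields))


record = parse_unicode_data_line("20C3;UAE DIRHAM SIGN;Sc;0;ET;;;;;N;;;;;")
print(record["name"], record["general_category"], record["bidi_class"])
```

The values Sc, 0, and ET in this row are what the derived files in this diff (DerivedGeneralCategory, DerivedCombiningClass, DerivedBidiClass) repeat for U+20C3.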
5 changes: 3 additions & 2 deletions unicodetools/data/ucd/dev/VerticalOrientation.txt
@@ -1,5 +1,5 @@
# VerticalOrientation-17.0.0.txt
# Date: 2025-07-24, 00:13:33 GMT
# VerticalOrientation-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -904,6 +904,7 @@
208E ; R # Pe SUBSCRIPT RIGHT PARENTHESIS
2090..209C ; R # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
20A0..20C1 ; R # Sc [34] EURO-CURRENCY SIGN..SAUDI RIYAL SIGN
20C3 ; R # Sc UAE DIRHAM SIGN
20D0..20DC ; R # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
20DD..20E0 ; U # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
20E1 ; R # Mn COMBINING LEFT RIGHT ARROW ABOVE
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/extracted/DerivedBidiClass.txt
@@ -1,5 +1,5 @@
# DerivedBidiClass-17.0.0.txt
# Date: 2025-07-24, 00:12:44 GMT
# DerivedBidiClass-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1405,6 +1405,7 @@ FF0D ; ES # Pd FULLWIDTH HYPHEN-MINUS
17DB ; ET # Sc KHMER CURRENCY SYMBOL RIEL
2030..2034 ; ET # Po [5] PER MILLE SIGN..TRIPLE PRIME
20A0..20C1 ; ET # Sc [34] EURO-CURRENCY SIGN..SAUDI RIYAL SIGN
20C3 ; ET # Sc UAE DIRHAM SIGN
212E ; ET # So ESTIMATED SYMBOL
2213 ; ET # Sm MINUS-OR-PLUS SIGN
A838 ; ET # Sc NORTH INDIC RUPEE MARK
@@ -1421,7 +1422,7 @@ FFE5..FFE6 ; ET # Sc [2] FULLWIDTH YEN SIGN..FULLWIDTH WON SIGN
1E2FF ; ET # Sc WANCHO NGUN SIGN

# The above property value applies to 14 code points not listed here.
# Total code points: 92
# Total code points: 93

# ================================================

9 changes: 5 additions & 4 deletions unicodetools/data/ucd/dev/extracted/DerivedCombiningClass.txt
@@ -1,5 +1,5 @@
# DerivedCombiningClass-17.0.0.txt
# Date: 2025-07-24, 00:12:46 GMT
# DerivedCombiningClass-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -753,6 +753,7 @@
208E ; 0 # Pe SUBSCRIPT RIGHT PARENTHESIS
2090..209C ; 0 # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
20A0..20C1 ; 0 # Sc [34] EURO-CURRENCY SIGN..SAUDI RIYAL SIGN
20C3 ; 0 # Sc UAE DIRHAM SIGN
20DD..20E0 ; 0 # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
20E2..20E4 ; 0 # Me [3] COMBINING ENCLOSING SCREEN..COMBINING ENCLOSING UPWARD POINTING TRIANGLE
2100..2101 ; 0 # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
@@ -2089,8 +2090,8 @@ E0100..E01EF ; 0 # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
F0000..FFFFD ; 0 # Co [65534] <private-use-F0000>..<private-use-FFFFD>
100000..10FFFD; 0 # Co [65534] <private-use-100000>..<private-use-10FFFD>

# The above property value applies to 816778 code points not listed here.
# Total code points: 1113144
# The above property value applies to 816777 code points not listed here.
# Total code points: 1113145

# ================================================

9 changes: 5 additions & 4 deletions unicodetools/data/ucd/dev/extracted/DerivedEastAsianWidth.txt
@@ -1,5 +1,5 @@
# DerivedEastAsianWidth-17.0.0.txt
# Date: 2025-07-24, 13:28:21 GMT
# DerivedEastAsianWidth-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -842,6 +842,7 @@
20A0..20A8 ; N # Sc [9] EURO-CURRENCY SIGN..RUPEE SIGN
20AA..20AB ; N # Sc [2] NEW SHEQEL SIGN..DONG SIGN
20AD..20C1 ; N # Sc [21] KIP SIGN..SAUDI RIYAL SIGN
20C3 ; N # Sc UAE DIRHAM SIGN
20D0..20DC ; N # Mn [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
20DD..20E0 ; N # Me [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
20E1 ; N # Mn COMBINING LEFT RIGHT ARROW ABOVE
@@ -2136,8 +2137,8 @@ FFFC ; N # So OBJECT REPLACEMENT CHARACTER
E0001 ; N # Cf LANGUAGE TAG
E0020..E007F ; N # Cf [96] TAG SPACE..CANCEL TAG

# The above property value applies to 760612 code points not listed here.
# Total code points: 792263
# The above property value applies to 760611 code points not listed here.
# Total code points: 792264

# ================================================

@@ -1,5 +1,5 @@
# DerivedGeneralCategory-17.0.0.txt
# Date: 2025-07-24, 00:12:50 GMT
# DerivedGeneralCategory-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -4080,6 +4080,7 @@ FFE9..FFEC ; Sm # [4] HALFWIDTH LEFTWARDS ARROW..HALFWIDTH DOWNWARDS ARROW
0E3F ; Sc # THAI CURRENCY SYMBOL BAHT
17DB ; Sc # KHMER CURRENCY SYMBOL RIEL
20A0..20C1 ; Sc # [34] EURO-CURRENCY SIGN..SAUDI RIYAL SIGN
20C3 ; Sc # UAE DIRHAM SIGN
A838 ; Sc # NORTH INDIC RUPEE MARK
FDFC ; Sc # RIAL SIGN
FE69 ; Sc # SMALL DOLLAR SIGN
@@ -4090,7 +4091,7 @@ FFE5..FFE6 ; Sc # [2] FULLWIDTH YEN SIGN..FULLWIDTH WON SIGN
1E2FF ; Sc # WANCHO NGUN SIGN
1ECB0 ; Sc # INDIC SIYAQ RUPEE MARK

# Total code points: 64
# Total code points: 65

# ================================================

5 changes: 3 additions & 2 deletions unicodetools/data/ucd/dev/extracted/DerivedLineBreak.txt
@@ -1,5 +1,5 @@
# DerivedLineBreak-17.0.0.txt
# Date: 2025-07-29, 13:52:13 GMT
# DerivedLineBreak-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -926,6 +926,7 @@ ABF0..ABF9 ; NU # Nd [10] MEETEI MAYEK DIGIT ZERO..MEETEI MAYEK DIGIT NINE
2085..2089 ; AL # No [5] SUBSCRIPT FIVE..SUBSCRIPT NINE
208A..208C ; AL # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
2090..209C ; AL # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
20C3 ; AL # Sc UAE DIRHAM SIGN
2100..2101 ; AL # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
2102 ; AL # L& DOUBLE-STRUCK CAPITAL C
2104 ; AL # So CENTRE LINE SYMBOL
7 changes: 4 additions & 3 deletions unicodetools/data/ucd/dev/extracted/DerivedName.txt
@@ -1,5 +1,5 @@
# DerivedName-17.0.0.txt
# Date: 2025-07-30, 23:55:12 GMT
# DerivedName-18.0.0.txt
# Date: 2025-08-05, 22:20:00 GMT
# © 2025 Unicode®, Inc.
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -7551,6 +7551,7 @@
20BF ; BITCOIN SIGN
20C0 ; SOM SIGN
20C1 ; SAUDI RIYAL SIGN
20C3 ; UAE DIRHAM SIGN
20D0 ; COMBINING LEFT HARPOON ABOVE
20D1 ; COMBINING RIGHT HARPOON ABOVE
20D2 ; COMBINING LONG VERTICAL LINE OVERLAY
@@ -45823,6 +45824,6 @@ E01ED ; VARIATION SELECTOR-254
E01EE ; VARIATION SELECTOR-255
E01EF ; VARIATION SELECTOR-256

# Total code points: 159801
# Total code points: 159802

# EOF
@@ -0,0 +1,10 @@
Ignoring Name Age LineBreak:

# U+20C3 is the UAE Dirham sign, similar to other currency symbols:
# TODO: current comparison is probably wrong because the line break properties differ:
# PR for SAUDI RIYAL SIGN
# AL for UAE DIRHAM SIGN
Propertywise [\N{SAUDI RIYAL SIGN-20C1}
\N{UAE DIRHAM SIGN-20C3}] AreAlike

end Ignoring;
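
The intent of the test above, that two characters agree on every property except those listed after `Ignoring`, can be sketched in a few lines. This is an illustration of the idea only, not the invariants-test implementation: `differing_properties` is a hypothetical helper, the property values are the ones visible in this diff, and the Age of U+20C1 is not shown here and is assumed to be 17.0.

```python
def differing_properties(a: dict, b: dict, ignoring=()) -> list:
    """Return the sorted names of properties on which a and b disagree."""
    ignored = set(ignoring)
    return sorted(
        name
        for name in a.keys() | b.keys()
        if name not in ignored and a.get(name) != b.get(name)
    )


saudi_riyal = {
    "Name": "SAUDI RIYAL SIGN",
    "Age": "17.0",  # assumption: not shown in this diff
    "General_Category": "Sc",
    "Bidi_Class": "ET",
    "East_Asian_Width": "N",
    "Script": "Common",
    "Vertical_Orientation": "R",
    "Line_Break": "PR",
}
uae_dirham = {
    "Name": "UAE DIRHAM SIGN",
    "Age": "18.0",
    "General_Category": "Sc",
    "Bidi_Class": "ET",
    "East_Asian_Width": "N",
    "Script": "Common",
    "Vertical_Orientation": "R",
    "Line_Break": "AL",  # value in this revision of the PR
}

print(differing_properties(saudi_riyal, uae_dirham, ignoring=("Name", "Age")))
print(differing_properties(saudi_riyal, uae_dirham, ignoring=("Name", "Age", "Line_Break")))
```

With Line_Break left out of the ignore list, the only remaining difference is Line_Break itself, which is the discrepancy the TODO points at. If U+20C3 is changed to PR as discussed in the review, LineBreak would no longer need to be ignored.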