Skip to content

Commit 2fb2e8b

Browse files
authored
12 IPA diacritics, 11 above and 1 below (#742)
* UnicodeData.txt lines from proposal * LineBreak.txt as described in the proposal * Inherited * diacritics are diacritics. * Regenerate UCD * en-GB-oxendict * Regenerate UCD * Regenerate UCD
1 parent 8f51864 commit 2fb2e8b

19 files changed

+121
-47
lines changed

unicodetools/data/ucd/dev/DerivedAge.txt

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedAge-17.0.0.txt
2-
# Date: 2024-11-13, 21:07:25 GMT
2+
# Date: 2024-11-13, 21:22:30 GMT
33
# © 2024 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2071,7 +2071,8 @@ A7DA..A7DC ; 16.0 # [3] LATIN CAPITAL LETTER LAMBDA..LATIN CAPITAL LETTER L
20712071
0C5C ; 17.0 # TELUGU ARCHAIC SHRII
20722072
0CDC ; 17.0 # KANNADA ARCHAIC SHRII
20732073
1ACF..1ADD ; 17.0 # [15] COMBINING DOUBLE CARON..COMBINING DOT-AND-RING BELOW
2074+
1AE0..1AEB ; 17.0 # [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
20742075

2075-
# Total code points: 21
2076+
# Total code points: 33
20762077

20772078
# EOF

unicodetools/data/ucd/dev/DerivedCoreProperties.txt

Lines changed: 11 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedCoreProperties-17.0.0.txt
2-
# Date: 2024-11-13, 21:07:43 GMT
2+
# Date: 2024-11-13, 21:22:48 GMT
33
# © 2024 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -3196,6 +3196,7 @@ FF41..FF5A ; Cased # L& [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN
31963196
1AB0..1ABD ; Case_Ignorable # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
31973197
1ABE ; Case_Ignorable # Me COMBINING PARENTHESES OVERLAY
31983198
1ABF..1ADD ; Case_Ignorable # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
3199+
1AE0..1AEB ; Case_Ignorable # Mn [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
31993200
1B00..1B03 ; Case_Ignorable # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
32003201
1B34 ; Case_Ignorable # Mn BALINESE SIGN REREKAN
32013202
1B36..1B3A ; Case_Ignorable # Mn [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
@@ -3506,7 +3507,7 @@ E0001 ; Case_Ignorable # Cf LANGUAGE TAG
35063507
E0020..E007F ; Case_Ignorable # Cf [96] TAG SPACE..CANCEL TAG
35073508
E0100..E01EF ; Case_Ignorable # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
35083509

3509-
# Total code points: 2766
3510+
# Total code points: 2778
35103511

35113512
# ================================================
35123513

@@ -7461,6 +7462,7 @@ FFDA..FFDC ; ID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL
74617462
1AA7 ; ID_Continue # Lm TAI THAM SIGN MAI YAMOK
74627463
1AB0..1ABD ; ID_Continue # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
74637464
1ABF..1ADD ; ID_Continue # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
7465+
1AE0..1AEB ; ID_Continue # Mn [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
74647466
1B00..1B03 ; ID_Continue # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
74657467
1B04 ; ID_Continue # Mc BALINESE SIGN BISAH
74667468
1B05..1B33 ; ID_Continue # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
@@ -8373,7 +8375,7 @@ FFDA..FFDC ; ID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HAN
83738375
31350..323AF ; ID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
83748376
E0100..E01EF ; ID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
83758377

8376-
# Total code points: 144562
8378+
# Total code points: 144574
83778379

83788380
# ================================================
83798381

@@ -9645,6 +9647,7 @@ FFDA..FFDC ; XID_Start # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGU
96459647
1AA7 ; XID_Continue # Lm TAI THAM SIGN MAI YAMOK
96469648
1AB0..1ABD ; XID_Continue # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
96479649
1ABF..1ADD ; XID_Continue # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
9650+
1AE0..1AEB ; XID_Continue # Mn [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
96489651
1B00..1B03 ; XID_Continue # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
96499652
1B04 ; XID_Continue # Mc BALINESE SIGN BISAH
96509653
1B05..1B33 ; XID_Continue # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA
@@ -10562,7 +10565,7 @@ FFDA..FFDC ; XID_Continue # Lo [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HA
1056210565
31350..323AF ; XID_Continue # Lo [4192] CJK UNIFIED IDEOGRAPH-31350..CJK UNIFIED IDEOGRAPH-323AF
1056310566
E0100..E01EF ; XID_Continue # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1056410567

10565-
# Total code points: 144543
10568+
# Total code points: 144555
1056610569

1056710570
# ================================================
1056810571

@@ -10784,6 +10787,7 @@ E01F0..E0FFF ; Default_Ignorable_Code_Point # Cn [3600] <reserved-E01F0>..<rese
1078410787
1AB0..1ABD ; Grapheme_Extend # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1078510788
1ABE ; Grapheme_Extend # Me COMBINING PARENTHESES OVERLAY
1078610789
1ABF..1ADD ; Grapheme_Extend # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
10790+
1AE0..1AEB ; Grapheme_Extend # Mn [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
1078710791
1B00..1B03 ; Grapheme_Extend # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1078810792
1B34 ; Grapheme_Extend # Mn BALINESE SIGN REREKAN
1078910793
1B35 ; Grapheme_Extend # Mc BALINESE VOWEL SIGN TEDUNG
@@ -11034,7 +11038,7 @@ FF9E..FF9F ; Grapheme_Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK.
1103411038
E0020..E007F ; Grapheme_Extend # Cf [96] TAG SPACE..CANCEL TAG
1103511039
E0100..E01EF ; Grapheme_Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1103611040

11037-
# Total code points: 2210
11041+
# Total code points: 2222
1103811042

1103911043
# ================================================
1104011044

@@ -13113,6 +13117,7 @@ ABED ; Grapheme_Link # Mn MEETEI MAYEK APUN IYEK
1311313117
1AB0..1ABD ; InCB; Extend # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
1311413118
1ABE ; InCB; Extend # Me COMBINING PARENTHESES OVERLAY
1311513119
1ABF..1ADD ; InCB; Extend # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
13120+
1AE0..1AEB ; InCB; Extend # Mn [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
1311613121
1B00..1B03 ; InCB; Extend # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
1311713122
1B34 ; InCB; Extend # Mn BALINESE SIGN REREKAN
1311813123
1B35 ; InCB; Extend # Mc BALINESE VOWEL SIGN TEDUNG
@@ -13364,6 +13369,6 @@ FF9E..FF9F ; InCB; Extend # Lm [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HA
1336413369
E0020..E007F ; InCB; Extend # Cf [96] TAG SPACE..CANCEL TAG
1336513370
E0100..E01EF ; InCB; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1336613371

13367-
# Total code points: 2209
13372+
# Total code points: 2221
1336813373

1336913374
# EOF

unicodetools/data/ucd/dev/EastAsianWidth.txt

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# EastAsianWidth-17.0.0.txt
2-
# Date: 2024-11-13, 21:07:48 GMT
2+
# Date: 2024-11-13, 21:22:52 GMT
33
# © 2024 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -808,6 +808,7 @@
808808
1AB0..1ABD ; N # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
809809
1ABE ; N # Me COMBINING PARENTHESES OVERLAY
810810
1ABF..1ADD ; N # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
811+
1AE0..1AEB ; N # Mn [12] COMBINING LEFT TACK ABOVE..COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
811812
1B00..1B03 ; N # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
812813
1B04 ; N # Mc BALINESE SIGN BISAH
813814
1B05..1B33 ; N # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA

unicodetools/data/ucd/dev/LineBreak.txt

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# LineBreak-17.0.0.txt
2-
# Date: 2024-11-13, 21:07:49 GMT
2+
# Date: 2024-11-13, 21:22:53 GMT
33
# © 2024 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -778,6 +778,8 @@
778778
1AB0..1ABD ; CM # Mn [14] COMBINING DOUBLED CIRCUMFLEX ACCENT..COMBINING PARENTHESES BELOW
779779
1ABE ; CM # Me COMBINING PARENTHESES OVERLAY
780780
1ABF..1ADD ; CM # Mn [31] COMBINING LATIN SMALL LETTER W BELOW..COMBINING DOT-AND-RING BELOW
781+
1AE0..1AEA ; CM # Mn [11] COMBINING LEFT TACK ABOVE..COMBINING UPWARDS ARROW ABOVE
782+
1AEB ; GL # Mn COMBINING DOUBLE RIGHTWARDS ARROW ABOVE
781783
1B00..1B03 ; CM # Mn [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
782784
1B04 ; CM # Mc BALINESE SIGN BISAH
783785
1B05..1B33 ; AK # Lo [47] BALINESE LETTER AKARA..BALINESE LETTER HA

0 commit comments

Comments
 (0)