Commit 2cfd5d1

generated confusables data for high-priority Tibetan
1 parent bb9a5eb commit 2cfd5d1

4 files changed: 34 additions & 10 deletions

unicodetools/data/security/dev/confusables.txt

Lines changed: 6 additions & 2 deletions
@@ -1,5 +1,5 @@
 # confusables.txt
-# Date: 2025-05-07, 04:01:07 GMT
+# Date: 2025-05-08, 21:27:26 GMT
 # © 2025 Unicode®, Inc.
 # Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
 # For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -6034,6 +6034,10 @@ FE19 ; 2D57 ; MA #* ( ︙ → ⵗ ) PRESENTATION FORM FOR VERTICAL HORIZONTAL EL

 0F79 ; 0FB3 0F71 0F80 ; MA # ( ཹ → ླཱྀ ) TIBETAN VOWEL SIGN VOCALIC LL → TIBETAN SUBJOINED LETTER LA, TIBETAN VOWEL SIGN AA, TIBETAN VOWEL SIGN REVERSED I #

+0F7B ; 0F7A 0F7A ; MA # ( ཻ → ེེ ) TIBETAN VOWEL SIGN EE → TIBETAN VOWEL SIGN E, TIBETAN VOWEL SIGN E #
+
+0F7D ; 0F7C 0F7C ; MA # ( ཽ → ོོ ) TIBETAN VOWEL SIGN OO → TIBETAN VOWEL SIGN O, TIBETAN VOWEL SIGN O #
+
 11CB2 ; 11CAA ; MA # ( 𑲲 → 𑲪 ) MARCHEN VOWEL SIGN U → MARCHEN SUBJOINED LETTER RA #

 1734 ; 1715 ; MA # ( ᜴ → ᜕ ) HANUNOO SIGN PAMUDPOD → TAGALOG SIGN PAMUDPOD #
@@ -9810,5 +9814,5 @@ A7CF ; A7CE ; MA # ( ꟏ → ꟎ ) LATIN SMALL LETTER PHARYNGEAL VOICED FRICATIV

 6138 ; 2B73F ; MA # ( 愸 → 𫜿 ) CJK UNIFIED IDEOGRAPH-6138 → CJK UNIFIED IDEOGRAPH-2B73F #

-# total: 6445
+# total: 6447

unicodetools/data/security/dev/confusablesSummary.txt

Lines changed: 13 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusablesSummary.txt
2-
# Date: 2025-05-07, 04:01:07 GMT
2+
# Date: 2025-05-08, 21:27:26 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -906,12 +906,12 @@
906906
(‎ ... ‎) 002E 002E 002E FULL STOP, FULL STOP, FULL STOP
907907
← (‎ … ‎) 2026 HORIZONTAL ELLIPSIS
908908

909-
# / ノ 丿 Ⳇ 〳 ᜵ ⁁ ⁄ ∕ ╱ ⟋ ⧸ ㇓ 𝈺 ⼃
909+
# / ノ Ⳇ 〳 丿 ᜵ ⁁ ⁄ ∕ ╱ ⟋ ⧸ ㇓ 𝈺 ⼃
910910
(‎ / ‎) 002F SOLIDUS
911911
← (‎ ノ ‎) 30CE KATAKANA LETTER NO # →⼃→
912-
← (‎ 丿 ‎) 4E3F CJK UNIFIED IDEOGRAPH-4E3F # →⼃→
913912
← (‎ Ⳇ ‎) 2CC6 COPTIC CAPITAL LETTER OLD COPTIC ESH
914913
← (‎ 〳 ‎) 3033 VERTICAL KANA REPEAT MARK UPPER HALF
914+
← (‎ 丿 ‎) 4E3F CJK UNIFIED IDEOGRAPH-4E3F # →⼃→
915915
← (‎ ᜵ ‎) 1735 PHILIPPINE SINGLE PUNCTUATION
916916
← (‎ ⁁ ‎) 2041 CARET INSERTION POINT
917917
← (‎ ⁄ ‎) 2044 FRACTION SLASH
@@ -9271,6 +9271,14 @@
92719271
(‎ ཹ ‎) 0F79 TIBETAN VOWEL SIGN VOCALIC LL
92729272
← (‎ ླཱྀ ‎) 0FB3 0F71 0F80 TIBETAN SUBJOINED LETTER LA, TIBETAN VOWEL SIGN AA, TIBETAN VOWEL SIGN REVERSED I
92739273

9274+
# ེེ ཻ
9275+
(‎ ེེ ‎) 0F7A 0F7A TIBETAN VOWEL SIGN E, TIBETAN VOWEL SIGN E
9276+
← (‎ ཻ ‎) 0F7B TIBETAN VOWEL SIGN EE
9277+
9278+
# ོོ ཽ
9279+
(‎ ོོ ‎) 0F7C 0F7C TIBETAN VOWEL SIGN O, TIBETAN VOWEL SIGN O
9280+
← (‎ ཽ ‎) 0F7D TIBETAN VOWEL SIGN OO
9281+
92749282
# 卐 ࿕
92759283
(‎ ࿕ ‎) 0FD5 RIGHT-FACING SVASTI SIGN
92769284
← (‎ 卐 ‎) 5350 CJK UNIFIED IDEOGRAPH-5350
@@ -10600,7 +10608,7 @@
1060010608
← (‎ ᆔ ‎) 1194 HANGUL JUNGSEONG YU-I
1060110609
← (‎ ㆌ ‎) 318C HANGUL LETTER YU-I # →ᆔ→
1060210610

10603-
# ー 一 ᅳ — ― ─ ━ ㇐ ꟷ ㅡ ⼀ -
10611+
# ー ᅳ 一 — ― ─ ━ ㇐ ꟷ ㅡ ⼀ -
1060410612
(‎ ᅳ ‎) 1173 HANGUL JUNGSEONG EU
1060510613
← (‎ ー ‎) 30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK # →一→→—→→ㅡ→
1060610614
← (‎ 一 ‎) 4E00 CJK UNIFIED IDEOGRAPH-4E00 # →—→→ㅡ→
@@ -17489,5 +17497,5 @@
1748917497
(‎ 𪘀 ‎) 2A600 CJK UNIFIED IDEOGRAPH-2A600
1749017498
← (‎ 𪘀 ‎) 2FA1D CJK COMPATIBILITY IDEOGRAPH-2FA1D
1749117499

17492-
# total : 7408
17500+
# total : 7410
1749317501

unicodetools/data/security/dev/data/confusablesSummaryIdentifier.txt

Lines changed: 10 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusablesSummaryIdentifier.txt
2-
# Date: 2025-05-07, 21:39:36 GMT
2+
# Date: 2025-05-08, 21:27:26 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1144,6 +1144,14 @@
11441144
(‎ ๋ ‎) 0E4B THAI CHARACTER MAI CHATTAWA
11451145
← (‎ ໋ ‎) 0ECB LAO TONE MAI CATAWA
11461146

1147+
# ེེ ཻ
1148+
(‎ ེེ ‎) 0F7A 0F7A TIBETAN VOWEL SIGN E, TIBETAN VOWEL SIGN E
1149+
← (‎ ཻ ‎) 0F7B TIBETAN VOWEL SIGN EE
1150+
1151+
# ོོ ཽ
1152+
(‎ ོོ ‎) 0F7C 0F7C TIBETAN VOWEL SIGN O, TIBETAN VOWEL SIGN O
1153+
← (‎ ཽ ‎) 0F7D TIBETAN VOWEL SIGN OO
1154+
11471155
# ဂာ က
11481156
(‎ က ‎) 1000 MYANMAR LETTER KA
11491157
← (‎ ဂာ ‎) 1002 102C MYANMAR LETTER GA, MYANMAR VOWEL SIGN AA
@@ -1173,5 +1181,5 @@
11731181
(‎ へ ‎) 3078 HIRAGANA LETTER HE
11741182
← (‎ ヘ ‎) 30D8 KATAKANA LETTER HE
11751183

1176-
# total : 419
1184+
# total : 421
11771185

unicodetools/data/security/dev/data/source/formatted-source.txt

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# formatted-source.txt
2-
# Date: 2025-05-07, 04:01:06 GMT
2+
# Date: 2025-05-08, 21:27:25 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2336,6 +2336,10 @@
23362336

23372337
0F68 0F7C 0F7E ; 0F00 # ( ཨོཾ ~ ༀ ) TIBETAN LETTER A, TIBETAN VOWEL SIGN O, TIBETAN SIGN RJES SU NGA RO ~ TIBETAN SYLLABLE OM
23382338

2339+
0F7A 0F7A ; 0F7B # ( ེེ ~ ཻ ) TIBETAN VOWEL SIGN E, TIBETAN VOWEL SIGN E ~ TIBETAN VOWEL SIGN EE
2340+
2341+
0F7C 0F7C ; 0F7D # ( ོོ ~ ཽ ) TIBETAN VOWEL SIGN O, TIBETAN VOWEL SIGN O ~ TIBETAN VOWEL SIGN OO
2342+
23392343
1002 102C ; 1000 # ( ဂာ ~ က ) MYANMAR LETTER GA, MYANMAR VOWEL SIGN AA ~ MYANMAR LETTER KA
23402344

23412345
1002 103E ; 1081 # ( ဂှ ~ ႁ ) MYANMAR LETTER GA, MYANMAR CONSONANT SIGN MEDIAL HA ~ MYANMAR LETTER SHAN HA

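For readers who want to check the two new Tibetan mappings, here is a minimal sketch of reading a confusables.txt data line. It assumes only the three-field layout visible in the entries added by this commit (source ; target ; type, followed by a # comment). The names Confusable and parse_confusables_line are hypothetical and are not part of unicodetools.

```python
from typing import NamedTuple, Optional


class Confusable(NamedTuple):
    source: str  # character(s) being mapped
    target: str  # sequence the source is mapped to
    kind: str    # type field; "MA" in every entry shown in this commit


def _decode(hex_codepoints: str) -> str:
    """Turn space-separated hex code points (e.g. '0F7A 0F7A') into a string."""
    return "".join(chr(int(cp, 16)) for cp in hex_codepoints.split())


def parse_confusables_line(line: str) -> Optional[Confusable]:
    """Parse one line; return None for blank lines and pure comments.

    Assumption: data lines look like 'SRC ; TGT ; TYPE # comment',
    as in the entries added by this commit.
    """
    data = line.split("#", 1)[0].strip()  # drop the trailing comment
    if not data:
        return None
    fields = [field.strip() for field in data.split(";")]
    if len(fields) != 3:
        raise ValueError(f"expected 3 fields, got {len(fields)}: {line!r}")
    source_hex, target_hex, kind = fields
    return Confusable(_decode(source_hex), _decode(target_hex), kind)


entry = parse_confusables_line(
    "0F7B ; 0F7A 0F7A ; MA # TIBETAN VOWEL SIGN EE -> TIBETAN VOWEL SIGN E, TIBETAN VOWEL SIGN E"
)
```

With the sample line above, entry.source is U+0F7B and entry.target is the two-character sequence U+0F7A U+0F7A, matching the mapping added to confusables.txt.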