Skip to content

Commit 6ad9e2c

Browse files
committed
Confusable Katakana-Han pair: generated data
1 parent 679348e commit 6ad9e2c

File tree

3 files changed

+13
-5
lines changed

3 files changed

+13
-5
lines changed

unicodetools/data/security/dev/confusables.txt

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusables.txt
2-
# Date: 2025-07-20, 17:08:53 GMT
2+
# Date: 2025-07-20, 17:12:46 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -7234,6 +7234,8 @@ D7C6 ; 119E 1165 4E28 ; MA # ( ퟆ → ᆞᅥ丨 ) HANGUL JUNGSEONG ARAEA-E →
72347234
2341 ; 303C ; MA #* ( ⍁ → 〼 ) APL FUNCTIONAL SYMBOL QUAD SLASH → MASU MARK # →⧄→
72357235
29C4 ; 303C ; MA #* ( ⧄ → 〼 ) SQUARED RISING DIAGONAL SLASH → MASU MARK #
72367236

7237+
4E8E ; 1B122 ; MA # ( 于 → 𛄢 ) CJK UNIFIED IDEOGRAPH-4E8E → KATAKANA LETTER ARCHAIC WU #
7238+
72377239
A49E ; A04A ; MA #* ( ꒞ → ꁊ ) YI RADICAL PUT → YI SYLLABLE PUT #
72387240

72397241
A4AC ; A050 ; MA #* ( ꒬ → ꁐ ) YI RADICAL PYT → YI SYLLABLE PYT #
@@ -9977,5 +9979,5 @@ A7CF ; A7CE ; MA # ( ꟏ → ꟎ ) LATIN SMALL LETTER PHARYNGEAL VOICED FRICATIV
99779979

99789980
6138 ; 2B73F ; MA # ( 愸 → 𫜿 ) CJK UNIFIED IDEOGRAPH-6138 → CJK UNIFIED IDEOGRAPH-2B73F #
99799981

9980-
# total: 6566
9982+
# total: 6567
99819983

unicodetools/data/security/dev/confusablesSummary.txt

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusablesSummary.txt
2-
# Date: 2025-07-20, 17:08:53 GMT
2+
# Date: 2025-07-20, 17:12:46 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -14044,6 +14044,10 @@
1404414044
(‎ 了 ‎) 4E86 CJK UNIFIED IDEOGRAPH-4E86
1404514045
← (‎ 了 ‎) F9BA CJK COMPATIBILITY IDEOGRAPH-F9BA
1404614046

14047+
# 𛄢 于
14048+
(‎ 于 ‎) 4E8E CJK UNIFIED IDEOGRAPH-4E8E
14049+
← (‎ 𛄢 ‎) 1B122 KATAKANA LETTER ARCHAIC WU
14050+
1404714051
# 亮 亮
1404814052
(‎ 亮 ‎) 4EAE CJK UNIFIED IDEOGRAPH-4EAE
1404914053
← (‎ 亮 ‎) F977 CJK COMPATIBILITY IDEOGRAPH-F977
@@ -17820,5 +17824,5 @@
1782017824
(‎ 𪘀 ‎) 2A600 CJK UNIFIED IDEOGRAPH-2A600
1782117825
← (‎ 𪘀 ‎) 2FA1D CJK COMPATIBILITY IDEOGRAPH-2FA1D
1782217826

17823-
# total : 7601
17827+
# total : 7602
1782417828

unicodetools/data/security/dev/data/source/formatted-source.txt

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# formatted-source.txt
2-
# Date: 2025-07-20, 17:08:52 GMT
2+
# Date: 2025-07-20, 17:12:46 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -4591,6 +4591,8 @@ AB51 ; 1E43 # ( ꭑ ~ ṃ ) LATIN SMALL LETTER TURNED UI ~ LATIN SMALL LETTER M
45914591

45924592
16FF3 ; 5152 # ( 𖿳 ~ 兒 ) CHINESE SMALL TRADITIONAL ER ~ CJK UNIFIED IDEOGRAPH-5152
45934593

4594+
1B122 ; 4E8E # ( 𛄢 ~ 于 ) KATAKANA LETTER ARCHAIC WU ~ CJK UNIFIED IDEOGRAPH-4E8E
4595+
45944596
1CCFA ; 1F40D #* ( 𜳺 ~ 🐍 ) SNAKE SYMBOL ~ SNAKE
45954597

45964598
1CCFC ; 1F443 #* ( 𜳼 ~ 👃 ) NOSE SYMBOL ~ NOSE

0 commit comments

Comments
 (0)