Skip to content

Commit ed542ba

Browse files
committed
UTC-183-C34 Generated data
1 parent d01fc46 commit ed542ba

15 files changed

+1179
-1280
lines changed

unicodetools/data/security/dev/IdentifierStatus.txt

Lines changed: 87 additions & 79 deletions
Large diffs are not rendered by default.

unicodetools/data/security/dev/IdentifierType.txt

Lines changed: 209 additions & 89 deletions
Large diffs are not rendered by default.

unicodetools/data/security/dev/confusables.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusables.txt
2-
# Date: 2025-04-29, 02:17:35 GMT
2+
# Date: 2025-05-01, 03:09:29 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html

unicodetools/data/security/dev/confusablesSummary.txt

Lines changed: 26 additions & 26 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# confusablesSummary.txt
2-
# Date: 2025-04-29, 02:17:35 GMT
2+
# Date: 2025-05-01, 03:09:29 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -44,15 +44,15 @@
4444
(‎ !? ‎) 0021 003F EXCLAMATION MARK, QUESTION MARK
4545
← (‎ ⁉ ‎) 2049 EXCLAMATION QUESTION MARK
4646

47-
# '' יי ′′ ‵‵ " ײ ʺ ˮ ״ ˶ ᳓ “ ” ‟ 〃 ˝ ″ ‶ "
47+
# '' יי ′′ ‵‵ " ʺ ˮ ײ ״ ˶ ᳓ “ ” ‟ 〃 ˝ ″ ‶ "
4848
(‎ " ‎) 0022 QUOTATION MARK
4949
← (‎ '' ‎) 0027 0027 APOSTROPHE, APOSTROPHE
5050
← (‎ יי ‎) 05D9 05D9 HEBREW LETTER YOD, HEBREW LETTER YOD # →''→
5151
← (‎ ′′ ‎) 2032 2032 PRIME, PRIME # →″→
5252
← (‎ ‵‵ ‎) 2035 2035 REVERSED PRIME, REVERSED PRIME # →''→
53-
← (‎ ײ ‎) 05F2 HEBREW LIGATURE YIDDISH DOUBLE YOD # →‎יי‎→→''→
5453
← (‎ ʺ ‎) 02BA MODIFIER LETTER DOUBLE PRIME
5554
← (‎ ˮ ‎) 02EE MODIFIER LETTER DOUBLE APOSTROPHE # →″→
55+
← (‎ ײ ‎) 05F2 HEBREW LIGATURE YIDDISH DOUBLE YOD # →‎יי‎→→''→
5656
← (‎ ״ ‎) 05F4 HEBREW PUNCTUATION GERSHAYIM
5757
← (‎ ˶ ‎) 02F6 MODIFIER LETTER MIDDLE DOUBLE ACUTE ACCENT # →˝→
5858
← (‎ ᳓ ‎) 1CD3 VEDIC SIGN NIHSHVASA # →″→
@@ -935,20 +935,20 @@
935935
(‎ /̄ ‎) 002F 0304 SOLIDUS, COMBINING MACRON
936936
← (‎ ⧶ ‎) 29F6 SOLIDUS WITH OVERBAR
937937

938-
# O 𑷠 𖶠 0 ০ ଠ ዐ 〇 Ο О Օ ߀ Ⲟ ⵔ ꓳ 𐊒 𐊫 𐐄 𐔖 𑓐 𑢵 𑣠 𐓂 🯰 𜳰 𜳤 O 𝐎 𝑂 𝑶 𝒪 𝓞 𝔒 𝕆 𝕺 𝖮 𝗢 𝘖 𝙊 𝙾 𝚶 𝛰 𝜪 𝝤 𝞞 𝟎 𝟘 𝟢 𝟬 𝟶
938+
# O 𑷠 𖶠 0 ০ ଠ ዐ 〇 Ο О Օ ߀ Ⲟ ⵔ ꓳ 𐊒 𐊫 𐐄 𐔖 𑓐 𑢵 𑣠 𐓂 🯰 𜳰 𜳤 O 𝐎 𝑂 𝑶 𝒪 𝓞 𝔒 𝕆 𝕺 𝖮 𝗢 𝘖 𝙊 𝙾 𝚶 𝛰 𝜪 𝝤 𝞞 𝟎 𝟘 𝟢 𝟬 𝟶
939939
(‎ 0 ‎) 0030 DIGIT ZERO
940940
← (‎ O ‎) 004F LATIN CAPITAL LETTER O
941941
← (‎ 𑷠 ‎) 11DE0 TOLONG SIKI DIGIT ZERO
942942
← (‎ 𖶠 ‎) 16DA0 CHISOI DIGIT ZERO
943943
← (‎ ০ ‎) 09E6 BENGALI DIGIT ZERO
944944
← (‎ ଠ ‎) 0B20 ORIYA LETTER TTHA # →୦→
945-
← (‎ ୦ ‎) 0B66 ORIYA DIGIT ZERO
946945
← (‎ ዐ ‎) 12D0 ETHIOPIC SYLLABLE PHARYNGEAL A # →Օ→→О→
947946
← (‎ 〇 ‎) 3007 IDEOGRAPHIC NUMBER ZERO # →O→
948947
← (‎ Ο ‎) 039F GREEK CAPITAL LETTER OMICRON
949948
← (‎ О ‎) 041E CYRILLIC CAPITAL LETTER O
950949
← (‎ Օ ‎) 0555 ARMENIAN CAPITAL LETTER OH # →О→
951950
← (‎ ߀ ‎) 07C0 NKO DIGIT ZERO
951+
← (‎ ୦ ‎) 0B66 ORIYA DIGIT ZERO
952952
← (‎ Ⲟ ‎) 2C9E COPTIC CAPITAL LETTER O # →О→
953953
← (‎ ⵔ ‎) 2D54 TIFINAGH LETTER YAR # →О→
954954
← (‎ ꓳ ‎) A4F3 LISU LETTER O # →O→
@@ -1712,13 +1712,13 @@
17121712
(‎ 7点 ‎) 0037 70B9 DIGIT SEVEN, CJK UNIFIED IDEOGRAPH-70B9
17131713
← (‎ ㍟ ‎) 335F IDEOGRAPHIC TELEGRAPH SYMBOL FOR HOUR SEVEN
17141714

1715-
# 8 Ȣ ȣ ৪ ੪ ଃ 𐌚 𞣋 🯸 𜳸 𝟖 𝟠 𝟪 𝟴 𝟾
1715+
# 8 Ȣ ȣ ৪ ଃ ੪ 𐌚 𞣋 🯸 𜳸 𝟖 𝟠 𝟪 𝟴 𝟾
17161716
(‎ 8 ‎) 0038 DIGIT EIGHT
17171717
← (‎ Ȣ ‎) 0222 LATIN CAPITAL LETTER OU
17181718
← (‎ ȣ ‎) 0223 LATIN SMALL LETTER OU
17191719
← (‎ ৪ ‎) 09EA BENGALI DIGIT FOUR
1720-
← (‎ ੪ ‎) 0A6A GURMUKHI DIGIT FOUR
17211720
← (‎ ଃ ‎) 0B03 ORIYA SIGN VISARGA
1721+
← (‎ ੪ ‎) 0A6A GURMUKHI DIGIT FOUR
17221722
← (‎ 𐌚 ‎) 1031A OLD ITALIC LETTER EF
17231723
← (‎ 𞣋 ‎) 1E8CB MENDE KIKAKUI DIGIT FIVE
17241724
← (‎ 🯸 ‎) 1FBF8 SEGMENTED DIGIT EIGHT
@@ -3719,7 +3719,7 @@
37193719
(‎ n̴ ‎) 006E 0334 LATIN SMALL LETTER N, COMBINING TILDE OVERLAY
37203720
← (‎ ᵰ ‎) 1D70 LATIN SMALL LETTER N WITH MIDDLE TILDE
37213721

3722-
# o ᴏ ᴑ ο σ о օ ס ه ٥ ھ ہ ە ۵ ० ੦ ૦ ௦ ం ౦ ಂ ೦ ം ഠ ං ๐ ໐ ဝ ၀ ჿ ⲟ 𐐬 ꬽ 𑣈 𑣗 𐓪 o ℴ 𞸤 𞹤 𞺄 ﮦ ﮧ ﮨ ﮩ ﮪ ﮫ ﮬ ﮭ ﻩ ﻪ ﻫ ﻬ 𝐨 𝑜 𝒐 𝓸 𝔬 𝕠 𝖔 𝗈 𝗼 𝘰 𝙤 𝚘 𝛐 𝛔 𝜊 𝜎 𝝄 𝝈 𝝾 𝞂 𝞸 𝞼
3722+
# o ᴏ ᴑ ο σ о օ ס ه ٥ ھ ہ ە ۵ ० ૦ ం ಂ ೦ ം ഠ ං ๐ ໐ ဝ ၀ ჿ ੦ ௦ ౦ ൦ ⲟ 𐐬 ꬽ 𑣈 𑣗 𐓪 o ℴ 𞸤 𞹤 𞺄 ﮦ ﮧ ﮨ ﮩ ﮪ ﮫ ﮬ ﮭ ﻩ ﻪ ﻫ ﻬ 𝐨 𝑜 𝒐 𝓸 𝔬 𝕠 𝖔 𝗈 𝗼 𝘰 𝙤 𝚘 𝛐 𝛔 𝜊 𝜎 𝝄 𝝈 𝝾 𝞂 𝞸 𝞼
37233723
(‎ o ‎) 006F LATIN SMALL LETTER O
37243724
← (‎ ᴏ ‎) 1D0F LATIN LETTER SMALL CAPITAL O
37253725
← (‎ ᴑ ‎) 1D11 LATIN SMALL LETTER SIDEWAYS O
@@ -3735,22 +3735,22 @@
37353735
← (‎ ە ‎) 06D5 ARABIC LETTER AE # →‎ه‎→
37363736
← (‎ ۵ ‎) 06F5 EXTENDED ARABIC-INDIC DIGIT FIVE # →‎٥‎→
37373737
← (‎ ० ‎) 0966 DEVANAGARI DIGIT ZERO
3738-
← (‎ ੦ ‎) 0A66 GURMUKHI DIGIT ZERO
37393738
← (‎ ૦ ‎) 0AE6 GUJARATI DIGIT ZERO
3740-
← (‎ ௦ ‎) 0BE6 TAMIL DIGIT ZERO
37413739
← (‎ ం ‎) 0C02 TELUGU SIGN ANUSVARA
3742-
← (‎ ౦ ‎) 0C66 TELUGU DIGIT ZERO
37433740
← (‎ ಂ ‎) 0C82 KANNADA SIGN ANUSVARA
37443741
← (‎ ೦ ‎) 0CE6 KANNADA DIGIT ZERO # →౦→
37453742
← (‎ ം ‎) 0D02 MALAYALAM SIGN ANUSVARA
37463743
← (‎ ഠ ‎) 0D20 MALAYALAM LETTER TTHA
3747-
← (‎ ൦ ‎) 0D66 MALAYALAM DIGIT ZERO
37483744
← (‎ ං ‎) 0D82 SINHALA SIGN ANUSVARAYA
37493745
← (‎ ๐ ‎) 0E50 THAI DIGIT ZERO
37503746
← (‎ ໐ ‎) 0ED0 LAO DIGIT ZERO
37513747
← (‎ ဝ ‎) 101D MYANMAR LETTER WA
37523748
← (‎ ၀ ‎) 1040 MYANMAR DIGIT ZERO
37533749
← (‎ ჿ ‎) 10FF GEORGIAN LETTER LABIAL SIGN
3750+
← (‎ ੦ ‎) 0A66 GURMUKHI DIGIT ZERO
3751+
← (‎ ௦ ‎) 0BE6 TAMIL DIGIT ZERO
3752+
← (‎ ౦ ‎) 0C66 TELUGU DIGIT ZERO
3753+
← (‎ ൦ ‎) 0D66 MALAYALAM DIGIT ZERO
37543754
← (‎ ⲟ ‎) 2C9F COPTIC SMALL LETTER O
37553755
← (‎ 𐐬 ‎) 1042C DESERET SMALL LETTER LONG O
37563756
← (‎ ꬽ ‎) AB3D LATIN SMALL LETTER BLACKLETTER O
@@ -5077,7 +5077,7 @@
50775077
(‎ Ţ ‎) 0162 LATIN CAPITAL LETTER T WITH CEDILLA
50785078
← (‎ Ț ‎) 021A LATIN CAPITAL LETTER T WITH COMMA BELOW
50795079

5080-
# ƫ Ꮏ ţ ț
5080+
# ƫ Ꮏ ț ţ
50815081
(‎ ţ ‎) 0163 LATIN SMALL LETTER T WITH CEDILLA
50825082
← (‎ ƫ ‎) 01AB LATIN SMALL LETTER T WITH PALATAL HOOK
50835083
← (‎ Ꮏ ‎) 13BF CHEROKEE LETTER HNA # →ƫ→
@@ -5543,14 +5543,14 @@
55435543
← (‎ ٚ ‎) 065A ARABIC VOWEL SIGN SMALL V ABOVE # →̌→
55445544
← (‎ ꙼ ‎) A67C COMBINING CYRILLIC KAVYKA
55455545

5546-
# ̆̇ ̐ ँ ঁ ଁ ۨ ఀ ಁ ഁ 𑒿
5546+
# ̆̇ ̐ ँ ঁ ଁ ۨ ఀ ಁ ഁ 𑒿
55475547
(‎ ̆̇ ‎) 0306 0307 COMBINING BREVE, COMBINING DOT ABOVE
55485548
← (‎ ̐ ‎) 0310 COMBINING CANDRABINDU
55495549
← (‎ ँ ‎) 0901 DEVANAGARI SIGN CANDRABINDU # →̐→
55505550
← (‎ ঁ ‎) 0981 BENGALI SIGN CANDRABINDU # →̐→
5551-
← (‎ ઁ ‎) 0A81 GUJARATI SIGN CANDRABINDU # →̐→
55525551
← (‎ ଁ ‎) 0B01 ORIYA SIGN CANDRABINDU # →̐→
55535552
← (‎ ۨ ‎) 06E8 ARABIC SMALL HIGH NOON # →̐→
5553+
← (‎ ઁ ‎) 0A81 GUJARATI SIGN CANDRABINDU # →̐→
55545554
← (‎ ఀ ‎) 0C00 TELUGU SIGN COMBINING CANDRABINDU ABOVE # →ँ→→̐→
55555555
← (‎ ಁ ‎) 0C81 KANNADA SIGN CANDRABINDU # →ँ→→̐→
55565556
← (‎ ഁ ‎) 0D01 MALAYALAM SIGN CANDRABINDU # →ँ→→̐→
@@ -5670,14 +5670,14 @@
56705670
← (‎ ͅ ‎) 0345 COMBINING GREEK YPOGEGRAMMENI # →̨→
56715671
← (‎ ᪷ ‎) 1AB7 COMBINING OPEN MARK BELOW # →̨→
56725672

5673-
# ̣ ִ ़ ় ਼ ઼ ଼ ׅ ٜ ࣭ ᳝ 𐨺 𑓃 𑇊
5673+
# ̣ ़ ় ਼ ઼ ଼ ִ ׅ ٜ ࣭ ᳝ 𐨺 𑓃 𑇊
56745674
(‎ ̣ ‎) 0323 COMBINING DOT BELOW
5675-
← (‎ ִ ‎) 05B4 HEBREW POINT HIRIQ
56765675
← (‎ ़ ‎) 093C DEVANAGARI SIGN NUKTA
56775676
← (‎ ় ‎) 09BC BENGALI SIGN NUKTA
56785677
← (‎ ਼ ‎) 0A3C GURMUKHI SIGN NUKTA
56795678
← (‎ ઼ ‎) 0ABC GUJARATI SIGN NUKTA
56805679
← (‎ ଼ ‎) 0B3C ORIYA SIGN NUKTA
5680+
← (‎ ִ ‎) 05B4 HEBREW POINT HIRIQ
56815681
← (‎ ׅ ‎) 05C5 HEBREW MARK LOWER DOT
56825682
← (‎ ٜ ‎) 065C ARABIC VOWEL SIGN DOT BELOW
56835683
← (‎ ࣭ ‎) 08ED ARABIC TONE ONE DOT BELOW
@@ -8545,15 +8545,15 @@
85458545
← (‎ 𑂻 ‎) 110BB KAITHI ABBREVIATION SIGN
85468546
← (‎ 𑇇 ‎) 111C7 SHARADA ABBREVIATION SIGN
85478547

8548-
# ঃ 𖶜 ః ಃ ഃ ඃ း 𑓁
8548+
# ঃ 𖶜 ః ಃ ഃ ඃ း 𑓁
85498549
(‎ ঃ ‎) 0983 BENGALI SIGN VISARGA
85508550
← (‎ 𖶜 ‎) 16D9C CHISOI LETTER JARAHA
8551-
← (‎ ਃ ‎) 0A03 GURMUKHI SIGN VISARGA
85528551
← (‎ ః ‎) 0C03 TELUGU SIGN VISARGA # →ਃ→
85538552
← (‎ ಃ ‎) 0C83 KANNADA SIGN VISARGA # →ః→→ਃ→
85548553
← (‎ ഃ ‎) 0D03 MALAYALAM SIGN VISARGA # →ಃ→→ః→→ਃ→
85558554
← (‎ ඃ ‎) 0D83 SINHALA SIGN VISARGAYA # →ഃ→→ಃ→→ః→→ਃ→
85568555
← (‎ း ‎) 1038 MYANMAR SIGN VISARGA # →ඃ→→ഃ→→ಃ→→ః→→ਃ→
8556+
← (‎ ਃ ‎) 0A03 GURMUKHI SIGN VISARGA
85578557
← (‎ 𑓁 ‎) 114C1 TIRHUTA SIGN VISARGA
85588558

85598559
# অা আ
@@ -8784,10 +8784,10 @@
87848784
← (‎ ரு ‎) 0BB0 0BC1 TAMIL LETTER RA, TAMIL VOWEL SIGN U
87858785
← (‎ ௫ ‎) 0BEB TAMIL DIGIT FIVE # →ரு→
87868786

8787-
# உ ௨ ഉ
8787+
# உ ഉ ௨
87888788
(‎ உ ‎) 0B89 TAMIL LETTER U
8789-
← (‎ ௨ ‎) 0BE8 TAMIL DIGIT TWO
87908789
← (‎ ഉ ‎) 0D09 MALAYALAM LETTER U
8790+
← (‎ ௨ ‎) 0BE8 TAMIL DIGIT TWO
87918791

87928792
# உள ஊ
87938793
(‎ உள ‎) 0B89 0BB3 TAMIL LETTER U, TAMIL LETTER LLA
@@ -9011,7 +9011,7 @@
90119011
(‎ ഇൗ ‎) 0D07 0D57 MALAYALAM LETTER I, MALAYALAM AU LENGTH MARK
90129012
← (‎ ഈ ‎) 0D08 MALAYALAM LETTER II
90139013

9014-
# നു ഌ ങ
9014+
# നു ങ ഌ
90159015
(‎ ഌ ‎) 0D0C MALAYALAM LETTER VOCALIC L
90169016
← (‎ നു ‎) 0D28 0D41 MALAYALAM LETTER NA, MALAYALAM VOWEL SIGN U
90179017
← (‎ ങ ‎) 0D19 MALAYALAM LETTER NGA
@@ -9038,10 +9038,10 @@
90389038
(‎ ദ്ര ‎) 0D26 0D4D 0D30 MALAYALAM LETTER DA, MALAYALAM SIGN VIRAMA, MALAYALAM LETTER RA
90399039
← (‎ ൫ ‎) 0D6B MALAYALAM DIGIT FIVE
90409040

9041-
# ന് ൯ ൻ
9041+
# ന് ൻ ൯
90429042
(‎ ന് ‎) 0D28 0D4D MALAYALAM LETTER NA, MALAYALAM SIGN VIRAMA
9043-
← (‎ ൯ ‎) 0D6F MALAYALAM DIGIT NINE
90449043
← (‎ ൻ ‎) 0D7B MALAYALAM LETTER CHILLU N # →൯→
9044+
← (‎ ൯ ‎) 0D6F MALAYALAM DIGIT NINE
90459045

90469046
# ന്ന ൬
90479047
(‎ ന്ന ‎) 0D28 0D4D 0D28 MALAYALAM LETTER NA, MALAYALAM SIGN VIRAMA, MALAYALAM LETTER NA
@@ -9055,10 +9055,10 @@
90559055
(‎ ര ‎) 0D30 MALAYALAM LETTER RA
90569056
← (‎ റ ‎) 0D31 MALAYALAM LETTER RRA
90579057

9058-
# ര് ൪ ർ
9058+
# ര് ർ ൪
90599059
(‎ ര് ‎) 0D30 0D4D MALAYALAM LETTER RA, MALAYALAM SIGN VIRAMA
9060-
← (‎ ൪ ‎) 0D6A MALAYALAM DIGIT FOUR
90619060
← (‎ ർ ‎) 0D7C MALAYALAM LETTER CHILLU RR # →൪→
9061+
← (‎ ൪ ‎) 0D6A MALAYALAM DIGIT FOUR
90629062

90639063
# വ്ര വ് ൮
90649064
(‎ വ് ‎) 0D35 0D4D MALAYALAM LETTER VA, MALAYALAM SIGN VIRAMA

0 commit comments

Comments
 (0)