Skip to content

Commit 139e4ed

Browse files
authored
feat: add unicode-3.1.1 (#81)
1 parent c57dde1 commit 139e4ed

15 files changed

+35860
-8
lines changed

.github/workflows/ci.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -18,5 +18,7 @@ jobs:
1818
node-version-file: '.nvmrc'
1919
- name: Install dependencies
2020
run: npm install
21+
- name: Build
22+
run: npm run build
2123
- name: Run tests
2224
run: npm test

data/3.1.1-blocks.txt

Lines changed: 101 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,101 @@
1+
# Start Code..End Code; Block Name
2+
0000..007F; Basic Latin
3+
0080..00FF; Latin-1 Supplement
4+
0100..017F; Latin Extended-A
5+
0180..024F; Latin Extended-B
6+
0250..02AF; IPA Extensions
7+
02B0..02FF; Spacing Modifier Letters
8+
0300..036F; Combining Diacritical Marks
9+
0370..03FF; Greek
10+
0400..04FF; Cyrillic
11+
0530..058F; Armenian
12+
0590..05FF; Hebrew
13+
0600..06FF; Arabic
14+
0700..074F; Syriac
15+
0780..07BF; Thaana
16+
0900..097F; Devanagari
17+
0980..09FF; Bengali
18+
0A00..0A7F; Gurmukhi
19+
0A80..0AFF; Gujarati
20+
0B00..0B7F; Oriya
21+
0B80..0BFF; Tamil
22+
0C00..0C7F; Telugu
23+
0C80..0CFF; Kannada
24+
0D00..0D7F; Malayalam
25+
0D80..0DFF; Sinhala
26+
0E00..0E7F; Thai
27+
0E80..0EFF; Lao
28+
0F00..0FFF; Tibetan
29+
1000..109F; Myanmar
30+
10A0..10FF; Georgian
31+
1100..11FF; Hangul Jamo
32+
1200..137F; Ethiopic
33+
13A0..13FF; Cherokee
34+
1400..167F; Unified Canadian Aboriginal Syllabics
35+
1680..169F; Ogham
36+
16A0..16FF; Runic
37+
1780..17FF; Khmer
38+
1800..18AF; Mongolian
39+
1E00..1EFF; Latin Extended Additional
40+
1F00..1FFF; Greek Extended
41+
2000..206F; General Punctuation
42+
2070..209F; Superscripts and Subscripts
43+
20A0..20CF; Currency Symbols
44+
20D0..20FF; Combining Marks for Symbols
45+
2100..214F; Letterlike Symbols
46+
2150..218F; Number Forms
47+
2190..21FF; Arrows
48+
2200..22FF; Mathematical Operators
49+
2300..23FF; Miscellaneous Technical
50+
2400..243F; Control Pictures
51+
2440..245F; Optical Character Recognition
52+
2460..24FF; Enclosed Alphanumerics
53+
2500..257F; Box Drawing
54+
2580..259F; Block Elements
55+
25A0..25FF; Geometric Shapes
56+
2600..26FF; Miscellaneous Symbols
57+
2700..27BF; Dingbats
58+
2800..28FF; Braille Patterns
59+
2E80..2EFF; CJK Radicals Supplement
60+
2F00..2FDF; Kangxi Radicals
61+
2FF0..2FFF; Ideographic Description Characters
62+
3000..303F; CJK Symbols and Punctuation
63+
3040..309F; Hiragana
64+
30A0..30FF; Katakana
65+
3100..312F; Bopomofo
66+
3130..318F; Hangul Compatibility Jamo
67+
3190..319F; Kanbun
68+
31A0..31BF; Bopomofo Extended
69+
3200..32FF; Enclosed CJK Letters and Months
70+
3300..33FF; CJK Compatibility
71+
3400..4DB5; CJK Unified Ideographs Extension A
72+
4E00..9FFF; CJK Unified Ideographs
73+
A000..A48F; Yi Syllables
74+
A490..A4CF; Yi Radicals
75+
AC00..D7A3; Hangul Syllables
76+
D800..DB7F; High Surrogates
77+
DB80..DBFF; High Private Use Surrogates
78+
DC00..DFFF; Low Surrogates
79+
E000..F8FF; Private Use
80+
F900..FAFF; CJK Compatibility Ideographs
81+
FB00..FB4F; Alphabetic Presentation Forms
82+
FB50..FDFF; Arabic Presentation Forms-A
83+
FE20..FE2F; Combining Half Marks
84+
FE30..FE4F; CJK Compatibility Forms
85+
FE50..FE6F; Small Form Variants
86+
FE70..FEFE; Arabic Presentation Forms-B
87+
FEFF..FEFF; Specials
88+
FF00..FFEF; Halfwidth and Fullwidth Forms
89+
FFF0..FFFD; Specials
90+
10300..1032F; Old Italic
91+
10330..1034F; Gothic
92+
10400..1044F; Deseret
93+
1D000..1D0FF; Byzantine Musical Symbols
94+
1D100..1D1FF; Musical Symbols
95+
1D400..1D7FF; Mathematical Alphanumeric Symbols
96+
20000..2A6D6; CJK Unified Ideographs Extension B
97+
2F800..2FA1F; CJK Compatibility Ideographs Supplement
98+
E0000..E007F; Tags
99+
F0000..FFFFD; Private Use
100+
100000..10FFFD; Private Use
101+

0 commit comments

Comments
 (0)