Skip to content

Commit d537c82

Browse files
committed
add metadata
1 parent fbed623 commit d537c82

File tree

3 files changed

+8083
-21
lines changed

3 files changed

+8083
-21
lines changed

README.md

Lines changed: 24 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -1,25 +1,32 @@
11
# GlotScript
2+
3+
## GlotScript Resource
4+
5+
See [metadata folder](./metadata/).
6+
7+
## GlotScript Tool
8+
29
Detect the script (writing system) of text based on ISO 15924.
310
- Unicode version: 15.0.0
411
- The codes were sourced from [Wikipedia ISO_15924](https://en.wikipedia.org/wiki/ISO_15924).
512
- Unicode ranges were extracted from [Unicode Character Database](https://www.unicode.org/Public/15.0.0/ucd/Scripts.txt).
613

7-
## Special codes
14+
### Special codes
815
- `Zinh` code is the Unicode script property value of characters that may be used with multiple scripts, and that inherit their script from a preceding base character. In some cases, we opted to integrate parts of the Zinh code (e.g. ARABIC FATHATAN..ARABIC HAMZA BELOW, ARABIC LETTER SUPERSCRIPT ALEF) into a different block.
916
- `Zyyy` code is the Unicode script for "Common" characters.
1017
- `Zzzz` code is for Unicode script for "uncoded" script.
1118

12-
## Install from pip
19+
### Install from pip
1320
```bash
1421
pip3 install GlotScript
1522
```
1623

17-
## Install from git
24+
### Install from git
1825
```bash
1926
pip3 install GlotScript@git+https://github.com/cisnlp/GlotScript
2027
```
2128

22-
## Usage
29+
### Usage
2330

2431
```python
2532
from GlotScript import get_script_predictor
@@ -46,23 +53,7 @@ sp('𝄞𝄫 𒊕𒀸')
4653
>> ('Xsux', 0.5, {'details': {'Xsux': 0.5, 'Zyyy': 0.5}, 'tie': True, 'interval': 0.0})
4754
```
4855

49-
50-
## Citation
51-
If you use any part of this library in your research, please cite it using the following BibTex entry.
52-
53-
```
54-
@misc{glotscript,
55-
author = {Kargaran, Amir Hossein and Yvon, Fran{\c{c}}ois and Sch{\"u}tze, Hinrich},
56-
title = {GlotScript},
57-
year = {2023},
58-
publisher = {GitHub},
59-
journal = {GitHub Repository},
60-
howpublished = {\url{https://github.com/cisnlp/GlotScript}},
61-
}
62-
```
63-
64-
65-
## Exploring Unicode Blocks: Related Sources
56+
### Exploring Unicode Blocks: Related Sources
6657
<details>
6758
<summary>Click to Exapand</summary>
6859

@@ -86,4 +77,16 @@ If you use any part of this library in your research, please cite it using the f
8677

8778
</details>
8879

80+
## Citation
81+
If you use any part of this library in your research, please cite it using the following BibTex entry.
8982

83+
```
84+
@misc{glotscript,
85+
author = {Kargaran, Amir Hossein and Yvon, Fran{\c{c}}ois and Sch{\"u}tze, Hinrich},
86+
title = {GlotScript},
87+
year = {2023},
88+
publisher = {GitHub},
89+
journal = {GitHub Repository},
90+
howpublished = {\url{https://github.com/cisnlp/GlotScript}},
91+
}
92+
```

0 commit comments

Comments
 (0)