Commit 1b6a269

Update README.md
1 parent 194e3c4 commit 1b6a269

1 file changed: +38 -3 lines changed

README.md

Lines changed: 38 additions & 3 deletions
@@ -1,4 +1,39 @@
 # copius_api
-an api to transcribe following languages: Mari (Hill Mari) | Udmurt | Komi | Erzya | Moksha | Mansi || Tatar | Bashkir | Chuvash || Russian
-between following scripts:
-Cyrillic, Latin, IPA, ISO9, <1917
+This is a Python-api to the transcription and orthography toolset at https://copius.eu/ortho.php:
+
+"This toolset is a loose conglomeration of applications aiming to help you handle various character encodings, orthographies, transcriptions and transliterations you might encounter when working with Uralic languages and other languages of Europe and Northern Asia that use variants of the Latin or Cyrillic alphabet." (Copyright © 2021 COPIUS)
+
+How to use:
+
+```
+>>> from copius_api import api
+>>> api.transcribe("ke̮")
+"kɘ"
+```
+```
+>>> from copius_api import api
+>>> api.transcribe("lol","kom","lc")
+"лол"
+```
+```
+>>> from copius_api import api
+>>> api.transcribe("kiki","mns","9c")
+"кики"
+```
+```
+>>> from copius_api import api
+>>> api.transcribe("буба","mns","c9")
+"buba"
+```
+To see the language abbreviations:
+
+```
+>>> from copius_api import api
+>>> api.lang_dict
+{'Mari (Hill Mari)': 'mhr', 'Udmurt': 'udm', 'Komi': 'kom', 'Erzya': 'myv', 'Moksha': 'mdf', 'Mansi': 'mns', 'Tatar': 'tat', 'Bashkir': 'bak', 'Chuvash': 'chv', 'Russian': 'rus'}
+```
+```
+>>> from copius_api import api
+>>> api.orth_dict
+{'Cyrillic to Cyrillic': 'cc', 'Cyrillic to Latin': 'cl', 'Cyrillic to IPA': 'ci', 'Cyrillic to ISO9': 'c9', 'Latin to Cyrillic': 'lc', 'Latin to Latin': 'll', 'Latin to IPA': 'li', 'Latin to ISO9': 'l9', 'IPA to Cyrillic': 'ic', 'IPA to Latin': 'il', 'IPA to ISO9': 'i9', 'ISO9 to Cyrillic': '9c', 'ISO9 to Latin': '9l', 'ISO9 to IPA': '9i', '<1917 to Cyrillic': '3c', '<1917 to Latin': '3l', '<1917 to IPA': '3i', '<1917 to ISO9': '39'}
+```
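
The second and third arguments in the `api.transcribe` calls above appear to be values taken from `api.lang_dict` and `api.orth_dict`. As a rough sketch of how readable names could be mapped to those codes, the snippet below copies both tables from the README output and adds a lookup helper. `resolve_codes` is a hypothetical name used for illustration only and is not part of `copius_api`. The tables are duplicated locally so the sketch runs without the package installed.

```python
# Tables copied verbatim from the api.lang_dict / api.orth_dict output in the README.
LANG_DICT = {
    'Mari (Hill Mari)': 'mhr', 'Udmurt': 'udm', 'Komi': 'kom', 'Erzya': 'myv',
    'Moksha': 'mdf', 'Mansi': 'mns', 'Tatar': 'tat', 'Bashkir': 'bak',
    'Chuvash': 'chv', 'Russian': 'rus',
}
ORTH_DICT = {
    'Cyrillic to Cyrillic': 'cc', 'Cyrillic to Latin': 'cl',
    'Cyrillic to IPA': 'ci', 'Cyrillic to ISO9': 'c9',
    'Latin to Cyrillic': 'lc', 'Latin to Latin': 'll',
    'Latin to IPA': 'li', 'Latin to ISO9': 'l9',
    'IPA to Cyrillic': 'ic', 'IPA to Latin': 'il', 'IPA to ISO9': 'i9',
    'ISO9 to Cyrillic': '9c', 'ISO9 to Latin': '9l', 'ISO9 to IPA': '9i',
    '<1917 to Cyrillic': '3c', '<1917 to Latin': '3l',
    '<1917 to IPA': '3i', '<1917 to ISO9': '39',
}


def resolve_codes(language, source_script, target_script):
    """Hypothetical helper (not part of copius_api).

    Returns the (language code, orthography code) pair for the given names,
    or raises ValueError if the language or script pair is not listed.
    """
    if language not in LANG_DICT:
        raise ValueError(f"unknown language: {language!r}")
    pair = f"{source_script} to {target_script}"
    if pair not in ORTH_DICT:
        raise ValueError(f"unsupported script pair: {pair!r}")
    return LANG_DICT[language], ORTH_DICT[pair]


if __name__ == "__main__":
    lang, orth = resolve_codes("Komi", "Latin", "Cyrillic")
    print(lang, orth)  # prints: kom lc
    # With the package installed, the codes would be passed on as in the README:
    #   from copius_api import api
    #   api.transcribe("lol", lang, orth)
```

Not every combination is listed in `orth_dict` (for example, there is no entry that targets `<1917`), so the helper raises `ValueError` for a missing pair rather than guessing a code.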

0 commit comments
