Data Files in different versions
Shreeshrii edited this page Jul 3, 2018 · 10 revisions
| Lang Code | Language | 3.00 | 3.02 | 3.04 | 4.0.0 |
|---|---|---|---|---|---|
| afr | Afrikaans | x | x | ||
| amh | Amharic | x | x | ||
| ara | Arabic | x | x | ||
| asm | Assamese | x | x | ||
| aze | Azerbaijani | x | x | ||
| aze_cyrl | Azerbaijani - Cyrillic | x | x | ||
| bel | Belarusian | x | x | ||
| ben | Bengali | x | x | ||
| bod | Tibetan | x | x | ||
| bos | Bosnian | x | x | ||
| bre | Breton | x | |||
| bul | Bulgarian | x | x | ||
| cat | Catalan; Valencian | x | x | ||
| ceb | Cebuano | x | x | ||
| ces | Czech | x | x | ||
| chi_sim | Chinese - Simplified | x | x | ||
| chi_tra | Chinese - Traditional | x | x | ||
| chr | Cherokee | x | x | ||
| cym | Welsh | x | x | ||
| dan | Danish | x | x | ||
| dan_frak | Danish - Fraktur | x | |||
| deu | German | x | x | ||
| deu_frak | German - Fraktur | x | |||
| dzo | Dzongkha | x | x | ||
| ell | Greek, Modern (1453-) | x | x | ||
| eng | English | x | x | x | x |
| enm | English, Middle (1100-1500) | x | x | ||
| epo | Esperanto | x | x | ||
| equ | Math / equation detection module | x | |||
| est | Estonian | x | x | ||
| eus | Basque | x | x | ||
| fas | Persian | x | x | ||
| fin | Finnish | x | x | ||
| fra | French | x | x | ||
| frk | Frankish | x | x | ||
| frm | French, Middle (ca.1400-1600) | x | x | ||
| gle | Irish | x | x | ||
| glg | Galician | x | x | ||
| grc | Greek, Ancient (to 1453) | x | x | ||
| guj | Gujarati | x | x | ||
| hat | Haitian; Haitian Creole | x | x | ||
| heb | Hebrew | x | x | ||
| hin | Hindi | x | x | ||
| hrv | Croatian | x | x | ||
| hun | Hungarian | x | x | ||
| iku | Inuktitut | x | x | ||
| ind | Indonesian | x | x | ||
| isl | Icelandic | x | x | ||
| ita | Italian | x | x | ||
| ita_old | Italian - Old | x | x | ||
| jav | Javanese | x | x | ||
| jpn | Japanese | x | x | ||
| kan | Kannada | x | x | ||
| kat | Georgian | x | x | ||
| kat_old | Georgian - Old | x | x | ||
| kaz | Kazakh | x | x | ||
| khm | Central Khmer | x | x | ||
| kir | Kirghiz; Kyrgyz | x | x | ||
| kmr | Kurmanji (Latin Script) | x | |||
| kor | Korean | x | x | ||
| kor_vert | Korean (vertical) | x | |||
| kur | Kurdish | x | |||
| kur_ara | Kurdish (Arabic) | x | |||
| lao | Lao | x | x | ||
| lat | Latin | x | x | ||
| lav | Latvian | x | x | ||
| lit | Lithuanian | x | x | ||
| ltz | Luxembourgish | x | |||
| mal | Malayalam | x | x | ||
| mar | Marathi | x | x | ||
| mkd | Macedonian | x | x | ||
| mlt | Maltese | x | x | ||
| mon | Mongolian | x | |||
| mri | Maori | x | |||
| msa | Malay | x | x | ||
| mya | Burmese | x | x | ||
| nep | Nepali | x | x | ||
| nld | Dutch; Flemish | x | x | ||
| nor | Norwegian | x | |||
| oci | Occitan (post 1500) | x | x | ||
| ori | Oriya | x | x | ||
| osd | Orientation and script detection module | x | x | x | x |
| pan | Panjabi; Punjabi | x | x | ||
| pol | Polish | x | x | ||
| por | Portuguese | x | x | ||
| pus | Pushto; Pashto | x | x | ||
| que | Quechua | x | |||
| ron | Romanian; Moldavian; Moldovan | x | x | ||
| rus | Russian | x | x | ||
| san | Sanskrit | x | x | ||
| sin | Sinhala; Sinhalese | x | x | ||
| slk | Slovak | x | x | ||
| slk_frak | Slovak - Fraktur | x | |||
| slv | Slovenian | x | x | ||
| snd | Sindhi | x | |||
| spa | Spanish; Castilian | x | x | ||
| spa_old | Spanish; Castilian - Old | x | x | ||
| sqi | Albanian | x | x | ||
| srp | Serbian | x | x | ||
| srp_latn | Serbian - Latin | x | x | ||
| sun | Sundanese | x | |||
| swa | Swahili | x | x | ||
| swe | Swedish | x | x | ||
| syr | Syriac | x | x | ||
| tam | Tamil | x | x | ||
| tat | Tatar | x | |||
| tel | Telugu | x | x | ||
| tgk | Tajik | x | x | ||
| tgl | Tagalog | x | x | ||
| tha | Thai | x | x | ||
| tir | Tigrinya | x | x | ||
| ton | Tonga | x | |||
| tur | Turkish | x | x | ||
| uig | Uighur; Uyghur | x | x | ||
| ukr | Ukrainian | x | x | ||
| urd | Urdu | x | x | ||
| uzb | Uzbek | x | x | ||
| uzb_cyrl | Uzbek - Cyrillic | x | x | ||
| vie | Vietnamese | x | x | ||
| yid | Yiddish | x | x | ||
| yor | Yoruba | x | |||
These wiki pages are no longer maintained.
All pages were moved to tesseract-ocr/tessdoc.
The latest documentation is available at https://tesseract-ocr.github.io/.