Adding Support for Hebrew #8234
eliorc
started this conversation in
Language Support
Replies: 2 comments 15 replies
-
Sorry, we don't have a better guide than that thread. What data do you have? Since the basic tokenizer support for Hebrew is already in spaCy, if you have CoNLL format data with tagger and parser annotations, you should be able to use |
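For anyone checking whether their data fits the description above, here is a minimal sketch of reading CoNLL-U style annotations (one common CoNLL variant) with only the Python standard library. It assumes ten tab-separated columns per token line and that heads are fully annotated. The `Token` tuple, `read_conllu`, and `row` are hypothetical helpers made up for this illustration, not part of spaCy, and the sample sentence is hand-written rather than taken from a treebank. Multiword-token lines (IDs such as `1-2`), which Hebrew treebanks use for prefixes like ה, are skipped so that only syntactic words are kept.

```python
from typing import Iterable, Iterator, List, NamedTuple


class Token(NamedTuple):
    """Hypothetical container for the columns a tagger/parser needs."""
    form: str    # surface form (column 2)
    upos: str    # universal POS tag (column 4)
    head: int    # index of the syntactic head, 0 for root (column 7)
    deprel: str  # dependency relation to the head (column 8)


def read_conllu(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield one list of tokens per sentence from CoNLL-U style lines."""
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Comment lines carry metadata such as the raw text.
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        token_id = cols[0]
        if "-" in token_id or "." in token_id:
            # Skip multiword-token ranges (1-2) and empty nodes (1.1).
            continue
        sentence.append(Token(cols[1], cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence


def row(*cols: str) -> str:
    """Join columns with tabs so the sample survives copy and paste."""
    return "\t".join(cols)


# Hand-written illustrative sample, not real treebank data.
SAMPLE = [
    "# text = הילד רץ",
    row("1-2", "הילד", "_", "_", "_", "_", "_", "_", "_", "_"),
    row("1", "ה", "ה", "DET", "_", "_", "2", "det", "_", "_"),
    row("2", "ילד", "ילד", "NOUN", "_", "_", "3", "nsubj", "_", "_"),
    row("3", "רץ", "רץ", "VERB", "_", "_", "0", "root", "_", "_"),
    "",
]

sentences = list(read_conllu(SAMPLE))
for token in sentences[0]:
    print(token.form, token.upos, token.head, token.deprel)
```

If a file parses cleanly like this and the tag and head columns are filled in, it most likely has the tagger and parser annotations the reply refers to.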
-
@eliorc |
-
Hi, I've already read the Adding models for new languages master thread and I couldn't find a straightforward guide for adding a new language.
To be honest, adding Hebrew to spaCy is an initiative I tried to take on before, but at that time (a few years ago) the language resources were very poor.
Lately there have been advances (PoS models, language models, etc.) created by the community, along with some initiatives that are supported, including financially, by the government, which makes adding Hebrew to spaCy much more feasible.
My question is: where can I find a documented, step-by-step guide for adding a new language from scratch?
(I am aware of the he section in the languages)