There is currently no trainable lemmatizer in the core spaCy library. An experimental trainable edit tree lemmatizer is in development; see:

https://explosion.ai/blog/edit-tree-lemmatizer
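To give a rough idea of what "trainable" means here, below is a toy sketch in plain Python. It is not spaCy's edit tree lemmatizer and does not use any spaCy API: it is a much simpler stand-in that learns suffix-replacement rules from (form, lemma) pairs and applies the rule seen most often for a word's ending. The function names (`suffix_rule`, `train`, `lemmatize`), the `max_suffix` parameter, and the training pairs are all made up for illustration.

```python
from collections import Counter, defaultdict
from os.path import commonprefix


def suffix_rule(form, lemma):
    """Return (suffix_to_strip, suffix_to_add) that turns form into lemma."""
    stem = commonprefix([form, lemma])
    return form[len(stem):], lemma[len(stem):]


def train(pairs, max_suffix=3):
    """For each word ending (up to max_suffix chars), keep its most frequent rule."""
    counts = defaultdict(Counter)
    for form, lemma in pairs:
        rule = suffix_rule(form, lemma)
        for n in range(1, min(max_suffix, len(form)) + 1):
            counts[form[-n:]][rule] += 1
    return {ending: c.most_common(1)[0][0] for ending, c in counts.items()}


def lemmatize(form, model, max_suffix=3):
    """Apply the rule for the longest known ending; fall back to the form itself."""
    for n in range(min(max_suffix, len(form)), 0, -1):
        rule = model.get(form[-n:])
        if rule is not None and form.endswith(rule[0]):
            strip, add = rule
            return form[:len(form) - len(strip)] + add
    return form


# Hypothetical training data, for illustration only.
pairs = [("walked", "walk"), ("jumped", "jump"), ("cats", "cat"), ("dogs", "dog")]
model = train(pairs)

print(lemmatize("talked", model))  # talk
print(lemmatize("birds", model))   # bird
print(lemmatize("xyz", model))     # xyz (no rule matches, form is returned as-is)
```

The point is only that the rules come from annotated data rather than from hand-written tables. The real component is considerably more involved (see the blog post above), so treat this as a conceptual illustration and not as a description of its algorithm.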

There's a UD benchmark project that uses it (along with a trainable tokenizer that I wouldn't recommend outside of benchmarking; see https://explosion.ai/blog/ud-benchmarks-v3-2):

https://github.com/explosion/projects/tree/v3/benchmarks/ud_benchmark

I would currently guess that the edit tree lemmatizer could move into the core library in v3.3.0, but we haven't made an official decision yet.

Answer selected by svlandeg
Labels
feat / lemmatizer Feature: Rule-based and lookup lemmatization
3 participants