What is "trainable" about the lemmatizer? #8686
-
I can see that either a rule-based or lookup lemmatizer can be implemented and that many models use a trained, rule-based lemmatizer. I cannot find what kind of rule-deduction is being done, however. I've been looking for the lemmatizer model here but could not find it. Tl;dr: what kind of lemmatization strategy is used when using |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 11 replies
-
Hmm, are you referring to having In terms of the docs, the provided base class is a mistake as compared to the source (the base class is currently The current rule-based lemmatizers are all hand-written. |
Beta Was this translation helpful? Give feedback.
Hmm, are you referring to having
TrainablePipe
as the base class in the docs? The "Trainable" flag next to that is correct: there is not currently a trainable lemmatizer.In terms of the docs, the provided base class is a mistake as compared to the source (the base class is currently
Pipe
), but theLemmatizer
class has been designed so that it can be extended in the future without changing the API, so it has an unusedModel
parameter.The current rule-based lemmatizers are all hand-written.