Skip to content
Discussion options

You must be logged in to vote

Hello, the English models use a rule-based lemmatizer based on the POS, but POS can be incorrect, or the rules might not be 100% correct in all cases. The accuracy also depends on whether you run the lemmatizer on short paragraphs or whole sentences. Here, you can read more about how the lemmatizer works and how the token.pos influences the results.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by thomashacker
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / lemmatizer Feature: Rule-based and lookup lemmatization perf / accuracy Performance: accuracy
2 participants