UserWarning: [W108] The rule-based lemmatizer did not find POS annotation for one or more tokens. Check that your pipeline includes components that assign token.pos, typically 'tagger'+'attribute_ruler' or 'morphologizer'. #12886
-
I am getting the warning in the title. When I look at the referenced file spacy/pipeline/lemmatizer.py:211, it appears this warning means that a tokens POS is an empty string ""?
I added this to my function that creates my training data and it never prints anything, so I am a bit confused why I keep getting this warning?
My pipeline looks like this: 'tok2vec', 'tagger', 'attribute_ruler', 'lemmatizer', 'ner' |
Beta Was this translation helpful? Give feedback.
Answered by
micmizer
Aug 5, 2023
Replies: 1 comment
-
I think I just figured this out. My attribute ruler config was like this
And after changing that to this I no longer get the warnings:
|
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
adrianeboyd
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I think I just figured this out. My attribute ruler config was like this
And after changing that to this I no longer get the warnings: