Best method for matching many words and their lemmas #10933
Unanswered
nlovell1
asked this question in
Help: Coding & Implementations
Replies: 1 comment 2 replies
-
The PhraseMatcher allows you to match phrases, but only on one attribute. The Matcher allows for very versatile token-based matching. The EntityRuler is primarily a wrapper for either/both of the above so you can use it as a pipeline component. It also has the small extra feature that it allows setting entity IDs. If you just want to match phrases based on lemmas I'd recommend using the PhraseMatcher via the EntityRuler. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi-
Suppose I want to detect, for a list of multi word phrases, each time they appear in a body of text, allowing for changes in inflection to be still matched (so, effectively matching by lemma in the correct order).
I have read through the docs and see many options, including but not limited to, using phrase matcher with entity ruler, token matcher with entity ruler, and both of these directly without the entity ruler.
What are the consequences of using each, and, do they produce the same result for my purpose?
Beta Was this translation helpful? Give feedback.
All reactions