NER with only the keyword? #10923
-
It appears that spaCy's NER only works accurately when the keyword is blended in a sentence, are there any methods to get accurate NER results by training a model with only the keyword? Or is it best to formulate random words to put the keyword in a sentence with? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
The golden rule of training data is that the more it is like your real data, the better your results will be. When reading a sentence, the context around the entities can be as important as the entities themselves. For example, if you say, "I went to XXX for vacation", we can guess that XXX is a location. If your training data doesn't have any of that context it will be hard to learn from. Building sentences with "random words" will not help. If you only have a list of keywords you may be able to use a rule based matcher or do weak supervision. |
Beta Was this translation helpful? Give feedback.
The golden rule of training data is that the more it is like your real data, the better your results will be.
When reading a sentence, the context around the entities can be as important as the entities themselves. For example, if you say, "I went to XXX for vacation", we can guess that XXX is a location. If your training data doesn't have any of that context it will be hard to learn from. Building sentences with "random words" will not help.
If you only have a list of keywords you may be able to use a rule based matcher or do weak supervision.