Labelling entities in the training data for NER #9966
-
Hi. So I am training a custom NER model in spacy for identifying custom entities in my data. Consider the following text:
To be recognized and labelled as:
I am having trouble understanding how to tag this data using BILOU tagging approach?Should I instead use an entity ruler and add it as a pattern? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 11 replies
-
NER is only for contiguous spans of text. So you can't get output like you want using IOB labels or the NER model (or matcher) alone. What you can do is annotate the whole span of interest ( |
Beta Was this translation helpful? Give feedback.
NER is only for contiguous spans of text. So you can't get output like you want using IOB labels or the NER model (or matcher) alone.
What you can do is annotate the whole span of interest (
Nifty ... indices
) and then use post-processing to manually convert that string to the strings you want. The grammatical structure in your example is known as "coordination" and is pretty complicated, but if you focus on a few kinds of examples you want to deal with you may be able to handle it using dependency information.