Holding conflicting predictions - with Doc.ents
or Doc.spans
?
#7512
-
Hi, In SpaCy v3 there are two ways to hold span-based predictions I see: What's the reason to having both? I see that a Text: i need a hotel in sat Thanks, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
You could represent We are working on a new statistical pipeline component that can predict overlapping spans, but it's still in progress. It will predict spans in a span group rather than predicting token-level IOB. Right now the closest you can get is running multiple NER components that each predict a different set of non-overlapping entity types as |
Beta Was this translation helpful? Give feedback.
doc.ents
corresponds to the token-levelENT_IOB
andENT_TYPE
annotations that are used by the built-in NER component (EntityRecognizer
), which can only predict non-overlapping IOB tags.You could represent
doc.ents
with a span group, but as soon as your span group has overlapping spans, you can't convert it to the token-level IOB format.We are working on a new statistical pipeline component that can predict overlapping spans, but it's still in progress. It will predict spans in a span group rather than predicting token-level IOB.
Right now the closest you can get is running multiple NER components that each predict a different set of non-overlapping entity types as
doc.ents
. Before each …