Training NER on Incomplete Annotations #11114
-
Hello, does As in, we will start annotation entity type A, B. But then we want to add entity types C, and D. We ideally don't want to re-do the annotations with entity types A, B, C, and D. It's been mentioned a few other places. I believe Prodigy had the ability train on incomplete annotations (e.g. binary accept/reject, using the I wonder if there is a setting in the config we can make to signify incomplete annotations? I think I saw somewhere where we can label words as Thanks! Links |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
You can train a model from partial NER annotation, but it's intended for segments of docs where there is no entity annotation at all rather than annotation for a subset of entity types. You can set "missing" NER annotation for spans of a doc with I don't think it's going to work well in practice for your example case, since it would mean that your partially-annotated docs could only include the entity spans and not The binary accept/reject annotation is used with |
Beta Was this translation helpful? Give feedback.
You can train a model from partial NER annotation, but it's intended for segments of docs where there is no entity annotation at all rather than annotation for a subset of entity types.
You can set "missing" NER annotation for spans of a doc with
Doc.set_ents(missing=spans)
or useNone
as the IOB tag with the constructorDoc(ents=["O", None, "B-ENT", ...])
.I don't think it's going to work well in practice for your example case, since it would mean that your partially-annotated docs could only include the entity spans and not
O
, when the model really needs both to learn well.The binary accept/reject annotation is used with
incorrect_spans_key
, but this is to indicate that a particular sp…