EntityRuler and NER pipe configuration #10190
Hi all! First of all, I'm really loving spaCy 3. Let me describe the situation. The general idea is that, given a text, I want to recognize only one type of entity, so:
Now, the problem is that when I train with this configuration I'm using training data generated by the same How can I approach this configuration-wise? Do I need to have two configs? One for the Am I approaching the problem from an extremely wrong angle? Thanks very much.
Replies: 1 comment 2 replies
It could make sense to have two configs, one for training and one for the full pipeline. `spacy assemble` was made for this use case.

On the other hand, if you can include the initialized Entity Ruler at train time, you can just leave it there - if it's not an annotating component (as specified in the config) then the NER component won't see its annotations, so it just won't do anything. You can also mark it as a disabled component during training to keep it from running. This way you can use just one config and just override the `disabled_components` setting with your training command.

The double NER sample project might be a helpful reference, though it's combining multiple NER components.
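
To make the single-config option concrete, here is a rough sketch of the relevant parts of a `config.cfg`. It is a fragment, not a complete training config, and it rests on a few assumptions that are not in the thread: the component names `entity_ruler` and `ner`, the language, and the pipeline order are placeholders. Also, as far as I know, the setting the reply calls `disabled_components` is written as the `disabled` list under `[nlp]` in a spaCy v3 config, and the list of annotating components lives under `[training]`.

```ini
# Fragment only: the remaining sections of a full training config are omitted.
# Component names, language, and pipeline order are assumptions for illustration.

[nlp]
lang = "en"
pipeline = ["entity_ruler","ner"]
# Components listed here stay part of the pipeline but are not run.
# Set to [] (or override it) to get the full pipeline back.
disabled = ["entity_ruler"]

[components.entity_ruler]
factory = "entity_ruler"

[components.ner]
factory = "ner"

[training]
# entity_ruler is deliberately left out of this list, so even when it is
# enabled its annotations are not set on the docs that ner is trained on.
annotating_components = []
```

With a layout like this, the same file could serve both purposes: keep `disabled = ["entity_ruler"]` for training, and switch it to an empty list for the full pipeline. spaCy's CLI accepts config overrides in dotted form, so the key would presumably be addressed as `--nlp.disabled` on the training command, but check the exact value syntax against the CLI docs before relying on it.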