Noun chunks returning empty list for the custom pipeline #11059
I have trained a custom pipeline with new NER and spancat components, and sourced the parser and tagger from an existing pipeline. Here is my config.
I have checked the syntax iterators in the language defaults, and they include noun chunks. The default model "en_core_web_sm" returns noun chunks for the same sentence.
The parser output is the same for both the default and the custom model.
I am not sure what I am missing here. I am not getting noun chunks or pos_ from the doc created with the custom model. Can you please help me find the problem? Thanks in advance.
Replies: 1 comment 1 reply
The noun chunks functionality depends on the UPOS tags (`doc.pos`), not the fine-grained tags (`doc.tag`). You need to include the AttributeRuler to get those. You can do that by sourcing the AttributeRuler from `en_core_web_sm` like you did the Tagger, though note it doesn't need `replace_listeners`.

For example, this config, which just sources components from the pretrained pipeline, works:
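A minimal sketch of such a config, assuming spaCy v3 and an installed `en_core_web_sm` package. The component list is illustrative and is not the config from the original reply; a custom pipeline would add its own NER and spancat blocks alongside these.

```ini
# Sketch only: every component below is sourced from en_core_web_sm (an assumption).
[nlp]
lang = "en"
pipeline = ["tok2vec","tagger","parser","attribute_ruler"]

[components]

# Shared embedding layer that the sourced tagger and parser listen to.
[components.tok2vec]
source = "en_core_web_sm"

# Predicts the fine-grained tags (token.tag).
[components.tagger]
source = "en_core_web_sm"

# Predicts the dependency parse that noun chunks are built from.
[components.parser]
source = "en_core_web_sm"

# Maps the fine-grained tags to UPOS (token.pos); no replace_listeners needed.
[components.attribute_ruler]
source = "en_core_web_sm"
```

With the `attribute_ruler` placed after the tagger, `token.pos_` should be populated, which is what the noun chunk iterator relies on.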