SpanCategorizer for default NER models #9644

monWork · 2021-11-08T19:47:34Z

monWork
Nov 8, 2021

Is there a way we can use can we use SpanCategorizer component to get the confidence scores from the default trained models such as en_core_web_trf and en_core_web_md ?

We don't want to train new models, we just need the confidence scores for NER and POS from the default models, how it can be done ?

Answered by polm

Nov 9, 2021

You cannot get confidence scores from the default models without training anything. The tagger is not a model that generates a meaningful confidence score, it isn't structured like that. It's like asking for feathers from a horse.

If you have access to OntoNotes you can train a spancat with the same data the NER models were trained with. If you do not have access to OntoNotes, you have another option, though I would not recommend it. You can use the pretrained models to annotate a lot of text and use those annotations as training data for a spancat model. For NER this might work acceptably if you use enough text. I don't think this would work well for POS.

View full answer

polm · 2021-11-09T04:29:48Z

polm
Nov 9, 2021

You cannot get confidence scores from the default models without training anything. The tagger is not a model that generates a meaningful confidence score, it isn't structured like that. It's like asking for feathers from a horse.

If you have access to OntoNotes you can train a spancat with the same data the NER models were trained with. If you do not have access to OntoNotes, you have another option, though I would not recommend it. You can use the pretrained models to annotate a lot of text and use those annotations as training data for a spancat model. For NER this might work acceptably if you use enough text. I don't think this would work well for POS.

1 reply

monWork Nov 10, 2021
Author

Thanks for details

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

SpanCategorizer for default NER models #9644

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

SpanCategorizer for default NER models #9644

Uh oh!

monWork Nov 8, 2021

Replies: 1 comment · 1 reply

Uh oh!

polm Nov 9, 2021

Uh oh!

monWork Nov 10, 2021 Author

monWork
Nov 8, 2021

Replies: 1 comment 1 reply

polm
Nov 9, 2021

monWork Nov 10, 2021
Author