Adding lemmatizer to trained NER model #10619
-
I am training a NER model that also needs a lemmatizer. I initialized the lemmatizer with the source model en_core_web_sm, since that component works fine as-is and does not need to be trained for my use case. After training, the NER works fine, but the lemmatizer gives different results than the en_core_web_sm model itself. Is something wrong in my config file (attached here as .txt but used as .cfg), and can I fix this without having to retrain the entire NER model? I see there is a lemmatizer file in my model, trained_model/lemmatizer/lookups/lookups.bin, so would it be possible to replace this with the file used by en_core_web_sm? If so, where could I find this file?
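For reference, a minimal sketch of how sourcing the lemmatizer typically looks in a spaCy v3 config (illustrative only, not the poster's actual file):

```ini
[components.lemmatizer]
# reuse the lemmatizer from the packaged pipeline instead of training a new one
source = "en_core_web_sm"

[training]
# sourced components that should not be updated during training
frozen_components = ["lemmatizer"]
```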
Replies: 1 comment 1 reply
-
Hello,
you also need to source the `tok2vec` layer from the pre-trained model.
-
You'll also need to add the `tok2vec` component to the `pipeline` variable in the `[nlp]` section:
`pipeline = ["senter", "transformer", "tok2vec", "parser", "tagger", "attribute_ruler", "entity_ruler", "ner", "lemmatizer"]`
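For illustration, a minimal sketch of how that could look in the config, assuming both the tok2vec layer and the lemmatizer are sourced from en_core_web_sm and should stay unchanged while the NER component is trained (the pipeline list here is shortened; use whatever components your model actually needs):

```ini
[nlp]
lang = "en"
pipeline = ["tok2vec", "ner", "lemmatizer"]

[components.tok2vec]
# reuse the trained tok2vec layer from the pre-trained model
source = "en_core_web_sm"

[components.lemmatizer]
# reuse the lemmatizer (and its lookups) from the pre-trained model
source = "en_core_web_sm"

[training]
# keep the sourced components frozen while training NER
frozen_components = ["tok2vec", "lemmatizer"]
```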