I'd been looking for an answer for quite some time and finally figured out how to do this (it turned out to be very simple), so I'm posting my approach here in case other people want to share their insights:

import spacy

text = 'EU rejects German call to boycott British lamb.'
nlp = spacy.load('en_core_web_sm')
doc = nlp(text)
# (start_char, end_char, label) for each gold entity
offsets = [(0, 2, 'ORG'), (11, 17, 'MISC'), (34, 41, 'MISC')]
# char_span maps character offsets onto token boundaries
spans = [doc.char_span(x[0], x[1], label=x[2]) for x in offsets]
doc.spans['gold'] = spans
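
One caveat worth noting: if a character offset doesn't line up exactly with a token boundary, char_span returns None. A minimal sketch of a safer loop, assuming spaCy v3's alignment_mode argument and the doc/offsets from the snippet above:

spans = []
for start, end, label in offsets:
    # 'expand' snaps misaligned offsets outward to the nearest token boundaries;
    # the default 'strict' mode would return None instead
    span = doc.char_span(start, end, label=label, alignment_mode='expand')
    if span is not None:
        spans.append(span)
doc.spans['gold'] = spans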
  • Another approach I found was to apply only spaCy's Tokenizer:
from spacy.lang.en import English
from spacy.tokens import Doc

tokenizer = English().tokenizer      # blank English pipeline, tokenizer only
words = [token.text for token in tokenizer(text)]
doc = Doc(tokenizer.vocab, words=words)
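
If the goal is to turn these annotations into training data, one option (a sketch, assuming spaCy v3) is to let Example.from_dict do the offset-to-token alignment, reusing the same text and offsets as above:

import spacy
from spacy.training import Example

nlp = spacy.blank('en')
doc = nlp.make_doc(text)                  # tokenization only, no trained model needed
# 'entities' takes the same (start_char, end_char, label) tuples as above
example = Example.from_dict(doc, {'entities': offsets})
print(example.reference.ents)             # gold entities aligned to tokens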
