Skip to content
Discussion options

You must be logged in to vote

Hi, @AashishTiwari

Unfortunately, we don't support discontinuous spans, and we're not planning to. One way you can approach this problem
is to treat them as separate entities, perform NER / Span Categorization, then do the post-processing afterward (i.e., recombining the tokens that belong to the same entity using some rule or logic). Another option is to combine the entities, perform NER, and split them in post.

In addition, if you are to use spaCy for training, you can first convert your BRAT files into the ConLL format and use the convert command to turn them into spaCy files.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by adrianeboyd
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer
2 participants