Skip to content
Discussion options

You must be logged in to vote

U- is used internally for blocked entities, which are tokens that are marked as not being allowed to be part of any entity, so ner won't predict any entity spans that include them. This feature isn't used that frequently, but it's supported by Doc.set_ents and ner.

The list from move_names is only for display to the user (not for anything internal in the parser), so my guess is that it's removed here to avoid confusion related to an "invalid-looking" tag that users wouldn't be expecting.

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@nikita-galileo
Comment options

@adrianeboyd
Comment options

Answer selected by svlandeg
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / ner Feature: Named Entity Recognizer
2 participants
Converted from issue

This discussion was converted from issue #10149 on January 28, 2022 08:28.