Skip to content
Discussion options

You must be logged in to vote

If the doc is fully parsed, it's better not to have blank dependencies (for technical reasons, since heads don't have an "unset" value), but it would make sense for these to be treated differently. It looks like v2 had some hard-coded behavior for space tokens that was removed in v3. We usually use the placeholder label dep for cases like this, but you could also use whatever label you'd like.

In v3 these kinds of rule-based exceptions have been moved into the attribute_ruler. You can modify this rule related to all whitespace tokens in the attribute_ruler to do this, just add "DEP": "dep" (or whichever label you'd like) to the assigned attrs:

{'patterns': [[{'IS_SPACE': True}]], 'attrs': {

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@yangfan0356
Comment options

Comment options

You must be logged in to vote
1 reply
@yangfan0356
Comment options

Answer selected by polm
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / parser Feature: Dependency Parser
3 participants