spaCy with German: passive and relative clauses are not being detected (token.dep_ == sbp, rc) #11094
How to reproduce the behaviour
I first list the dependency labels available in the parser, then call the nlp function on a text containing the sentences "Der Brief ist geschrieben worden." and "Der Brief wird geschrieben.", both of which are in the passive voice (Passiv). However, the dependency label assigned to "Brief" is sb (subject) and not sbp (passivized subject, PP).
The same happens with relative clauses and the rc label.
The version is 3.3.0. Can you check what the problem is here? The result is the same regardless of the model (lg, md, trf).
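For anyone who wants to reproduce this, here is a minimal sketch of the kind of check described above. It is not the original poster's code, which did not survive in this thread. The helper names `dep_table` and `report` are made up for illustration, and the model name `de_core_news_md` is an assumption (any installed German pipeline with a parser should work the same way).

```python
def dep_table(doc):
    """Return (token text, dependency label, head text) for each token."""
    return [(tok.text, tok.dep_, tok.head.text) for tok in doc]


def report(model_name="de_core_news_md"):
    """Print the parser's label set and the parse of the two passive sentences.

    Assumption: the model is installed, e.g. via
    `python -m spacy download de_core_news_md`.
    """
    import spacy  # imported lazily so dep_table can be used without spaCy

    nlp = spacy.load(model_name)
    # Labels the trained dependency parser is able to predict.
    print(nlp.get_pipe("parser").labels)

    doc = nlp("Der Brief ist geschrieben worden. Der Brief wird geschrieben.")
    for row in dep_table(doc):
        print(row)
```

Calling `report()` prints the label set first and then one row per token, which makes it easy to see which label each word actually received.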
Replies: 1 comment 1 reply
I think you're misunderstanding how these dependency labels are used.
`rc` is used to attach the head of the relative clause to the noun it's modifying ("Kinder" -> "spielen").
`sbp` is used for the "von X" phrase (logical subject / agent) in a passive sentence, not the grammatical subject of the passive sentence ("geschrieben" -> "von").
The grammatical subject ("Brief") still uses "sb" as in an active sentence.
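To see where these labels land in a parse, something along these lines could be used. This is only a sketch: `find_arcs` and `demo` are hypothetical helper names, the model name is an assumption, and the two example sentences are my own, chosen to contain a "von X" agent phrase and a relative clause. What the model actually predicts may differ from the annotation scheme.

```python
def find_arcs(doc, label):
    """Return (head text, child text) pairs for every arc labelled `label`."""
    return [(tok.head.text, tok.text) for tok in doc if tok.dep_ == label]


def demo(model_name="de_core_news_md"):
    """Print sbp and rc arcs for two illustrative sentences.

    Assumption: the named German model is installed.
    """
    import spacy  # imported lazily so find_arcs can be used without spaCy

    nlp = spacy.load(model_name)
    sentences = (
        # Passive with an explicit agent ("von X"), where sbp is expected.
        "Der Brief wird von dem Mann geschrieben.",
        # Noun modified by a relative clause, where rc is expected.
        "Die Kinder, die im Garten spielen, lachen.",
    )
    for text in sentences:
        doc = nlp(text)
        print(text)
        print("  sbp arcs:", find_arcs(doc, "sbp"))
        print("  rc arcs: ", find_arcs(doc, "rc"))
```

A passive sentence without a "von X" phrase, such as "Der Brief wird geschrieben.", has no agent to attach, so an empty list for `sbp` there is consistent with the scheme rather than a sign of a bug.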
Here's the full TIGER annotation manual (in German): https://www.ims.uni-stuttgart.de/documents/ressourcen/korpora/tiger-corpus/annotation/tiger_scheme-syntax.pdf
Obviously you'll run into some level of statistical errors in the models (see #3052), especially for sentences that look less like the training data (newspaper text), but…