I don't know enough about Norwegian to know for sure what the best analysis is for this sentence, but if you take a look at the training data (UD_Norwegian-Bokmaal), you can see that the counts for UPOS / DEP labels for the token "ute" look like this:

      4 ute	ADP	advcl
     19 ute	ADP	case
      5 ute	ADP	compound:prt
      3 ute	ADP	conj
      5 ute	ADP	nmod
      6 ute	ADP	obl
      4 ute	SCONJ	mark

Given this, it's not surprising that the spaCy model predicts ADP or obl. The wide range of possible dependency labels suggests that this word might have more than one usage or meaning, and with so much variation in the annotation the model is going to have difficulty getting this c…
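
For reference, counts like the ones above can be reproduced with a few lines of Python over a CoNLL-U file. This is only a rough sketch, not necessarily how the numbers above were produced: `count_labels` is a hypothetical helper, the match on the token form is assumed to be exact and case-sensitive, and the sample lines are made up for illustration rather than taken from the treebank.

```python
from collections import Counter


def count_labels(conllu_lines, form):
    """Count (UPOS, DEPREL) pairs for tokens whose FORM equals `form`.

    Hypothetical helper; assumes standard 10-column, tab-separated CoNLL-U.
    """
    counts = Counter()
    for line in conllu_lines:
        line = line.rstrip("\n")
        # Skip sentence separators and comment lines such as "# text = ..."
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        token_id, token_form, upos, deprel = cols[0], cols[1], cols[3], cols[7]
        # Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1")
        if "-" in token_id or "." in token_id:
            continue
        if token_form == form:
            counts[(upos, deprel)] += 1
    return counts


# Made-up sample for illustration only, not real treebank data.
sample = [
    "# text = illustrative only",
    "1\tHan\than\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\ter\tvære\tAUX\t_\t_\t0\troot\t_\t_",
    "3\tute\tute\tADP\t_\t_\t2\tobl\t_\t_",
    "",
    "1\tute\tute\tADP\t_\t_\t2\tcase\t_\t_",
]

counts = count_labels(sample, "ute")
for (upos, deprel), n in sorted(counts.items()):
    print(f"{n:7d} ute\t{upos}\t{deprel}")
```

To run this on the real training data, pass an open file handle instead of `sample`, for example `count_labels(open(path, encoding="utf-8"), "ute")`, where `path` points at the training `.conllu` file from a local copy of UD_Norwegian-Bokmaal.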

Answer selected by adrianeboyd
Labels
lang / nb Norwegian (Bokmål) language data and models