
No errors jump out at me in the data or the config, and you've checked the right things when inspecting the DocBin. This error does occur if there are no tags in the training data, or if the .spacy file accidentally contains no documents at all, which can happen if something goes wrong in the conversion.
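A rough way to check for both failure modes (no documents, no tags) is sketched below. It assumes spaCy v3 is installed and that the corpus is Italian; the helper names `summarize_docs` and `load_docs` and the path `train.spacy` are made up for illustration, not part of spaCy.

```python
from collections import Counter


def summarize_docs(docs):
    """Count documents, tokens, and non-empty fine-grained tags."""
    n_docs = 0
    n_tokens = 0
    tags = Counter()
    for doc in docs:
        n_docs += 1
        for token in doc:
            n_tokens += 1
            # token.tag_ is an empty string when no tag was set
            if token.tag_:
                tags[token.tag_] += 1
    return {"docs": n_docs, "tokens": n_tokens, "tags": tags}


def load_docs(path, lang="it"):
    """Yield the Doc objects stored in a .spacy file (requires spaCy)."""
    import spacy
    from spacy.tokens import DocBin

    nlp = spacy.blank(lang)  # only the vocab is needed to deserialize
    doc_bin = DocBin().from_disk(path)
    yield from doc_bin.get_docs(nlp.vocab)


# Hypothetical usage; replace the path with your own converted file:
# summary = summarize_docs(load_docs("train.spacy"))
# print(summary["docs"], summary["tokens"], summary["tags"].most_common(5))
```

If `docs` is 0 or `tags` is empty, the conversion step is the most likely place to look.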

One additional check would be to run spacy debug data with exactly the same config and corpus settings as for spacy train, in case an incorrect .spacy path is causing the initialization error.
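One way to make sure both commands really see the same settings is to build them from a single set of arguments, roughly as below. This is only a sketch: `build_commands` and `missing_paths` are hypothetical helpers, the file names are placeholders, and it assumes the corpus paths are passed as `--paths.train` / `--paths.dev` overrides rather than hard-coded in the config.

```python
import sys
from pathlib import Path


def build_commands(config, train, dev):
    """Return (debug data, train) command lines sharing identical overrides."""
    overrides = ["--paths.train", str(train), "--paths.dev", str(dev)]
    base = [sys.executable, "-m", "spacy"]
    debug_cmd = base + ["debug", "data", str(config)] + overrides
    train_cmd = base + ["train", str(config)] + overrides
    return debug_cmd, train_cmd


def missing_paths(*paths):
    """Return the given paths that do not point to an existing file."""
    return [str(p) for p in paths if not Path(p).is_file()]


# Hypothetical usage with placeholder file names:
# import subprocess
# files = ("config.cfg", "train.spacy", "dev.spacy")
# print("missing:", missing_paths(*files))
# debug_cmd, train_cmd = build_commands(*files)
# subprocess.run(debug_cmd, check=True)
```

A mistyped .spacy path shows up in `missing_paths` before either command runs.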

A few additional notes, not related to this particular error:

  • use spacy convert -T when converting from CoNLL-U to merge the multiword tokens, since spacy's def…

Answer selected by funghetto
Labels
lang / it Italian language data and models training Training and updating models