[E143] Labels for component 'tagger' not initialized. #11534
-
Hello, I'm trying to build an Italian model to use for commercial purposes as the ones provided have a restrictive license but get E143 when trying to train, this is the corpus I'm using: https://github.com/UniversalDependencies/UD_Italian-MarkIT This is the full output I get:
This is my config:
spacy info output: ============================== Info about spaCy ==============================
I tested the code provided in a similar github issue showing that there are tags in the document.
Part of output:
Thanks for the help. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
No errors jump out at me in the data or the config and you've checked the right things when inspecting the One additional check to run would be A few additional notes, not related to this particular error:
|
Beta Was this translation helpful? Give feedback.
No errors jump out at me in the data or the config and you've checked the right things when inspecting the
DocBin
. This error does occur if there are no tags in the training data or if there are accidentally no documents at all in the.spacy
file, which can happen sometimes if something goes wrong in the conversion.One additional check to run would be
spacy debug data
with the exact same config and corpus settings as forspacy train
, just in case there's an incorrect.spacy
path that might be leading to the initialization error?A few additional notes, not related to this particular error:
spacy convert -T
when converting from CoNLL-U to merge the multiword tokens, since spacy's def…