[Spacy nightly] Error in parsing docbin: "Error -3 while decompressing data: incorrect header check" #6390
How to reproduce the behaviour
I exported some data from Prodigy. Then I tried to train an NER model with the latest transformer models from Spacy 3.0.0rc2, following the instructions here to generate a config and run "spacy train". When I train the model, I get the following error. It looks like there's an issue parsing the spacy data. Is this because Spacy 2 and Spacy 3 docbin formats are incompatible?
Your Environment
Spacy Nightly
Spacy 2
Replies: 2 comments
Thanks, that fixed it!
Hi, the names for the commands have ended up a bit confusing here, sorry. For prodigy v1, the data-to-spacy command outputs the spacy v2 training format, which is .json instead of .spacy (DocBin).

You need to convert the data-to-spacy output to .spacy for v3 using spacy v3's convert command:

    spacy convert data.json .

This will save data.spacy in the current directory.

We should improve this error message so it's clearer what's going on when you try to load JSON as DocBin.
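For anyone hitting the same message: it appears to come from zlib, which fits the explanation above if the .spacy (DocBin) format is zlib-compressed (an assumption based on the error text, not something checked against the spaCy source here). Below is a minimal standard-library sketch of a pre-flight check that guesses whether a file's bytes are compressed or plain JSON. The function name guess_training_format and its return labels are hypothetical and not part of spaCy or Prodigy.

```python
import json
import zlib


def guess_training_format(data: bytes) -> str:
    """Roughly classify raw file bytes as 'compressed', 'json', or 'unknown'.

    Hypothetical helper for illustration only; it is not a spaCy API.
    """
    # Plain JSON text is not a valid zlib stream, so decompression fails
    # with zlib.error, the same family of error as in the thread title.
    try:
        zlib.decompress(data)
        return "compressed"
    except zlib.error:
        pass

    # Not compressed: check whether the bytes parse as JSON instead.
    try:
        json.loads(data.decode("utf8"))
        return "json"
    except (UnicodeDecodeError, ValueError):
        return "unknown"


if __name__ == "__main__":
    # Stand-in byte strings; in practice you would read the bytes of
    # your own training file, e.g. Path("data.json").read_bytes().
    print(guess_training_format(b'[{"id": 0, "paragraphs": []}]'))
    print(guess_training_format(zlib.compress(b"some binary payload")))
```

If the check reports "json" for a file you are passing to spacy train, that suggests the file still needs to go through spacy convert as described above.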