pretraining to init_tok2vec (textcat) - ValueError: Cannot deserialize model: mismatched structure #12922
Unanswered
barebra
asked this question in
Help: Model Advice
Replies: 3 comments 6 replies
-
Hi @barebra! Yes, please provide your stack trace/error log and your config file. |
Beta Was this translation helpful? Give feedback.
5 replies
-
Hi @barebra, Sorry that this has been confusing. Just to double check - have you been using the exact same config file for both your pretraining and training steps? You really should only need one file, in which the |
Beta Was this translation helpful? Give feedback.
0 replies
-
Hi Sofie,
yes, I.m using the config.cfg - that was created from spaCy during pretraining (based upon the config_pretrain.cfg) and is stored in the "pretraining" directory together with the model.bins and the log.json - for training.
Yesterday I tried a "positive control" for the pretraining2training task, and it worked: I used the word vectors from spaCy's en_core_web_lg (300 dimensions) in combination with default settings of pretraining (e.g. spacy.PretrainCharacters.v1, hidden_size=300 etc.).
The word vectors I would like to use are from en_core_sci_lg (Allen AI, 200 dimensions). So, I assume I need to change hidden_size to 200 as well? Correct?
But, using the "positive control", I will also change certain parameters, e.g. pretraining.component and pretraining.layer, trying to reproduce the error.
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I'm currently pretraining with 170.000 PubMed-abstracts (component: "textcat", layer: "tok2vec"). Epoch-loss is still decreasing and I just wanted to run a first pilot-experiment (train/evaluate) with an intermediate model (model75.bin). I learned from this forum that the config for training has to resemble the one of pretraining, so I copy/pasted the [pretraining]-section from the config_pretrain.cfg to the config.cfg and started spacy train.
Anyway, there seems to be a problem with mismatched structure (see title). Can you help me? Do you need more information (error log, config.cfg)?
Beta Was this translation helpful? Give feedback.
All reactions