Resume Training Through The Command Line #9381
-
Hi, I am trying to finetune the spacy en_core_web_lg model with some additional examples through the command-line interface. I receive this error message when I run the training command. OSError: [E884] The pipeline could not be initialized because the vectors could not be found at './data/processed/en_vectors/'. If your pipeline was already initialized/trained before, call 'resume_training' instead of 'initialize', or initialize only the components that are new. I previously stored the vectors in the en_core_web_lg model by using vectors.to_disk() but I'm wondering if this was the correct way to store them when trying to then finetune from the command line. My config file is below.
Any help would be much appreciated. Thanks |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hello markFriel, When fine-tuning a model, be aware of the catastrophic forgetting problem (https://explosion.ai/blog/pseudo-rehearsal-catastrophic-forgetting) |
Beta Was this translation helpful? Give feedback.
Hello markFriel,
You can add the vectors directly by using "en_core_web_lg" instead ->
vectors = "en_core_web_lg"
When fine-tuning a model, be aware of the catastrophic forgetting problem (https://explosion.ai/blog/pseudo-rehearsal-catastrophic-forgetting)