Skip to content
Discussion options

You must be logged in to vote

The model-best is saved based on performance on the dev set. To be clear, the issue is that you have models that perform well on the dev set despite having high loss (= performing poorly on the training set)? That sounds like a really strange thing to happen.

There isn't a parameter for this, and the state of being best or not is held by the process rather than on disk, so I'm not sure it'd be easy to customize. You might want to look at hte spacy/training/loop.py code to see how that works - since it is Python you can modify it without recompiling if you need to.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@Lolologist
Comment options

Answer selected by Lolologist
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
training Training and updating models
2 participants