Training a TextCat with different weights between FP and FN. #11256
-
Hi everyone! I used the quickstart configuration from https://spacy.io/usage/training#quickstart with English / textcat / CPU / accuracy.
I'm just not sure this approach is what I want.
If you need any more details, I'll be happy to provide them.
-
What you should probably start with:

- `en_core_web_lg`, with no further changes to the config.
- `tok2vec` for a `textcat` component. Don't use the `tok2vec` from `en_core_web_lg`.

After training, you could try out `spacy report` (https://spacy.io/universe/project/spacy-report) to experiment with the threshold. The threshold is only used for scoring; it doesn't affect the training process itself or the annotations saved to `doc.cats`, which are always the scores for all categories.

There's a bit of confusing duplication in the settings, so to modify the threshold after training if you want to use `spacy evaluate` w…
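
Since `doc.cats` always holds the raw scores for all categories, one way to trade off false positives against false negatives is to apply your own threshold after prediction. Below is a minimal sketch in plain Python, not spaCy's own scoring code. The helper names (`count_errors`, `best_threshold`), the `POSITIVE` label, the weights, and the example scores are all hypothetical; the dicts only mimic the shape of `doc.cats`.

```python
from typing import Dict, List, Optional, Tuple

# Each item pairs a dict shaped like doc.cats with a gold label (assumed format).
Scored = List[Tuple[Dict[str, float], bool]]


def count_errors(scored: Scored, label: str, threshold: float) -> Tuple[int, int]:
    """Return (false_positives, false_negatives) for `label` at `threshold`."""
    fp = fn = 0
    for cats, gold in scored:
        predicted = cats.get(label, 0.0) >= threshold
        if predicted and not gold:
            fp += 1
        elif gold and not predicted:
            fn += 1
    return fp, fn


def best_threshold(
    scored: Scored,
    label: str,
    fp_weight: float = 1.0,
    fn_weight: float = 1.0,
    candidates: Optional[List[float]] = None,
) -> float:
    """Pick the candidate threshold with the lowest weighted error cost."""
    if candidates is None:
        # Assumed grid: 0.05, 0.10, ..., 0.95
        candidates = [i / 20 for i in range(1, 20)]

    def cost(threshold: float) -> float:
        fp, fn = count_errors(scored, label, threshold)
        return fp_weight * fp + fn_weight * fn

    # On ties, min() keeps the first (lowest) candidate.
    return min(candidates, key=cost)


# Made-up scores and gold labels, for illustration only.
examples: Scored = [
    ({"POSITIVE": 0.9, "NEGATIVE": 0.1}, True),
    ({"POSITIVE": 0.6, "NEGATIVE": 0.4}, False),
    ({"POSITIVE": 0.4, "NEGATIVE": 0.6}, True),
    ({"POSITIVE": 0.2, "NEGATIVE": 0.8}, False),
]

# Penalise false negatives five times as much as false positives.
recall_oriented = best_threshold(examples, "POSITIVE", fp_weight=1.0, fn_weight=5.0)
# Penalise false positives five times as much as false negatives.
precision_oriented = best_threshold(examples, "POSITIVE", fp_weight=5.0, fn_weight=1.0)
```

With real data, you would build `scored` from `nlp(text).cats` on a held-out set, and raise `fn_weight` when a missed positive costs more than a false alarm (or `fp_weight` in the opposite case).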