What is "loss" when it comes to tok2vec #10667
-
I couldn't find quite a clear answer on this - an NER pipeline with a tok2vec component will report tok2vec loss, but I'm not clear what the objective for tok2vec is. Is it the loss from the NER objective with respect to the tok2vec component? Or is there something separate being optimized there? The FAQ describes loss as the "work remaining for the optimizer", but this seems like a way of describing loss more generally. The docs describe "tok2vec predictions", but what is tok2vec predicting? Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
Hi @bpben , During training, the tok2vec loss is just from other components (it's not separate). You can find more information in the model architectures documentation. |
Beta Was this translation helpful? Give feedback.
Hi @bpben ,
During training, the tok2vec loss is just from other components (it's not separate). You can find more information in the model architectures documentation.