-
Notifications
You must be signed in to change notification settings - Fork 162
Open
Description
Hello, in 'nq_eval.py' it is mentioned that "Each prediction should be provided with a long answer score, and a short answers score".
May I clarify what these scores refer to? Are these scores supposed to represent the confidence of the model's predictions, or is there a fixed method to obtain scores?
For example, can we define the 'score' to simply be the sum of the start and end logits of the prediction?
Lastly, are scores also required for null predictions?
Thank you very much!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels