We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 0c90721 commit e916aa4Copy full SHA for e916aa4
lm_eval/tasks/aime/aime.yaml
@@ -9,6 +9,7 @@ fewshot_split: train
9
test_split: train
10
doc_to_text: "Question: {{Question}}\nAnswer:"
11
doc_to_target: "{{Answer}}"
12
+process_results: !function utils.process_results
13
metric_list:
14
- metric: exact_match
15
aggregation: mean
0 commit comments