Size of P2 Dataset #2
Thanks for creating this benchmark!
The paper states:
"To this end, we maintained a manageable dataset size by using 100 templates and generating 50 samples per template, resulting in 5000 total examples for each benchmark."
https://arxiv.org/pdf/2410.05229
However, the P2 dataset in the repository, generated_data/GSM_p2.jsonl, contains only 2500 instances.
Is 2500 the expected number of instances in P2, or is some of the data missing?
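For anyone who wants to reproduce the count, here is a minimal sketch of one way to check it. It assumes the file is standard JSONL (one JSON object per non-empty line); the helper name `count_jsonl_records` is hypothetical and not part of the repository, and the expected total is just the 100 x 50 arithmetic from the paper's description.

```python
import json
from pathlib import Path


def count_jsonl_records(path):
    """Count the non-empty lines in a JSONL file, checking each parses as JSON.

    Hypothetical helper for illustration; assumes one JSON object per line.
    """
    count = 0
    with Path(path).open(encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:
                continue  # skip blank lines (e.g. a trailing newline)
            json.loads(line)  # raises ValueError if the record is malformed
            count += 1
    return count


# 100 templates x 50 samples per template, as described in the paper.
EXPECTED = 100 * 50

if __name__ == "__main__":
    path = Path("generated_data/GSM_p2.jsonl")
    if path.exists():
        n = count_jsonl_records(path)
        print(f"{n} records found, {EXPECTED} expected")
    else:
        print(f"{path} not found; run this from the repository root")
```

Comparing the printed count against 5000 shows whether the file matches the size described in the paper.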