Skip to content

Size of P2 Dataset #2

@kl2806

Description

@kl2806

Thanks for creating this benchmark!

To this end, we maintained a manageable dataset size by using 100 templates and generating
50 samples per template, resulting in 5000 total examples for each benchmark.
https://arxiv.org/pdf/2410.05229

The dataset in the repository for P2 generated_data/GSM_p2.jsonl contains 2500 instances.

Is 2500 the expected number of instances in P2, or is some of the data missing?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions