Skip to content

Missing subsets in lhq dataset #28

@AlbertiPot

Description

@AlbertiPot

@loubnabnl Hi, thanks for your wonderful work, especially for those great datasets release.

When I went through the LHQ dataset from the extras release, it seems that all of the data in the huggingface dataset are from the deepmind-math, while there are no other mentioned data like GSM8K, APPS and etc.

I have raised an issue here https://huggingface.co/datasets/bigcode/starcoder2data-extras/discussions/4, which has my data investigation details.

Many thanks and best regards.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions