Skip to content

Abount the detail of the dataset. #1

@nzjin

Description

@nzjin

I find there are several fields of each sample in the dataset, which are dict_keys(['question_id', 'original_table_id', 'question', 'sql', 'answer', 'hard_positive_idx', 'positive_idx', 'negative_idx', 'dataset']).

  1. Could you provide a comment about the meaning of each field?
  2. Literally, positive idx means the index of relevant tables, however, I find some positive_idx are empty lists ([]), does it mean there are no relevant tables for this question? and how do you feed empty tables into the encoder?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions