Skip to content

Clarification on Vidore Test Set: Are answer annotations used during training/evaluation? #179

@Yingshu-Li

Description

@Yingshu-Li

Hi, I have a quick question about the Vidore test set.

When checking the preprocessing code (e.g., for vidore_arxivqa), the query is the question, and the candidate is constructed as prompt + image. It seems that the answer text of the question is never used in either training or evaluation.

Just want to double-check:
• Is it expected that the answer field is not used at all?
• So the model only learns from (question → image) relevance, without using the actual answer?

I want to confirm whether this is the intended design. Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    RepliedThe author team has replied to the issue and will wait for any further discussion before closing it.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions