Skip to content

[Discussion]Add dedicated display for RTEB benchmark results #3095

@q275343119

Description

@q275343119

Description of the feature

We need to display RTEB results in the interface.
To support this, we are opening this issue to discuss the related changes before finalizing the implementation.

A demo of the current implementation can be found here:
https://huggingface.co/spaces/SmileXing/leaderboard

There are currently three points that need discussion:

  1. Data filtering

    • When the benchmark is RTEB, only RTEB model results are displayed.

    • Question: How should we determine which models belong to RTEB and should be shown?

  2. Field adjustments

    • When the benchmark is RTEB, the zero-shot field is hidden (including the zero-shot option in advanced model filters).

    • Question: What do you think about this adjustment?

  3. Model ranking

    • When the benchmark is RTEB, the ranking logic is changed to use the RTEB-specific algorithm.

    • Question: What do you think about this change?

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions