Description of the feature
We need to display RTEB results in the interface.
To support this, we are opening this issue to discuss the related changes before finalizing the implementation.
A demo of the current implementation can be found here:
https://huggingface.co/spaces/SmileXing/leaderboard
There are currently three points that need discussion:
- Data filtering
  - When the benchmark is RTEB, only RTEB model results are displayed.
  - Question: How should we determine which models belong to RTEB and should be shown?
- Field adjustments
  - When the benchmark is RTEB, the zero-shot field is hidden (including the zero-shot option in advanced model filters).
  - Question: What do you think about this adjustment?
- Model ranking
  - When the benchmark is RTEB, the ranking logic is changed to use the RTEB-specific algorithm.
  - Question: What do you think about this change?
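To make the three points easier to discuss, here is a minimal sketch of how they could be expressed as one per-benchmark configuration. It is not the code in the demo or in the leaderboard. The names `BenchmarkView`, `build_table`, `rank_by_mean`, the row layout, and the model names are all hypothetical. The mean-score ranking is only a placeholder, since the RTEB-specific algorithm is not described here, and the explicit allow-list is just one possible answer to the filtering question.

```python
from dataclasses import dataclass
from typing import Callable, Optional


def mean_score(row: dict) -> float:
    """Average of a model's per-task scores (placeholder metric)."""
    scores = list(row["scores"].values())
    return sum(scores) / len(scores) if scores else 0.0


def rank_by_mean(rows: list) -> list:
    """Placeholder ranking: highest mean score first."""
    return sorted(rows, key=mean_score, reverse=True)


@dataclass(frozen=True)
class BenchmarkView:
    """Hypothetical per-benchmark display settings."""
    # None means "show every model"; a set restricts the table to those models.
    allowed_models: Optional[frozenset] = None
    # Columns dropped from the table (e.g. "zero_shot" for RTEB).
    hidden_columns: frozenset = frozenset()
    # Ranking function; an RTEB-specific one would be plugged in here.
    rank: Callable[[list], list] = rank_by_mean


def build_table(rows: list, view: BenchmarkView) -> list:
    """Apply filtering, ranking, and column hiding, in that order."""
    if view.allowed_models is not None:
        rows = [r for r in rows if r["model"] in view.allowed_models]
    ranked = view.rank(rows)
    return [
        {k: v for k, v in r.items() if k not in view.hidden_columns}
        for r in ranked
    ]


# Made-up rows, for illustration only.
rows = [
    {"model": "model-a", "zero_shot": True, "scores": {"t1": 0.5, "t2": 0.7}},
    {"model": "model-b", "zero_shot": False, "scores": {"t1": 0.9, "t2": 0.7}},
    {"model": "model-c", "zero_shot": True, "scores": {"t1": 0.8, "t2": 0.8}},
]

rteb_view = BenchmarkView(
    allowed_models=frozenset({"model-a", "model-b"}),
    hidden_columns=frozenset({"zero_shot"}),
)
table = build_table(rows, rteb_view)
print([r["model"] for r in table])
```

Keeping the three behaviours in one settings object would mean other benchmarks keep the defaults (all models, all columns, default ranking), and only the RTEB entry overrides them.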