Improve eval-results badge docs and add moderation note#2208

Open
gary149 wants to merge 1 commit into main from improve-eval-results-docs

gary149 (Contributor) commented Feb 6, 2026

Summary

  • Clarify badge table descriptions to match actual implementation in moon-landing (e.g. verified = valid verifyToken reproduced via Inspect AI, community = open PR, leaderboard = benchmark has a leaderboard, source = external URL provided)
  • Add a tip after Community Contributions explaining how community scores can be moderated (author can close the PR to remove a disputed score)
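
As a rough illustration of the badge mapping described in the first bullet, here is a minimal sketch. It is not the moon-landing implementation: the `EvalResult` record, its field names, and the `badges_for` helper are all hypothetical and only restate the four conditions listed above.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class EvalResult:
    """Hypothetical record; field names are illustrative, not the real schema."""
    has_valid_verify_token: bool = False     # a valid verifyToken accompanies the result
    from_open_pr: bool = False               # score comes from a community PR that is still open
    benchmark_has_leaderboard: bool = False  # the benchmark has a leaderboard
    source_url: Optional[str] = None         # external URL provided with the result


def badges_for(result: EvalResult) -> List[str]:
    """Return badge names for a result, following the mapping in the summary."""
    badges: List[str] = []
    if result.has_valid_verify_token:
        badges.append("verified")
    if result.from_open_pr:
        badges.append("community")
    if result.benchmark_has_leaderboard:
        badges.append("leaderboard")
    if result.source_url:
        badges.append("source")
    return badges
```

Under this sketch, closing a community PR would correspond to `from_open_pr` becoming false, so the score would no longer carry the community badge.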

Context

User feedback surfaced two common questions:

  1. "Who runs the evals?" — badge descriptions now make provenance clearer
  2. "Can someone submit false scores?" — new tip explains the PR-based moderation model

Test plan

  • Review rendered markdown for clarity

Note

Low Risk
Low risk documentation-only change that adds guidance on how community-provided eval scores are displayed and can be removed if disputed.

Overview
Clarifies the eval-results documentation by adding a TIP under Community Contributions. The tip explains that community-submitted scores are visible while a PR is open and can be removed by closing the PR if disputed, and it emphasizes the intent to move toward reproducible, verified scores.

Written by Cursor Bugbot for commit 892b57e. This will update automatically on new commits.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

gary149 force-pushed the improve-eval-results-docs branch from f89f93b to b8b7aed on February 6, 2026 15:45
gary149 marked this pull request as ready for review on March 13, 2026 10:21
Add a tip after Community Contributions explaining that community scores
are visible while the PR is open and the model author can close the PR
to remove a disputed score.
gary149 force-pushed the improve-eval-results-docs branch from b8b7aed to 892b57e on March 13, 2026 10:23