Skip to content

Conversation

@juanmichelini
Copy link
Collaborator

Summary

Rename claude-4.6-opus to claude-opus-4-6 to match Anthropic's official branding.

Changes

  • Rename results directory from claude-4.6-opus to claude-opus-4-6
  • Update metadata.json model and directory_name fields
  • Update Model enum and mappings in validate_schema.py

Fixes #538

@juanmichelini can click here to continue refining the PR

- Rename results directory from claude-4.6-opus to claude-opus-4-6
- Update metadata.json model and directory_name fields
- Update Model enum and mappings in validate_schema.py

Fixes #538

Co-authored-by: openhands <[email protected]>
@github-actions
Copy link

github-actions bot commented Feb 9, 2026

📊 Progress Report

============================================================
OpenHands Index Results - Progress Report
============================================================

Target: Complete all model × benchmark pairs
  11 models × 5 benchmarks = 55 pairs
  (each pair requires all 3 metrics: score, cost_per_instance, average_runtime)

============================================================
OVERALL PROGRESS: ⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛⬛ 100.0%
  Complete: 55 / 55 pairs
============================================================

✅ Schema Validation

============================================================
Schema Validation Report
============================================================

Results directory: /home/runner/work/openhands-index-results/openhands-index-results/results
Files validated: 28
  Passed: 28
  Failed: 0

============================================================
VALIDATION PASSED
============================================================

This report measures progress towards the 3D array goal (benchmarks × models × metrics) as described in #2.

@juanmichelini
Copy link
Collaborator Author

Closing will do one big PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Rename claude-4.6-opus to claude-opus-4-6

2 participants