Benchmarks / accuracy #257

Triggered via schedule September 30, 2025 12:54
Status Cancelled
Total duration 1d 0h 0m 3s

accuracy_test.yaml

on: schedule
Matrix: accuracy_tests
create_pr: 0s

Annotations

3 errors
Qwen/Qwen3-8B-Base accuracy
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
Qwen/Qwen2.5-VL-7B-Instruct accuracy
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s
Qwen/Qwen3-30B-A3B accuracy
The job has exceeded the maximum execution time while awaiting a runner for 24h0m0s