
Benchmarks / accuracy #46

Triggered via: schedule, August 8, 2025 18:37
Status: Cancelled
Total duration: 7h 29m 59s

accuracy_test.yaml

on: schedule
Matrix: accuracy_tests
create_pr

Annotations

4 errors
Qwen/Qwen3-30B-A3B accuracy: Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen/Qwen3-8B-Base accuracy: Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Benchmarks / accuracy: Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen/Qwen2.5-VL-7B-Instruct accuracy: Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists