Skip to content

[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions #794

[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions

[LLM] Rewrite GSM8K reward function to follow standard GRPO conventions #794

Triggered via pull request March 5, 2026 11:01
@vmoensvmoens
edited #3542
Status Failure
Total duration 10s
Artifacts

auto-tag.yml

on: pull_request_target
add-label
5s
add-label
Fit to window
Zoom out
Zoom in

Annotations

2 errors
add-label
Process completed with exit code 1.
add-label
Unknown or invalid prefix '[LLM]'.