Skip to content

Actions: NVIDIA-NeMo/RL

Actions

Create PR to main with cherry-pick from release

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
658 workflow runs
658 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: Add GPT-OSS support via mcore (#1452)
Create PR to main with cherry-pick from release #633: Commit 441f745 pushed by terrykong
13s main
perf: Add qwen3 30b-a3b async-8-off recipe (#1642)
Create PR to main with cherry-pick from release #632: Commit 7dd9a01 pushed by terrykong
11s main
feat: Necessary changes for Gym GRPO tutorial (#1630)
Create PR to main with cherry-pick from release #631: Commit 0bddd47 pushed by terrykong
11s main
feat: add support from building images using vllm from private repos …
Create PR to main with cherry-pick from release #630: Commit df01ca7 pushed by terrykong
9s main
chore: fix grpo functional test metric (#1643)
Create PR to main with cherry-pick from release #629: Commit 52cebdf pushed by terrykong
10s main
docs: Revise news section for nemotron v3 and DAPO algorithm support …
Create PR to main with cherry-pick from release #628: Commit d4fffe0 pushed by terrykong
12s main
chore: Enable LoRA Nightly Test (#1634)
Create PR to main with cherry-pick from release #627: Commit a010564 pushed by terrykong
13s main
fix: Set use_flashinfer_fused_rope to False (#1636)
Create PR to main with cherry-pick from release #626: Commit 363165a pushed by terrykong
14s main
docs: Add SkyRL to inspired libraries list (#1632)
Create PR to main with cherry-pick from release #625: Commit 995efaa pushed by terrykong
11s main
chore: update megatron dev (11/21/2025) / mbridge (11/28/2025) (#1568)
Create PR to main with cherry-pick from release #624: Commit 5d04b36 pushed by terrykong
10s main
fix: Sort rollout outputs to match inputs order + gym bump (#1627)
Create PR to main with cherry-pick from release #623: Commit 5e3c0e2 pushed by terrykong
13s main
refactor: refactor env and data processor & add nemotron super 49b re…
Create PR to main with cherry-pick from release #622: Commit 7e5df0c pushed by terrykong
12s main
fix: swanlab logger error caused by define_metric (#1615)
Create PR to main with cherry-pick from release #621: Commit b33e440 pushed by terrykong
10s main
feat: LoRA SFT support for DTensorV2 path (#1556)
Create PR to main with cherry-pick from release #620: Commit 32f5bef pushed by terrykong
9s main
fix: Set validation accuracy to mean of rewards to handle non-[0,1] r…
Create PR to main with cherry-pick from release #619: Commit e3cfb11 pushed by terrykong
11s main
fix: add H200 TFLOPS (#1543)
Create PR to main with cherry-pick from release #618: Commit 5bc5eba pushed by terrykong
13s main
docs: update roadmap post v0.4 (#1607)
Create PR to main with cherry-pick from release #617: Commit 64ab08d pushed by terrykong
14s main
feat: Enable Ray dashboard for Ray state API (#1602)
Create PR to main with cherry-pick from release #616: Commit 0947683 pushed by terrykong
12s main
feat: log generation ISL/OSL histogram to wandb (#1594)
Create PR to main with cherry-pick from release #615: Commit 140cd97 pushed by terrykong
13s main
feat: allow uv-less execution and fingerprint the environment (#1491)
Create PR to main with cherry-pick from release #614: Commit ed9cab7 pushed by terrykong
14s main
chore: rename penguin -> nemo_gym and add the gym submodule (#1587)
Create PR to main with cherry-pick from release #613: Commit 23d2bed pushed by terrykong
14s main
refactor: Introduce BasePolicyWorker (#1585)
Create PR to main with cherry-pick from release #612: Commit a99bc26 pushed by terrykong
15s main
fix: ADDING DOCS (#1595)
Create PR to main with cherry-pick from release #611: Commit 5e73bfd pushed by terrykong
15s main
feat: force on-policy ratio to 1 (#1529)
Create PR to main with cherry-pick from release #610: Commit 1cad374 pushed by terrykong
12s main
feat: Add moe load balancing metrics (#1520)
Create PR to main with cherry-pick from release #609: Commit 859a89a pushed by terrykong
21s main