Conversation
samsja
left a comment
I am not sure how I feel about the `run_with_optional_checkpoint` pattern; wondering if we can do the same with a hook instead. It definitely makes the code harder to understand. How does torchtitan do this?
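For context, a minimal sketch of the wrapper shape under discussion. `run_with_optional_checkpoint` here is a hypothetical stand-in built on `torch.utils.checkpoint`, not the PR's actual helper:

```python
import torch
from torch.utils.checkpoint import checkpoint

def run_with_optional_checkpoint(fn, *args, enabled: bool = False):
    """Hypothetical wrapper: call fn directly, or under activation checkpointing.

    When enabled, activations inside fn are recomputed during backward
    instead of being stored, trading compute for memory.
    """
    if enabled:
        # Non-reentrant checkpointing re-runs fn's forward during backward.
        return checkpoint(fn, *args, use_reentrant=False)
    return fn(*args)

# Tiny demo: both paths should produce identical outputs and gradients.
layer = torch.nn.Linear(8, 8)
x = torch.randn(2, 8, requires_grad=True)

out_plain = run_with_optional_checkpoint(layer, x, enabled=False)
out_ckpt = run_with_optional_checkpoint(layer, x, enabled=True)
```

A hook-based alternative would instead register the checkpointing behavior on the module from the outside, which keeps the call sites untouched but hides the control flow.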
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Autofix Details
Bugbot Autofix prepared a fix for the issue found in the latest run.
- ✅ Fixed: Missing changelog for config schema changes
- Added a CHANGELOG.md entry documenting the new model.ac.mode and model.ac.targets fields.
Or push these changes by commenting:
@cursor push f77b587704
Preview (f77b587704)
diff --git a/CHANGELOG.md b/CHANGELOG.md
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -2,6 +2,8 @@
Documenting changes which affect configuration usage patterns (added/moved/removed/renamed fields, notable logic changes).
+- **`model.ac.mode`** and **`model.ac.targets`**: Added selective activation checkpointing controls. `mode` selects `"full"` (whole blocks) vs `"selective"` (subcomponents on supported custom decoder layers). When `"selective"`, `targets` chooses from `["norm", "attention_sdpa", "mla_up_proj", "routed_experts"]`. Defaults: `mode="full"`, `targets=["norm"]`. (2026-03-20)
+
- **`orchestrator.advantage.length_weighted_mean`**: Removed. The default advantage now always uses the plain per-problem mean baseline unless `orchestrator.advantage.length_shaping_alpha` is set. Existing configs must delete this field. (2026-03-19)
- **`orchestrator.advantage.length_shaping_alpha`**: Added Group Relative Reward Rescaling coefficient to the default advantage config. When set, applies length-based GR3 shaping during advantage computation and requires `orchestrator.buffer.online_difficulty_filtering = true` (default: `None`) (2026-03-18)
- **`prime_monitor.log_extras.sample_ratio`**: Added ratio-based rollout sampling (0.0–1.0, default: `None`). When set, caps the number of rollouts logged per step to `len(rollouts) * sample_ratio`. `None` preserves current behavior (log all rollouts). Interacts with the existing `interval` gate, which still runs first. (2026-03-12)
Cursor Bugbot has reviewed your changes and found 1 potential issue.
There are 2 total unresolved issues (including 1 from previous review).
Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.
Signed-off-by: faresobeid <111092724+faresobeid@users.noreply.github.com>
S1ro1
left a comment
Just nits on styling, else lgtm
| return dq, dkv, None, None |
| class LayerNorm(nn.Module): |
This shouldn't live in this file imo; maybe create a new `layers/norms.py`?
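For reference, the suggested `layers/norms.py` could hold a minimal LayerNorm on its own. This is a sketch only; the PR's actual implementation may differ (e.g. dtype handling or a fused kernel):

```python
# layers/norms.py (hypothetical location suggested in review)
import torch
from torch import nn

class LayerNorm(nn.Module):
    """Plain LayerNorm over the last dimension with learnable affine params."""

    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(dim))
        self.bias = nn.Parameter(torch.zeros(dim))
        self.eps = eps

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        mean = x.mean(-1, keepdim=True)
        # Biased variance matches torch.nn.functional.layer_norm.
        var = x.var(-1, keepdim=True, unbiased=False)
        return (x - mean) / torch.sqrt(var + self.eps) * self.weight + self.bias

# Quick demo
ln = LayerNorm(4)
x = torch.randn(3, 4)
y = ln(x)
```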
| return indices.view(1, total_tokens, 1, index_topk) |
| class GlmMoeDsaAttention(nn.Module): |
I don't like this being in this file either, or at least it being called `GlmMoeDsaAttention` if it has to live here.
| class GlmMoeDsaAttention(nn.Module): |
| def __init__(self, config: GlmMoeDsaConfig): |
It also depends on `GlmMoeDsaConfig`, which introduces an ugly circular import pattern.
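One common way to break such a cycle is to import the config type only for static type checking. This is a general Python pattern, not necessarily how this PR should resolve it; the module path and attribute name below are hypothetical:

```python
from __future__ import annotations

from typing import TYPE_CHECKING

import torch.nn as nn

if TYPE_CHECKING:
    # Seen only by static type checkers; skipped at runtime,
    # so no circular import occurs. Module path is hypothetical.
    from .configs import GlmMoeDsaConfig

class GlmMoeDsaAttention(nn.Module):
    def __init__(self, config: GlmMoeDsaConfig):
        super().__init__()
        # Attribute name assumed for illustration.
        self.num_heads = config.num_attention_heads
```

With `from __future__ import annotations`, the annotation is never evaluated at runtime, so the class works with any object exposing the expected attributes.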
Can be used with:
```toml
[model.ac]
freq = 1
mode = "selective"
targets = ["norm", "attention_sdpa", "routed_experts"]
```
Note
Medium Risk
Changes core model execution and memory behavior by introducing selective activation checkpointing and refactoring attention/MoE internals, which could affect training correctness/perf on specific custom model layers.
Overview
Adds selective activation checkpointing via the new `model.ac.mode` (`full`/`selective`) and `model.ac.targets` fields, including validation (non-empty targets; requires `model.impl = 'custom'`) and benchmark-script support for the new CLI flags.

Implements selective checkpointing by patching specific subcomponent methods (`norm`, `_attention_core`, `_mla_up_proj`, `_run_routed_experts`) with non-reentrant checkpoints, and updates `apply_ac` to mix selective and full-block fallback per layer and to error on unsupported targets.

Refactors attention implementations to expose `_attention_core` for the SDPA/Flash paths (AFMoE, Qwen3.5-MoE, and shared attention layers), extracts GLM MoE DSA sparse MLA attention into a new `mla_attn.py`, and splits MoE routed-expert compute into `_run_routed_experts`; also resets MoE `tokens_per_expert` buffers after model setup to avoid stale runtime stats.

Written by Cursor Bugbot for commit c4a87ca. This will update automatically on new commits.
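The method-patching approach described above can be sketched roughly as follows. This is a simplified illustration under stated assumptions: the method name `_attention_core` comes from the review, but `checkpoint_method`, `ToyBlock`, and the wrapper logic are hypothetical stand-ins, not the PR's code:

```python
import torch
from torch.utils.checkpoint import checkpoint

def checkpoint_method(module: torch.nn.Module, name: str) -> None:
    """Replace module.<name> with a non-reentrant checkpointed version.

    The instance attribute shadows the class method, so only this
    module instance is affected.
    """
    orig = getattr(module, name)  # bound method

    def wrapped(*args, **kwargs):
        # Recompute orig's forward during backward instead of storing activations.
        return checkpoint(orig, *args, use_reentrant=False, **kwargs)

    setattr(module, name, wrapped)

class ToyBlock(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.proj = torch.nn.Linear(4, 4)

    def _attention_core(self, x):
        return torch.relu(self.proj(x))

    def forward(self, x):
        return self._attention_core(x)

block = ToyBlock()
x = torch.randn(2, 4, requires_grad=True)
ref = block(x)
checkpoint_method(block, "_attention_core")  # e.g. one target from model.ac.targets
out = block(x)
```

Outputs and gradients are unchanged by the patch; only the memory/compute trade-off differs, which is why mixing selective targets with a full-block fallback per layer is viable.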