The plan by dlwh · Pull Request #2518 · marin-community/marin

dlwh · 2026-01-28T21:48:47Z

Include a summary of the changes and the related issue if any.

A good description is a paragraph or so describing the changes you made and the
motivation. You may follow this with a few bullets for specific changes, but
try to keep it concise.

e.g.

Title: [RL] Fix loss: use global token normalization instead of per-example

"""
This fixes a regression in the DAPO loss computation by switching
from per-example normalization (/ n_i) back to global token
normalization (/ N). Per-example normalization gives shorter responses
disproportionately more gradient weight, which hurts math reasoning
tasks where correct answers often require detailed, longer derivations.
Global normalization weights all examples equally regardless of response
length.
"""

Fixes #

github-actions · 2026-02-21T01:14:43Z

This pull request has been inactive for 23 days and is marked as stale.
If there is no further activity within 7 days, it will be automatically closed.
If you believe this PR should remain open, please add a comment or update the PR.

dlwh added 12 commits January 2, 2026 15:24

very rough cut

37ae87d

letting codex have a go at it

df78921

oops i accidentally a file

9c24562

wip

0b0ea3b

wip

4141e9e

Merge remote-tracking branch 'origin/main' into the_plan

1cea27d

wip

a631f8b

Merge remote-tracking branch 'origin/main' into the_plan

9baf316

wip

038b7bc

Merge remote-tracking branch 'origin/main' into the_plan

45ed7ee

pallas plan entry

33c0f5b

Merge remote-tracking branch 'origin/main' into the_plan

94fa9c2

github-actions bot added the stale label Feb 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

The plan#2518

The plan#2518
dlwh wants to merge 12 commits intomainfrom
the_plan

dlwh commented Jan 28, 2026

Uh oh!

github-actions bot commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

dlwh commented Jan 28, 2026

Uh oh!

github-actions bot commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant