Pre first release cleanup #71
Conversation
Signed-off-by: Hao Wu <[email protected]>
/ok to test fb57fe5
Force-pushed fb57fe5 to 9190b79.
Force-pushed 9190b79 to 5144a6e.
emerging_optimizers/orthogonalized_optimizers/orthogonalized_optimizer.py
WeightDecayT = Literal["decoupled", "independent", "l2"]

class WeightDecayMixin:
What is the reason for this to be a class rather than a set of functions chosen based on arguments?
Weight decay is highly coupled with the optimizer and shared among many optimizer subclasses.
I thought about an optimizer base class with this function that everyone would inherit from, but not all of our optimizers use the same base.
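The mixin approach discussed here can be sketched as follows. This is a hypothetical illustration, not the repository's actual implementation: the method name `apply_weight_decay` and the scalar signature are assumptions (a real version would operate on parameter tensors), but it shows how the three `WeightDecayT` modes could live in one class that any optimizer can mix in regardless of its base class.

```python
from typing import Literal

WeightDecayT = Literal["decoupled", "independent", "l2"]


class WeightDecayMixin:
    """Hypothetical sketch: shared weight-decay logic that optimizer
    subclasses can mix in even when they do not share a base class.
    Scalars are used for illustration; a real implementation would
    operate on parameter tensors."""

    def apply_weight_decay(
        self,
        param: float,
        grad: float,
        lr: float,
        weight_decay: float,
        mode: WeightDecayT,
    ) -> tuple[float, float]:
        """Return the (possibly decayed) parameter and gradient."""
        if mode == "l2":
            # Classic L2 regularization: fold the decay term into the gradient.
            return param, grad + weight_decay * param
        if mode == "decoupled":
            # AdamW-style decoupled decay: shrink the parameter by lr * wd,
            # leaving the gradient untouched.
            return param * (1.0 - lr * weight_decay), grad
        if mode == "independent":
            # Decay at its own rate, independent of the learning rate.
            return param * (1.0 - weight_decay), grad
        raise ValueError(f"unknown weight decay mode: {mode!r}")
```

A mixin like this keeps the mode dispatch in one place while still letting each optimizer decide where in its step the decay is applied.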
/ok to test 832ae49
/ok to test da5be01
FDecaYed left a comment:
LGTM
@mkhona-nvidia @FDecaYed please take a look; there will be another doc-only cleanup before we make a release, after which changing things would become a maintenance burden.
@mkhona-nvidia coverage of the scalar optimizers and SOAP with adaptive criteria is low; it would be good to improve. It can also be done after the release, so it is not mandatory.