Conversation

@YashasviChaurasia (Contributor)

Description of the change

This change sets `adamw_torch` as the default optimizer for fms-hf-tuning.

The HF Trainer default is `adamw_torch_fused`, which won't be compatible with mixed precision. Given that we push for mixed precision (`bf16`) for performance, I propose we make `adamw_torch` the default for our stack.
(Screenshot: HF Trainer `optim` default.)
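
For illustration, a minimal sketch of the setting this PR changes, using Hugging Face's `TrainingArguments` directly (the `output_dir` value is a placeholder; this shows the intended default, not fms-hf-tuning's internal config code):

```python
from transformers import TrainingArguments

# HF Trainer's built-in default is optim="adamw_torch_fused".
# This PR makes fms-hf-tuning default to plain PyTorch AdamW instead,
# which works with the bf16 mixed-precision setting we push for performance.
args = TrainingArguments(
    output_dir="out",      # placeholder path
    optim="adamw_torch",   # the proposed fms-hf-tuning default
    bf16=True,             # mixed precision enabled
)
print(args.optim)  # -> "adamw_torch"
```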

Related issue number

How to verify the PR

@github-actions

Thanks for making a pull request! 😃
One of the maintainers will review and advise on the next steps.

@dushyantbehl (Collaborator) left a comment


LGTM

@dushyantbehl merged commit f41eb2c into foundation-model-stack:main on Sep 15, 2025
9 checks passed