Conversation

@MukeshK17

Fixes #2187 (CLI args not setting currently for max_steps, memory/time code profiling)

What does this PR do?

Allows --train.max_steps to be passed via the CLI for pretraining runs.
Previously, this argument caused a validation error in pretrain.py.

The argument is now accepted; when it is provided, a warning clarifies that it is
intended for profiling, debugging, or minimal sanity-check runs.

Motivation

Users may want to run a very small number of training steps (e.g. max_steps=1)
to measure memory usage or execution time without committing to full pretraining.

Changes

  • Removed train.max_steps from the unsupported argument list in pretrain.py
  • Added a warning when train.max_steps is provided, clarifying its intended usage (see the sketch below)
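
A minimal sketch of how the change might look in pretrain.py; TrainArgs, validate_args, and the exact warning text here are illustrative assumptions, not the repository's actual code:

```python
# Hypothetical sketch of the pretrain.py change; names and warning text
# are assumptions, not the actual source.
import warnings
from dataclasses import dataclass
from typing import Optional


@dataclass
class TrainArgs:
    max_steps: Optional[int] = None


def validate_args(train: TrainArgs) -> None:
    # train.max_steps is no longer rejected as unsupported; instead, warn
    # that it is meant for profiling, debugging, or sanity-check runs.
    if train.max_steps is not None:
        warnings.warn(
            "train.max_steps is intended for profiling, debugging, or "
            "minimal sanity-check runs, not for full pretraining."
        )
```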

Tests

  • Core unit tests pass locally
  • HF tokenizer tests requiring gated models fail locally without HF_TOKEN
  • Optional dependency tests (datasets, bitsandbytes, lm_eval) not run locally
  • CI is expected to cover gated and optional test paths

@bhimrazy
Collaborator

LGTM 👍
@MukeshK17, could you also include a test for this? Maybe in test_cli or wherever it fits best. That would help ensure this behavior stays covered.

@MukeshK17
Author

Thanks! I’ll add a CLI-level test to ensure --train.max_steps is accepted and doesn’t raise a validation error. Will update the PR shortly.

@MukeshK17
Author

Added a CLI test in test_cli.py to cover --train.max_steps acceptance and warning behavior.
Thanks for the suggestion!
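
For context, a minimal, self-contained sketch of what such a test might look like; the stubbed TrainArgs and validate_args below are assumptions standing in for the real code path, not the test that was actually added:

```python
# Hypothetical sketch of the kind of test added in test_cli.py; the stubs
# below stand in for the real pretrain code path.
import warnings
from dataclasses import dataclass
from typing import Optional

import pytest


@dataclass
class TrainArgs:
    max_steps: Optional[int] = None


def validate_args(train: TrainArgs) -> None:
    # Stand-in for the real validation logic in pretrain.py.
    if train.max_steps is not None:
        warnings.warn("train.max_steps is intended for profiling or debugging runs.")


def test_max_steps_is_accepted_and_warns():
    # Providing max_steps should not raise a validation error and should
    # emit the advisory warning.
    with pytest.warns(UserWarning, match="max_steps"):
        validate_args(TrainArgs(max_steps=1))
```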
