Skip to content

fix: Change base.yml path from MaxText to maxtext#1209

Open
RUEI4341 wants to merge 4 commits intoGoogleCloudPlatform:masterfrom
CIeNET-International:maxtext/release/pr-223
Open

fix: Change base.yml path from MaxText to maxtext#1209
RUEI4341 wants to merge 4 commits intoGoogleCloudPlatform:masterfrom
CIeNET-International:maxtext/release/pr-223

Conversation

@RUEI4341
Copy link
Contributor

@RUEI4341 RUEI4341 commented Mar 6, 2026

Description

In AI-Hypercomputer/maxtext#3044, all config files were moved from src/MaxText/configs to the lowercase src/maxtext/configs. Because of this directory change, the XLML DAGs are currently failing with the following error:
FileNotFoundError: [Errno 2] No such file or directory: '/deps/src/MaxText/configs/base.yml'

This change updated file paths for compatibility with AI-Hypercomputer/maxtext#3044.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run one-shot tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed.

In AI-Hypercomputer/maxtext#3044, all config files were moved from `src/MaxText/configs` to the lowercase `src/maxtext/configs`.
Because of this directory change, the XLML DAGs are currently failing with the following error:
```FileNotFoundError: [Errno 2] No such file or directory: '/deps/src/MaxText/configs/base.yml'```

This change updated file paths for compatibility with AI-Hypercomputer/maxtext#3044.
base_command = (
f"export BASE_OUTPUT_PATH={BASE_OUTPUT_PATH} && "
+ "python3 -m MaxText.train MaxText/configs/base.yml base_output_directory=gs://runner-maxtext-logs run_name=${RUN_NAME} model_name=mixtral-8x7b tokenizer_path=assets/tokenizer.mistral-v1 dataset_path=gs://maxtext-dataset per_device_batch_size=4 enable_checkpointing=false ici_fsdp_parallelism=-1 max_target_length=1024 async_checkpointing=false attention=flash dtype=bfloat16 weight_dtype=bfloat16"
+ "python3 -m MaxText.train maxtext/configs/base.yml base_output_directory=gs://runner-maxtext-logs run_name=${RUN_NAME} model_name=mixtral-8x7b tokenizer_path=assets/tokenizer.mistral-v1 dataset_path=gs://maxtext-dataset per_device_batch_size=4 enable_checkpointing=false ici_fsdp_parallelism=-1 max_target_length=1024 async_checkpointing=false attention=flash dtype=bfloat16 weight_dtype=bfloat16"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A separate PR would be good, but can you please also update the python3 -m MaxText.train references to use python3 -m maxtext.trainers.pre_train.train instead? The old ones are deprecated and will be removed in the near future. This is the full list of commands we will need to update: https://github.com/AI-Hypercomputer/maxtext/tree/102af23138003f20df9e9c8194ee6617e47881f9/src/MaxText

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Understood. We will open a separate PR to migrate those deprecated commands to the new training path.

run_cmds = [
"pip show aqtp",
f"bash MaxText/configs/{tpu}/{model_size}.sh EXECUTABLE=train.py OUTPUT_PATH={base_output_directory} PLATFORM=gke",
f"bash maxtext/configs/tpu/{tpu}/{model_size}.sh EXECUTABLE=train.py OUTPUT_PATH={base_output_directory} PLATFORM=gke",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please use src/maxtext/configs as the path instead of maxtext/configs. This will work temporarily, but we will likely remove this soon

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the clarification. We will update the path to src/maxtext/configs for now.

@alfredyu-cienet alfredyu-cienet changed the title fix: Change base.yml path from MaxText to maxtext (#223) fix: Change base.yml path from MaxText to maxtext Mar 6, 2026
@alfredyu-cienet alfredyu-cienet requested a review from xuefgu as a code owner March 6, 2026 02:49
@RUEI4341
Copy link
Contributor Author

RUEI4341 commented Mar 6, 2026

Hi @bvandermoon, Could you review these changes when you have a moment? Thank you.

Also, to avoid potential code conflicts, we are planning to wait until this is merged before submitting the next PR, which will migrate MaxText.train to maxtext.trainers.pre_train.train.

@RUEI4341 RUEI4341 requested a review from bvandermoon March 6, 2026 06:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants