Skip to content

Commit f0aaecc

Browse files
testsean-smith
authored andcommitted
If merged this commit does the following:
* Remove subpath from pretrain_llama.py * Install toml package * Adjust --gres=gpu:8 to number of user specified devices Signed-off-by: Sean Smith <seasmith@nvidia.com>
1 parent 4017c93 commit f0aaecc

File tree

2 files changed

+3
-2
lines changed

2 files changed

+3
-2
lines changed

large_language_model_pretraining/nemo/Dockerfile

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,7 @@ FROM ${NEMO_BASE_IMAGE} AS nemo-base-image
1818
RUN pip uninstall transformers -y
1919
RUN pip install transformers==4.47.1 blobfile==3.0.0
2020
RUN pip install prettytable==3.12.0
21+
RUN pip install toml==0.10.2
2122
RUN pip install git+https://github.com/mlcommons/logging.git@4.1.0-rc3
2223

2324
# setup workspace

large_language_model_pretraining/nemo/pretrain_llama31.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -75,8 +75,8 @@ def slurm_executor(
7575
gpus_per_node=devices,
7676
mem="0",
7777
exclusive=True,
78-
gres="gpu:8",
79-
packager=run.GitArchivePackager(subpath="large_language_model_pretraining/nemo", ref="HEAD"),
78+
gres=f"gpu:{devices}",
79+
packager=run.GitArchivePackager(),
8080
dependencies=dependencies,
8181
)
8282

0 commit comments

Comments
 (0)