You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: examples/nemo_run/qat/ADVANCED.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,15 +8,13 @@ To run the example on slurm, edit the `SLURM_CONFIG` at the bottom of `nemo_qat_
8
8
9
9
To launch the Flow on a Slurm cluster, modify your Slurm credentials at the bottom of `nemo_qat_flow.py` and add the `--use-slurm` flag to the command. On a different server (e.g. your local server), launch the NeMo container as described in the [README](README.md) then run `python qat/nemo_qat_flow.py --use-slurm --log-dir /slurm/log/dir`, which will `ssh` into the Slurm cluster, `rsync` your files over, and launch the tasks. The log directory on the Slurm cluster should look like this after an experiment is run (assuming your experiment name is `qat_flow_ckpts`)
10
10
11
-
**NOTE:**`rsync` may not currently be available in the NeMo container and will be added as a dependency.
12
-
13
-
```
11
+
```bash
14
12
qat_flow_ckpts qat_flow_ckpts_1755708286
15
13
```
16
14
17
15
If you `cd` into the experiment itself, e.g. `cd qat_flow_ckpts_1755708286`, you'll find a directory structure like the following. Each folder is for a stage of the Simplified Flow, and in each stage you can see the logs for that stage as well as the sbatch command that was run. You can `cd` into each stage and `tail -f` the log file to see the logs while the stage is running.
18
16
19
-
```
17
+
```bash
20
18
├── 00_openscience_data
21
19
│ ├── code
22
20
│ ├── configs
@@ -54,3 +52,5 @@ If you `cd` into the experiment itself, e.g. `cd qat_flow_ckpts_1755708286`, you
54
52
│ ├── code
55
53
│ └── configs
56
54
```
55
+
56
+
**NOTE:**`rsync` may not currently be available in the NeMo container and will be added as a dependency.
0 commit comments