File tree Expand file tree Collapse file tree 2 files changed +6
-3
lines changed
examples/multi_node_examples Expand file tree Collapse file tree 2 files changed +6
-3
lines changed Original file line number Diff line number Diff line change 11# Multi-node example
22
3- Run this module to launch a job which runs on 2 nodes each using 2 GPUs.
3+ To run this demo which launches a single job that trains on 2 nodes (2 gpus per node), do the following:
44
5+ 1 . Log into the jumphost node of your SLURM-managed cluster.
6+ 2 . Create a conda environment with Lightning and a GPU PyTorch version.
7+ 3 . Submit this script.
58``` bash
6- bash job_submit.sh
9+ bash job_submit.sh your_env_name_with_lightning_installed
710```
Original file line number Diff line number Diff line change 88# SBATCH --time=0-02:00:00
99
1010# activate conda env
11- conda activate my_env
11+ conda activate $1
1212
1313# -------------------------
1414# debugging flags (optional)
You can’t perform that action at this time.
0 commit comments