Skip to content

Commit 97b4105

Browse files
committed
Updated Tigergpu tutorial
1 parent 0be5be0 commit 97b4105

File tree

1 file changed

+27
-6
lines changed

1 file changed

+27
-6
lines changed

docs/PrincetonUTutorial.md

Lines changed: 27 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# TigerGPU Tutorial
2-
*Last updated 2019-10-24.*
2+
*Last updated 2021-6-3.*
33

44
## Building the package
55
### Login to TigerGPU
@@ -32,7 +32,14 @@ module load openmpi/cuda-8.0/intel-17.0/3.0.0/64
3232
module load intel
3333
module load hdf5/intel-17.0/intel-mpi/1.10.0
3434
```
35-
As of the latest update of this document, the above modules correspond to the following versions on the TigerGPU system, given by `module list`:
35+
As of the latest update of this document (Summer 2021), the above modules correspond to the following versions on the TigerGPU system, given by `module list`:
36+
```
37+
Currently Loaded Modulefiles:
38+
1) anaconda3/2020.7 3) cudnn/cuda-9.2/7.6.3 5) hdf5/gcc/openmpi-1.10.2/1.10.0
39+
2) cudatoolkit/10.2 4) openmpi/cuda-11.0/gcc/4.0.4/64
40+
```
41+
42+
A previous version of this document (2019) listed the below modules for the Tiger GPU system.
3643
```
3744
Currently Loaded Modulefiles:
3845
1) anaconda3/2019.3 4) openmpi/cuda-8.0/intel-17.0/3.0.0/64 7) hdf5/intel-17.0/intel-mpi/1.10.0
@@ -47,17 +54,25 @@ Next, install the `plasma-python` package:
4754
python setup.py install
4855
```
4956

50-
Where `my_env` should contain the Python packages as per `requirements-travis.txt` file.
57+
Where `my_env` should contain the Python packages as per `envs/pip-requirements-travis.txt` file.
5158

5259
### Common build issue: cluster's MPI library and `mpi4py`
5360

5461
Common issue is Intel compiler mismatch in the `PATH` and what you use in the module. With the modules loaded as above,
55-
you should see something like this:
62+
you should see something like this (as of summer 2021):
63+
64+
```
65+
$ which mpicc
66+
/usr/local/openmpi/cuda-11.0/4.0.4/gcc/x86_64/bin/mpicc
67+
```
68+
69+
A previous version of this document had the below output (as of 2019).
5670
```
5771
$ which mpicc
5872
/usr/local/openmpi/cuda-8.0/3.0.0/intel170/x86_64/bin/mpicc
5973
```
60-
Especially note the presence of the CUDA directory in this path. This indicates that the loaded OpenMPI library is [CUDA-aware](https://www.open-mpi.org/faq/?category=runcuda).
74+
75+
In both cases, especially note the presence of the CUDA directory in this path. This indicates that the loaded OpenMPI library is [CUDA-aware](https://www.open-mpi.org/faq/?category=runcuda).
6176

6277
If you `conda activate` the Anaconda environment **after** loading the OpenMPI library, your application would be built with the MPI library from Anaconda, which has worse performance on this cluster and could lead to errors. See [mpi4py on HPC Clusters](https://researchcomputing.princeton.edu/support/knowledge-base/mpi4py) for a related discussion.
6378

@@ -78,7 +93,13 @@ ln -s /tigress/FRNN/signal_data signal_data
7893
All the configuration parameters are summarised in `examples/conf.yaml`. In this section, we highlight the important ones used to control the input data.
7994

8095
Currently, FRNN is capable of working with JET and D3D data as well as thecross-machine regime. The switch is done in the configuration file:
96+
```yaml
97+
paths:
98+
...
99+
data: 'jet_0D'
100+
```
81101
102+
Older yaml files kept for archival purposes will denote this data set as follow:
82103
```yaml
83104
paths:
84105
...
@@ -107,7 +128,7 @@ This will preprocess the data and save rescaled copies of the signals in `/tigre
107128
Preprocessing must be performed only once per each dataset. For example, consider the following dataset specified in the config file `examples/conf.yaml`:
108129
```yaml
109130
paths:
110-
data: jet_data_0D
131+
data: jet_0D
111132
```
112133
Preprocessing this dataset takes about 20 minutes to preprocess in parallel and can normally be done on the cluster headnode.
113134

0 commit comments

Comments
 (0)