Skip to content
Merged
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 7 additions & 0 deletions docs/software/communication/cray-mpich.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,6 +79,13 @@ Cray MPICH may sometimes hang on larger runs.

Performance may be negatively affected by this option.

#### `"cxil_map: write error"` when doing inter-node GPU-aware MPI communication

The following environment variable can be set to disable gdrcopy:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about:

This error message is sometimes triggered by applications that use GPU Direct MPI calls when they trigger a bug in gdrcopy (a low-level library used to copy buffers between GPUs).
Setting the following option will completely disable gdrcopy.
Note that this has a performance impact for small message sizes, so it should only be enabled on a case-by-case basis.

You could also mention that it has been used for ICON.

```bash
export FI_CXI_SAFE_DEVMEM_COPY_THRESHOLD=0
```

### Resolved issues

#### `"cxil_map: write error"` when doing inter-node GPU-aware MPI communication
Expand Down