Skip to content

Commit 8622b34

Browse files
author
Rolf vandeVaart
committed
Check for GPU Direct RDMA and leave pinned turned off
1 parent 347290f commit 8622b34

File tree

2 files changed

+17
-0
lines changed

2 files changed

+17
-0
lines changed

opal/mca/btl/openib/btl_openib_component.c

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2577,6 +2577,15 @@ btl_openib_component_init(int *num_btl_modules,
25772577
opal_leave_pinned_pipeline = 0;
25782578
}
25792579

2580+
#if OPAL_CUDA_SUPPORT
2581+
if (mca_btl_openib_component.cuda_want_gdr && (0 == opal_leave_pinned)) {
2582+
opal_show_help("help-mpi-btl-openib.txt",
2583+
"CUDA_gdr_and_nopinned", true,
2584+
opal_process_info.nodename);
2585+
goto no_btls;
2586+
}
2587+
#endif /* OPAL_CUDA_SUPPORT */
2588+
25802589
index = mca_base_var_find("ompi", "btl", "openib", "max_inline_data");
25812590
if (index >= 0) {
25822591
if (OPAL_SUCCESS == mca_base_var_get_value(index, NULL, &source, NULL)) {

opal/mca/btl/openib/help-mpi-btl-openib.txt

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -692,3 +692,11 @@ You cannot have fork support and CUDA GPU Direct RDMA support on at the
692692
same time. Please disable one of them. Deactivating the openib BTL.
693693

694694
Local host: %s
695+
#
696+
[CUDA_gdr_and_nopinned]
697+
You requested to run with CUDA GPU Direct RDMA support but also with
698+
"leave pinned" turned off. This will result in very poor performance
699+
with CUDA GPU Direct RDMA. Either disable GPU Direct RDMA support or
700+
enable "leave pinned" support. Deactivating the openib BTL.
701+
702+
Local host: %s

0 commit comments

Comments
 (0)