Help understanding error message #10862
hahahannes
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I am running UCX with the following settings:
to enforce the usage of GPU RDMA. I am getting the following error:
With
-x UCX_TLS=rc,gdr_copy,cuda_copy
, it works. I assumecuda_copy
is used as a fallback and will result in staging through host memory.I would be very grateful for an explanation of what the possible error sources could be. I am assuming that this means that the GDR setup is not correct but I am not sure.
Thanks a lot !
Beta Was this translation helpful? Give feedback.
All reactions