Commit 1ae9953
[ROCm] Update CUDAPluggableAllocator.h (#1984) (pytorch#153974)
[ROCm] Update CUDAPluggableAllocator.h (#1984) (pytorch#150010)
Change the preprocessor flag that selects the correct streamType in the CUDAPluggableAllocator class on ROCm GPUs. The TORCH_HIP_VERSION flag does not work as intended on ROCm, so it is replaced with USE_ROCM. The issue affects Distributed Fused Adam in ROCm/APEX when the nccl_ub feature is used. The fix has been tested with rocm/apex.
See PR ROCm/apex#184
Pull Request resolved: pytorch#150010
Approved by: https://github.com/jeffdaily
(cherry picked from commit a19b667)
Co-authored-by: Sriram Kumar <[email protected]>
1 parent 4a815ed commit 1ae9953
1 file changed, +1 -1 lines changed

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 37 | 37 | |
| 38 | 38 | |
| 39 | 39 | |
| 40 | | - |
| | 40 | + |
| 41 | 41 | |
| 42 | 42 | |
| 43 | 43 | |
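The commit message describes swapping the macro that guards the streamType selection from TORCH_HIP_VERSION to USE_ROCM. The following is a minimal sketch of that pattern, assuming the header chooses its stream alias with a preprocessor guard. The stub structs and the `selected_stream_backend` helper are hypothetical stand-ins for illustration, not the actual PyTorch stream classes or the exact changed line.

```cpp
#include <string>

// Hypothetical stand-ins for the CUDA and HIP stream classes (assumption:
// the real header aliases one of two stream types depending on the build).
struct CudaStreamStub {
  static const char* name() { return "cuda"; }
};
struct HipStreamStub {
  static const char* name() { return "hip"; }
};

// Guard on USE_ROCM, the macro the commit switches to. Per the commit
// message, guarding on TORCH_HIP_VERSION did not work as intended on ROCm.
#if defined(USE_ROCM)
using streamType = HipStreamStub;
#else
using streamType = CudaStreamStub;
#endif

// Hypothetical helper that reports which alias the guard selected.
std::string selected_stream_backend() {
  return streamType::name();
}
```

Compiling this sketch with `-DUSE_ROCM` selects the HIP stub. Without the definition, the CUDA stub is used.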