-
Notifications
You must be signed in to change notification settings - Fork 215
[Issue] Windows Test failures in multi arch ci with error HIP error: no ROCm-capable device is detected file - gfx110x-all #4274
Copy link
Copy link
Open
Labels
Description
Platform: windows
Effected Archs: gfx110x-all
Tests Failed:
miopen-providefr,
rocprim,
rocrand,
rocpsparse,
rocthrust
Runner Names on which tests failed:
azure-windows-11-gfx1101-gpu-rocm-runner-25azure-windows-11-gfx1101-gpu-rocm-runner-12
Full Logs:
- https://github.com/ROCm/TheRock/actions/runs/23774631922/job/69291797730
- https://github.com/ROCm/TheRock/actions/runs/23774631922/job/69291797924
- https://github.com/ROCm/TheRock/actions/runs/23774631922/job/69291797892
- https://github.com/ROCm/TheRock/actions/runs/23774631922/job/69291797855
Error Logs:
FAILED ] TestMiopenBatchnormFwdTrainingActivPlanBuilder.IsApplicableReturnsTrueForFusionWithoutRunningStatistics (2 ms)
[ RUN ] TestMiopenBatchnormFwdTrainingActivPlanBuilder.IsApplicableReturnsFalseForNonReluActivation
MIOpen Error: azw11gfx1101r25:C:/home/runner/_work/TheRock/TheRock/rocm-libraries/projects/miopen/src/hip/handlehip.cpp:132: No device: no ROCm-capable device is detected
[----------] 1 test from RocprimInternalMergePathTests
[ RUN ] RocprimInternalMergePathTests.Basic
HIP error: no ROCm-capable device is detected file: C:/home/runner/_work/TheRock/TheRock/rocm-libraries/projects/rocprim/test/rocprim\../../common/utils_device_ptr.hpp line: 80
[ RUN ] rocrand_basic_tests/rocrand_basic_tests.rocrand_create_destroy_generator_test/0
rocRAND internal error: hipErrorNoDevice in function ~poisson_distribution_manager at C:/home/runner/_work/TheRock/TheRock/rocm-libraries/projects/rocrand/library/src/rng\distribution/poisson.hpp:216
// rocSPARSE.error.trace: { "function": "_rocsparse_handle::_rocsparse_handle",
// "line" : "157",
// "file" : "C:/home/runner/_work/TheRock/TheRock/rocm-libraries/projects/rocsparse/library/src/common/rocsparse_handle.cpp",
// "status" : "internal error",
// "msg" : "hip error detected: code '100', name 'hipErrorNoDevice', description 'no ROCm-capable device is detected'" }
The creation of the rocsparse_handle failed.
Insufficient resources for test swap_ranges.hip:
Test requested resources of type 'gpus' in the following amounts:
1 slot
but only the following units were available:
Resource spec file:
resources.json
Insufficient resources for test tabulate.hip:
Test requested resources of type 'gpus' in the following amounts:
1 slot
but only the following units were available:```
Reactions are currently unavailable