Skip to content

Conversation

@msimberg
Copy link
Collaborator

Just for testing builds, I don't know if this will work.

@msimberg
Copy link
Collaborator Author

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@msimberg
Copy link
Collaborator Author

hsa-amd-aqlprofile had missing compiler dependencies: spack/spack-packages#2532.

@msimberg
Copy link
Collaborator Author

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@msimberg
Copy link
Collaborator Author

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@msimberg
Copy link
Collaborator Author

ROCm is a menace: https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/551234120955960/1440398897047560/-/jobs/12201564988#L3724. hipblaslt seems to be picking up amdclang++ from the system... needs further investigation.

@msimberg
Copy link
Collaborator Author

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@msimberg
Copy link
Collaborator Author

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@msimberg
Copy link
Collaborator Author

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@jgphpc
Copy link
Contributor

jgphpc commented Nov 28, 2025

Hi,
Could you add rocprofiler_sdk/package.py to the recipe ? needed for linaro...

@msimberg
Copy link
Collaborator Author

Hi, Could you add rocprofiler_sdk/package.py to the recipe ? needed for linaro...

I'll try it out, hopefully no bigger issues (though note that I have issues with other packages before this is usable unfortunately).

@msimberg
Copy link
Collaborator Author

msimberg commented Dec 3, 2025

spack/spack-packages#2287 to add rocm 7.1.0 is also currently open. It may help, it may make things worse... The PR description does mention a change to hipblaslt, which may change something.

@afzpatel
Copy link

afzpatel commented Dec 3, 2025

ROCm is a menace: https://gitlab.com/cscs-ci/ci-testing/webhook-ci/mirrors/551234120955960/1440398897047560/-/jobs/12201564988#L3724. hipblaslt seems to be picking up amdclang++ from the system... needs further investigation.

Yes, it looks like an older version of ROCm is installed on the system and it's choosing the incorrect version of amdclang++. I'll see if I can reproduce the issue and put in a fix.

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

1 similar comment
@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

2 similar comments
@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

1 similar comment
@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

Issue is:
ImportError: /user-environment/env/default/lib/libhipsparse.so.4: undefined symbol: rocsparse_destroy_spmv_descr

which is strange because:
 ldd /user-environment/env/default/lib/libhipsparse.so.4
        linux-vdso.so.1 (0x00007ffddc1c1000)
        /user-environment/env/default/lib/librocsparse.so (0x00007f6af9a27000)
 nm /user-environment/linux-zen3/rocsparse-7.1.0-qfflrfxfu2vaqu52hitvwyukdwq767rf/lib/librocsparse.so | grep rocsparse_destroy_spmv_descr
0000000002f33fc0 T rocsparse_destroy_spmv_descr
@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@simonpintarelli
Copy link
Member

hipblaslt etc are in the cache now.

composable-kernels, a dependency of miopen-hip seems to take forever to build, it ran into walltime limit after 21 hrs..

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

hipblaslt etc are in the cache now.

composable-kernels, a dependency of miopen-hip seems to take forever to build, it ran into walltime limit after 21 hrs..

Thank you very much for the help 👍 This is something to check in a following PR maybe with updating to ROCM 7.1.1

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

1 similar comment
@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

depends_on("py-textual-plotext", when="@7.0:", type=("build", "run"))
depends_on("py-sqlalchemy@2.0.42:", when="@7.1:", type=("build", "run"))

patch("keep_ld_preload.patch", when="@7.1.0")
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For some reason hipsparse.so doesn't link with some signatures of rocsparse.so and copy fails when it gets imported without export LD_PRELOAD=/user-environment/env/default/lib/librocsparse.so:$LD_PRELOAD or something similar.
I tried to fix the issue in hipsparse/rocsparse but I don't understand why linking is going wrong. I will probably have to keep using the LD_PRELOAD workaround and report the issue to AMD

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

@iomaganaris
Copy link
Collaborator

cscs-ci run alps;system=beverin;uarch=mi200;uenv=prgenv-gnu:25.12

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants