[libclc] Support the generic address space #137183

frasercrmck · 2025-04-24T14:26:09Z

This commit provides definitions of builtins with the generic address space.

One concept to consider is the difference between supporting the generic address space from the user's perspective and the requirement for libclc as a compiler implementation detail to define separate generic address space builtins. In practice a target (like NVPTX) might notionally support the generic address space, but it's mapped to the same LLVM target address space as another address space (often the private one).

In such cases libclc must be careful not to define both private and generic overloads of the same builtin. We track these two concepts separately, and make the assumption that if the generic address space does clash with another, it's with the private one. We track the concepts separately because there are some builtins such as atomics that are defined for the generic address space but not the private address space.

frasercrmck · 2025-04-24T14:26:21Z

CC @wenju-he

arsenm

Even as a compiler implementation detail, libclc should not need to consider the address space mapping (unless maybe you're directly using IR)

There is a clang bug if there is different mangling. The itanium mangling should be coming from the source type / original address space, not whatever IR address space value that happens to map to

frasercrmck · 2025-04-24T16:14:06Z

There is a clang bug if there is different mangling. The itanium mangling should be coming from the source type / original address space, not whatever IR address space value that happens to map to

Yeah, that would be nice but this is what's happening, I'm afraid.

It is actually supported in the Itanium mangler:

    if (Context.getASTContext().addressSpaceMapManglingFor(AS)) {
      //  <target-addrspace> ::= "AS" <address-space-number>
      unsigned TargetAS = Context.getASTContext().getTargetAddressSpace(AS);
      if (TargetAS != 0 ||
          Context.getASTContext().getTargetAddressSpace(LangAS::Default) != 0)
        ASString = "AS" + llvm::utostr(TargetAS);
    } else {
      switch (AS) {
      default: llvm_unreachable("Not a language specific address space");
      //  <OpenCL-addrspace> ::= "CL" [ "global" | "local" | "constant" |
      //                                "private"| "generic" | "device" |
      //                                "host" ]
      case LangAS::opencl_global:
        ASString = "CLglobal";
        break;

It's just that targets we care about in libclc unconditionally enable that address space map mangling for all address spaces, such as AMDGPU and NVPTX.

I'm not sure I would want to change this behaviour at this point. At least not for the purposes of enabling generic address space support in libclc. There will be a bunch of downstream toolchains that rely on the current mangling scheme.

arsenm · 2025-04-24T19:01:46Z

It is actually supported in the Itanium mangler:

I don't remember this part of the hack. There was a recent fix to always use the correct mapping values for AMDGPU when generic address space is enabled (which should be the only mapping, still need to do something about the setAddressSpaceMap hack).

arsenm · 2025-04-24T19:02:37Z

libclc/CMakeLists.txt

+    # FIXME: Shouldn't clang automatically enable this extension based on the
+    # target?


Yes, the extension should be reported as available or not by the target macros

we should enable __opencl_c_generic_address_space for amdgpu and nvptx in setSupportedOpenCLOpts API rather than in this CMakeLists file, right?

yes (although for amdgpu it needs to skip the ancient targets without flat addressing)

See the first PR for AMDGPU support: #137636.

I'll do a separate one for NVPTX. It looks like SPIRV (and X86) enable all by default. That should cover all libclc targets.

With #137940 we can remove this CMake logic, thanks all!

One result of #137636 is that (I believe) because of the default AMDGPU devices we compile libclc for, the generic address space isn't being enabled by default for any AMDGPU target in this PR. Should we perhaps be building libclc for a newer AMDGCN device? It should be at least GFX700 for generic address space support.

There isn't a generic target you can just compile for. For amdhsa triples, we assume the default dummy target-cpu supports flat pointers. -mlink-builtin-bitcode plus a collection of other hacks let's us get pretty far in pretending we can have a generic bitcode, but it's fragile system I would like to get away from.

Long term I think we need better compartmentalization on target feature dependence. i.e. we should have a base implementation plus the compiler can select target specific function variants later. In this case the generic address space is pretty fundamental. I doubt anyone is regularly testing with gfx6 anywhere, maybe we could just drop them from the support list here. It also shouldn't be that part to implement software tagged flat pointers, there would just be a small runtime component required to make it work (which is driver work which would never really be implemented)

That's probably a good idea, yes. There's a lot of redundancy in having these large bytecode libraries which are 95% functionally equivalent between targets/architectures.

I can circle back to this in a follow-up PR to investigate the enabling of the generic address space support for AMDGPU architectures.

libclc/clc/include/clc/math/unary_decl_with_int_ptr.inc

libclc/clc/include/clc/clcfunc.h

This commit provides definitions of builtins with the generic address space. It is assumed that all current libclc targets can support the generic address space. One concept to consider is the difference between supporting the generic address space from the user's perspective, and the requirement for libclc as a compiler implementation detail to define separate generic address space builtins. In practice a target (like NVPTX) might notionally support the generic address space, but it's mapped to the same LLVM target address space as the private address space. Therefore libclc may not define both private and generic overloads of the same builtin. We track these two concepts separately, and make the assumption that if the generic address space does clash with another, it's with the private one.

github-actions · 2025-05-21T10:49:26Z

✅ With the latest revision this PR passed the C/C++ code formatter.

frasercrmck added the libclc libclc OpenCL library label Apr 24, 2025

frasercrmck requested a review from arsenm April 24, 2025 14:26

arsenm reviewed Apr 24, 2025

View reviewed changes

libclc/clc/include/clc/math/unary_decl_with_int_ptr.inc Show resolved Hide resolved

libclc/clc/include/clc/clcfunc.h Outdated Show resolved Hide resolved

frasercrmck force-pushed the libclc-generic-addrspace branch from b437f11 to 694166d Compare April 29, 2025 16:41

frasercrmck added 5 commits May 21, 2025 11:27

Update decl guards

6b518a8

clean up cmake

54fc5a0

wop reduce macros

70f8478

update macro usage

b34b140

frasercrmck force-pushed the libclc-generic-addrspace branch from 694166d to b34b140 Compare May 21, 2025 10:47

fix formatting

3ad27e2

arsenm approved these changes May 21, 2025

View reviewed changes

frasercrmck merged commit 94142d9 into llvm:main May 21, 2025
9 checks passed

frasercrmck deleted the libclc-generic-addrspace branch May 21, 2025 16:50

		# FIXME: Shouldn't clang automatically enable this extension based on the
		# target?

[libclc] Support the generic address space #137183

[libclc] Support the generic address space #137183

Uh oh!

Conversation

frasercrmck commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

frasercrmck commented Apr 24, 2025

Uh oh!

arsenm left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

frasercrmck commented Apr 24, 2025

Uh oh!

arsenm commented Apr 24, 2025

Uh oh!

arsenm Apr 24, 2025

Choose a reason for hiding this comment

Uh oh!

wenju-he Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

arsenm Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

frasercrmck Apr 28, 2025

Choose a reason for hiding this comment

Uh oh!

frasercrmck May 21, 2025

Choose a reason for hiding this comment

Uh oh!

arsenm May 21, 2025

Choose a reason for hiding this comment

Uh oh!

frasercrmck May 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented May 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

frasercrmck commented Apr 24, 2025 •

edited

Loading

arsenm left a comment •

edited

Loading

github-actions bot commented May 21, 2025 •

edited

Loading