[neural.slang] Enable __getStructuredBufferPtr for CUDA

# Problem Description
The motivation of this feature request is that we want a flexible way to call
```
__atomic_reduce_add(__ref T dst, T value)
```
This intrinsic requires that the element type of the buffer must be same type as the value type. So in some cases, when our buffer type is just `T`, but the value type could be `vector<T, 2>`, we basically have to good option to call this intrinsic. This happens at neural.slang where we have the optimized code path that call the `half2` version of the `__atomic_reduce_add`, while we can't change the buffer to `StructuredBuffer<half2>`. So in this case, we need a way to cast the pointer of the buffer.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[neural.slang] Enable __getStructuredBufferPtr for CUDA #10167

Problem Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[neural.slang] Enable __getStructuredBufferPtr for CUDA #10167

Description

Problem Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions