Skip to content

Conversation

@wenju-he
Copy link
Contributor

Also delete unary_def_via_fp32.inc. There are small changes in amdgcn--amdhsa.bc due to vector conversion is scalarized, e.g.
%2 = fpext <4 x half> %0 to <4 x float>
%3 = extractelement <4 x float> %2, i64 0
%4 = tail call float @llvm.fabs.f32(float %3)
->
%2 = extractelement <4 x half> %0, i64 0
%3 = tail call half @llvm.fabs.f16(half %2)
%4 = fpext half %3 to float

Also delete unary_def_via_fp32.inc. There are small changes in
amdgcn--amdhsa.bc due to vector conversion is scalarized, e.g.
  %2 = fpext <4 x half> %0 to <4 x float>
  %3 = extractelement <4 x float> %2, i64 0
  %4 = tail call float @llvm.fabs.f32(float %3)
->
  %2 = extractelement <4 x half> %0, i64 0
  %3 = tail call half @llvm.fabs.f16(half %2)
  %4 = fpext half %3 to float
@wenju-he wenju-he requested a review from frasercrmck July 23, 2025 07:33
@llvmbot llvmbot added the libclc libclc OpenCL library label Jul 23, 2025
@wenju-he wenju-he merged commit bcd0d97 into llvm:main Jul 29, 2025
11 checks passed
@wenju-he wenju-he deleted the libclc-delete-unary_def_via_fp32.inc branch July 29, 2025 00:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

libclc libclc OpenCL library

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants