https://godbolt.org/z/j1WPxaGKG
uint32_t4 can be implicitly casted to uint64_t4 but __attribute__((__vector_size__(4 * sizeof(unsigned int)))) unsigned int raises an error instead.
This causes issues with intrinsics such as firstbitlow which use the attribute type
DXC is perfectly fine with the implict cast for firstbitlow, while clang raises a compiler error