Fix inaccurate `std::intrinsics::simd` documentation #137828

folkertdev · 2025-02-28T21:50:57Z

This addresses two issues:

the docs on comparison operators (simd_gt etc.) said they only work for floating-point vectors, but they work for integer vectors too.
the docs on various functions that use a mask did not document that the mask must be a signed integer vector. Unsigned integer vectors would cause invalid behavior when the mask vector is widened (unsigned integers would use zero extension, producing incorrect results).

rustbot · 2025-02-28T21:51:02Z

Some changes occurred to the intrinsics. Make sure the CTFE / Miri interpreter
gets adapted for the changes, if necessary.

cc @rust-lang/miri, @rust-lang/wg-const-eval

Some changes occurred to the platform-builtins intrinsics. Make sure the
LLVM backend as well as portable-simd gets adapted for the changes.

cc @antoyo, @GuillaumeGomez, @bjorn3, @calebzulawski, @programmerjake

calebzulawski · 2025-02-28T22:06:46Z

library/core/src/intrinsics/simd.rs

 /// `U` must be a vector of pointers to the element type of `T`, with the same length as `T`.
 ///
-/// `V` must be a vector of integers with the same length as `T` (but any element size).
+/// `V` must be a vector of signed integers with the same length as `T` (but any element size).


Maybe outside the scope of this PR, but this would need to be checked in codegen and emit an ICE

what exactly? you get a monomorphization error when these rules are violated. They are not the prettiest errros, but they are not ICEs either. Unless I'm missing something. e.g. https://godbolt.org/z/f5j5re5z8

Yep, that's a post-mono error.

Oh, good. I misunderstood and thought you meant the intrinsic was permitting invalid behavior.

yeah this is just bringing the docs up to date with the behavior that we already have and enforce.

Looking at this in the LLVM backend I don't even see such a widening cast. This code converts the masks to i1 vectors which LLVM seems to use for them, and it does that with lshr and trunc. The lshr is entirely unnecessary for correctness as we require the input to be all-1 or all-0, but

/// The rust simd semantics are that each element should either consist of all ones or all zeroes, /// but this information is not available to llvm. Truncating the vector effectively uses the lowest bit, /// but codegen for several targets is better if we consider the highest bit by shifting.

But I can't find anything that would go wrong with unsigned integers.

I personally agree, in that an intrinsic (that normal users should never have to worry about), we could either demand that the lane width matches the other arguments, or that the mask uses sign extension no matter the signedness of the argument.

From what I can tell the codegen backends already handle this, because e.g. in llvm (and from what i can see, also cranelift and gcc) the integers are just a bunch of bits.

The counter-argument was that performing sign extension on what rust believes is a vector with unsigned types would violate type safety.

The counter-argument was that performing sign extension on what rust believes is a vector with unsigned types would violate type safety.

I don't know what you mean by that. There's no sign extension happening, as far as I can tell. Also, all the backends have to do is implement the intended mask semantics, which is described in a bitwise way. If they use sign extension as part of that, that's completely fine. There's no type safety violation here.

I agree, the backends support it, and the intrinsic could (and in my opinion should) support it too. I'm just relaying the response I got when I suggested exactly that https://rust-lang.zulipchat.com/#narrow/channel/257879-project-portable-simd/topic/add.20.60simd_max.60.20and.20.60simd_min.60/near/502647748

Let's move this discussion to Zulip: https://rust-lang.zulipchat.com/#narrow/channel/257879-project-portable-simd/topic/On.20the.20sign.20of.20masks

library/core/src/intrinsics/simd.rs

these all also accept integer vectors as arguments

this is because they may be widened, and that only works when sign extension is used: zero extension would produce invalid results

workingjubilee · 2025-02-28T23:53:23Z

cool! looks good to me. test improvement stuff sounds nice as followup if people want but isn't required here and now.

@bors r+ rollup

bors · 2025-02-28T23:53:25Z

📌 Commit 854e9f4 has been approved by workingjubilee

It is now in the queue for this repository.

…s, r=workingjubilee Fix inaccurate `std::intrinsics::simd` documentation This addresses two issues: - the docs on comparison operators (`simd_gt` etc.) said they only work for floating-point vectors, but they work for integer vectors too. - the docs on various functions that use a mask did not document that the mask must be a signed integer vector. Unsigned integer vectors would cause invalid behavior when the mask vector is widened (unsigned integers would use zero extension, producing incorrect results). r? `@workingjubilee`

rustbot assigned workingjubilee Feb 28, 2025

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Feb 28, 2025

calebzulawski reviewed Feb 28, 2025

View reviewed changes

workingjubilee reviewed Feb 28, 2025

View reviewed changes

library/core/src/intrinsics/simd.rs Outdated Show resolved Hide resolved

correct the docs on simd_ comparison operators

4549266

these all also accept integer vectors as arguments

folkertdev force-pushed the simd-intrinsic-doc-fixes branch from 78636b7 to 417c51c Compare February 28, 2025 23:21

intrinsics::simd: document that masks must be signed integer vectors

854e9f4

this is because they may be widened, and that only works when sign extension is used: zero extension would produce invalid results

folkertdev force-pushed the simd-intrinsic-doc-fixes branch from 417c51c to 854e9f4 Compare February 28, 2025 23:29

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 28, 2025

matthiaskrgr mentioned this pull request Mar 1, 2025

Rollup of 9 pull requests #137841

Closed

folkertdev mentioned this pull request Mar 1, 2025

improve simd_select error message when used with invalid mask type #137851

Merged

matthiaskrgr mentioned this pull request Mar 1, 2025

Rollup of 10 pull requests #137855

Merged

bors merged commit c112b70 into rust-lang:master Mar 2, 2025
6 checks passed

rustbot added this to the 1.87.0 milestone Mar 2, 2025

Fix inaccurate std::intrinsics::simd documentation #137828

Fix inaccurate std::intrinsics::simd documentation #137828

Uh oh!

Conversation

folkertdev commented Feb 28, 2025

Uh oh!

rustbot commented Feb 28, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

folkertdev Feb 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

workingjubilee commented Feb 28, 2025

Uh oh!

bors commented Feb 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Fix inaccurate `std::intrinsics::simd` documentation #137828

Fix inaccurate `std::intrinsics::simd` documentation #137828

folkertdev Feb 28, 2025 •

edited

Loading