Improve quality of f32/f64 generation (#103)
Conversation
The previous int-to-float conversion had a bias of probability 2^-24 / 2^-53 for types f32 / f64 respectively. The new conversion has a bias of 2^-64, which is the same bias as the underlying WyRand generator. The new conversion is a slightly shorter instruction sequence on x86 and ARM, but executes as 1 more uop on x86. Seems unlikely to harm performance much, if at all. https://rust.godbolt.org/z/q3zMxEc3T
This is unavailable on thumbv7m-none-eabi.
They both end up compiling to the same code after inlining and simplification, but the former requires more work from the optimizer to get there.
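As a rough sketch of the two conversion strategies being compared (the function names and bit layout here are illustrative, not the crate's actual API):

```rust
// Illustrative sketch only; names and exact bit selection are assumptions.

/// Old-style conversion: keep the top 24 bits and scale by 2^-24, giving
/// a value in [0, 1). Only 2^24 outputs are reachable, so the bias is on
/// the order of 2^-24 (analogously 2^-53 for f64 with 53 bits).
fn old_f32(bits: u64) -> f32 {
    (bits >> 40) as f32 * (1.0 / (1u64 << 24) as f32)
}

/// New-style conversion: round all 64 bits to the nearest f32 and scale by
/// 2^-64, giving a value in [0, 1] inclusive with bias on the order of 2^-64.
/// Rust's int-to-float `as` cast rounds to nearest, and the power-of-two
/// scaling is exact, so no further rounding error is introduced.
fn new_f32_inclusive(bits: u64) -> f32 {
    bits as f32 * 2.0f32.powi(-64)
}

fn main() {
    assert_eq!(new_f32_inclusive(0), 0.0);
    // 2^64 - 1 rounds up to 2^64 as an f32, so the inclusive endpoint 1.0
    // is reachable (hence the rejection loop in f32() below).
    assert_eq!(new_f32_inclusive(u64::MAX), 1.0);
    assert!(old_f32(u64::MAX) < 1.0);
}
```

The inclusive endpoint is why the exclusive `f32()` variant discussed below needs a rejection loop.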
```rust
}

#[test]
fn digit() {
```
Sorry, I accidentally removed several unrelated tests when resolving conflicts. Filed #110 to re-add them.
```rust
    loop {
        let x = self.f32_inclusive();
        if x < 1.0 {
            return x;
        }
    }
}
```
Was the perf of this measured? Seems like adding a branch and a loop to f32 might have performance implications I'm not comfortable with releasing.
I didn't measure at the time since there was no existing benchmark; I only looked at assembly.
I've now added a benchmark in #112 and measured the results on AArch64 (M2 Pro) and x86-64 (Zen4). I get:
AArch64:
0.449 ns/iter: old f32() (2, 3, or 4 iterations unrolled)
0.344 ns/iter: new f32_inclusive() (4 iterations unrolled)
0.587 ns/iter: new f32() (2 iterations unrolled)
x86-64:
0.856 ns/iter: old f32() (3 iterations)
0.649 ns/iter: new f32_inclusive()
0.772 ns/iter: new f32() (2 iterations unrolled)
So the change is always a win if you switch to f32_inclusive(), and it's a win on x86-64 (but a loss on AArch64) if you stay with f32().
Note that the branch is almost-always taken (probability 1 - 2^-23) so the branch predictor will predict this ~perfectly.
(These numbers are the benchmark numbers, divided by 10_000, since the benchmark itself generates 10_000 f32s.)
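A per-call figure like the ones above can be reproduced with a minimal timing harness along these lines. This is a hedged sketch, not the benchmark from #112; the WyRand step uses the constants from the public wyrand reference and is included only to make the harness self-contained:

```rust
use std::time::Instant;

// Minimal WyRand step (constants from the public wyrand reference; illustrative).
fn wyrand(state: &mut u64) -> u64 {
    *state = state.wrapping_add(0xa076_1d64_78bd_642f);
    let t = (*state as u128) * ((*state ^ 0xe703_7ed1_a0b4_28db) as u128);
    ((t >> 64) as u64) ^ (t as u64)
}

// [0, 1] inclusive: round all 64 bits to f32 and scale by 2^-64.
fn f32_inclusive(state: &mut u64) -> f32 {
    wyrand(state) as f32 * 2.0f32.powi(-64)
}

fn main() {
    let mut state = 0x2d35_8dcc_aa6c_78a5_u64; // arbitrary nonzero seed
    const N: u32 = 10_000;
    let start = Instant::now();
    let mut sum = 0.0f32; // accumulate so the loop isn't optimized away
    for _ in 0..N {
        sum += f32_inclusive(&mut state);
    }
    let elapsed = start.elapsed();
    // Divide by N to get the per-call figure, as described above.
    println!("{:.3} ns/iter (sum = {sum})", elapsed.as_nanos() as f64 / N as f64);
}
```

Note that a one-shot harness like this is noisier than a proper benchmark runner; the numbers quoted above should be taken from the repository's benchmark, not this sketch.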
See comments in lib.rs for a full explanation of the bias change. The added tests in smoke.rs fail with the old conversion and pass with the new one.