Use internal iteration in `Vec::extend_desugared()` #138752

ChayimFriedman2 · 2025-03-20T16:51:30Z

Because LLVM is unable to optimize well external iteration with some iterator kinds (e.g. chain()).

To do that I had to hoist the size_hint() call to the beginning of the loop (since I no longer have access to the iterator inside the loop), which might slightly pessimize certain iterators that are able to give more accurate size bounds during iteration (e.g. flatten()). However, the effect should not be big, and also, common iterators like these also suffer from the external iteration optimizibility problem (e.g. flatten()).

Because LLVM is unable to optimize well external iteration with some iterator kinds (e.g. `chain()`). To do that I had to hoist the `size_hint()` call to the beginning of the loop (since I no longer have access to the iterator inside the loop), which might slightly pessimize certain iterators that are able to give more accurate size bounds during iteration (e.g. `flatten()`). However, the effect should not be big, and also, common iterators like these also suffer from the external iteration optimizibility problem (e.g. `flatten()`).

rustbot · 2025-03-20T16:51:35Z

r? @joboet

rustbot has assigned @joboet.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

compiler-errors · 2025-03-20T16:53:54Z

Is there some sort of codegen test you could use to demonstrate that this has a beneficial effect?

ChayimFriedman2 · 2025-03-20T16:55:35Z

I'm pretty sure a benchmark will show a difference, but I will check.

rust-log-analyzer · 2025-03-20T16:56:45Z

The job x86_64-gnu-llvm-18 failed! Check out the build log: (web) (plain)

Click to see the possible cause of the failure (guessed by this bot)

#19 exporting to docker image format
#19 sending tarball 19.8s done
#19 DONE 33.9s
##[endgroup]
Setting extra environment values for docker:  --env ENABLE_GCC_CODEGEN=1 --env GCC_EXEC_PREFIX=/usr/lib/gcc/
[CI_JOB_NAME=x86_64-gnu-llvm-18]
[CI_JOB_NAME=x86_64-gnu-llvm-18]
debug: `DISABLE_CI_RUSTC_IF_INCOMPATIBLE` configured.
---
sccache: Listening on address 127.0.0.1:4226
##[group]Configure the build
configure: processing command line
configure: 
configure: build.configure-args := ['--build=x86_64-unknown-linux-gnu', '--llvm-root=/usr/lib/llvm-18', '--enable-llvm-link-shared', '--set', 'rust.randomize-layout=true', '--set', 'rust.thin-lto-import-instr-limit=10', '--set', 'build.print-step-timings', '--enable-verbose-tests', '--set', 'build.metrics', '--enable-verbose-configure', '--enable-sccache', '--disable-manage-submodules', '--enable-locked-deps', '--enable-cargo-native-static', '--set', 'rust.codegen-units-std=1', '--set', 'dist.compression-profile=balanced', '--dist-compression-formats=xz', '--set', 'rust.lld=false', '--disable-dist-src', '--release-channel=nightly', '--enable-debug-assertions', '--enable-overflow-checks', '--enable-llvm-assertions', '--set', 'rust.verify-llvm-ir', '--set', 'rust.codegen-backends=llvm,cranelift,gcc', '--set', 'llvm.static-libstdcpp', '--enable-new-symbol-mangling']
configure: build.build          := x86_64-unknown-linux-gnu
configure: target.x86_64-unknown-linux-gnu.llvm-config := /usr/lib/llvm-18/bin/llvm-config
configure: llvm.link-shared     := True
configure: rust.randomize-layout := True
configure: rust.thin-lto-import-instr-limit := 10
---
[RUSTC-TIMING] memchr test:false 0.900
error: variable does not need to be mutable
    --> library/alloc/src/vec/mod.rs:3538:59
     |
3538 |     fn extend_desugared<I: Iterator<Item = T>>(&mut self, mut iterator: I) {
     |                                                           ----^^^^^^^^
     |                                                           |
     |                                                           help: remove this `mut`
     |
     = note: `-D unused-mut` implied by `-D warnings`

ChayimFriedman2 · 2025-03-20T17:41:16Z

So, @compiler-errors, unlike what I thought, this is not a 100% win (never trust your instincts in performance!). I benchmarked three scenarios: a flatten() call with a small array containing small slices, a call to flatten() with a large vector (100 elements) containing large vectors (0-100 elements, ascending), and small (15x10) vectors but with two chains and one flatten.

The results are: in the first two cases, the algorithms are almost equivalent, with a slight preference to the old algorithm for the first case and a slight preference to the new in the second case. In the third case, however, the new algorithm is almost 3x faster.

So my conclusion is: one such operation is not bad, but once you start to add more this version is significantly faster.

joboet · 2025-03-21T16:19:20Z

I think you could preserve the more precise length estimation by using try_for_each combined with Vec::push_within_capacity.

ChayimFriedman2 · 2025-03-22T18:21:36Z

@joboet I don't think, since the iterator will still be mutably borrowed.

joboet · 2025-03-23T12:41:31Z

No, that's not an issue. I've opened ChayimFriedman2#3 with what I had in mind.

joboet · 2025-04-03T08:57:43Z

@rustbot author
while CI isn't passing

rustbot · 2025-04-03T08:57:47Z

Reminder, once the PR becomes ready for a review, use @rustbot ready.

rustbot assigned joboet Mar 20, 2025

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Mar 20, 2025

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use internal iteration in `Vec::extend_desugared()` #138752

Use internal iteration in `Vec::extend_desugared()` #138752

ChayimFriedman2 commented Mar 20, 2025

Uh oh!

rustbot commented Mar 20, 2025

Uh oh!

compiler-errors commented Mar 20, 2025

Uh oh!

ChayimFriedman2 commented Mar 20, 2025

Uh oh!

rust-log-analyzer commented Mar 20, 2025

Uh oh!

ChayimFriedman2 commented Mar 20, 2025

Uh oh!

joboet commented Mar 21, 2025 •

edited

Loading

Uh oh!

ChayimFriedman2 commented Mar 22, 2025

Uh oh!

joboet commented Mar 23, 2025

Uh oh!

joboet commented Apr 3, 2025

Uh oh!

rustbot commented Apr 3, 2025

Uh oh!

Uh oh!

Use internal iteration in Vec::extend_desugared() #138752

Are you sure you want to change the base?

Use internal iteration in Vec::extend_desugared() #138752

Conversation

ChayimFriedman2 commented Mar 20, 2025

Uh oh!

rustbot commented Mar 20, 2025

Uh oh!

compiler-errors commented Mar 20, 2025

Uh oh!

ChayimFriedman2 commented Mar 20, 2025

Uh oh!

rust-log-analyzer commented Mar 20, 2025

Uh oh!

ChayimFriedman2 commented Mar 20, 2025

Uh oh!

joboet commented Mar 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ChayimFriedman2 commented Mar 22, 2025

Uh oh!

joboet commented Mar 23, 2025

Uh oh!

joboet commented Apr 3, 2025

Uh oh!

rustbot commented Apr 3, 2025

Uh oh!

Uh oh!

Use internal iteration in `Vec::extend_desugared()` #138752

Use internal iteration in `Vec::extend_desugared()` #138752

joboet commented Mar 21, 2025 •

edited

Loading