Pipelined parallel extract #407

cosmicexplorer · 2023-09-21T04:26:21Z

Upsides

Lots and lots faster when extracting zips with many separate entries, or with large highly compressed individual entries:

> cargo bench -- extract
# ...
running 6 tests
test extract_pipelined_compressible_big   ... bench:  81,470,730 ns/iter (+/- 6,342,681) = 449 MB/s
test extract_pipelined_compressible_small ... bench:     716,842 ns/iter (+/- 256,469) = 8 MB/s
test extract_pipelined_random             ... bench:  23,833,197 ns/iter (+/- 15,791,334) = 424 MB/s
test extract_sync_compressible_big        ... bench: 335,936,236 ns/iter (+/- 6,129,233) = 109 MB/s
test extract_sync_compressible_small      ... bench:     168,939 ns/iter (+/- 7,269) = 37 MB/s
test extract_sync_random                  ... bench:  33,435,824 ns/iter (+/- 1,222,875) = 302 MB/s

The *_pipelined_* benchmarks use the new pipelined parallel extraction method, and the *_compressible_big benchmarks demonstrate almost a 5x speedup, while the *_random benchmarks demonstrate a 1.4x speedup. Note that the *_compressible_small benchmark is slower in the pipelined case, but this is such a small input that we actually lose very little.

Downsides

This brings in rayon and a few other dependencies which we would probably want to assign to a flag. As @NobodyXu mentioned in https://github.com/zip-rs/zip/issues/403#issuecomment-1712451398, this also imposes a Clone requirement on the reader:

or requires the reader to implement Clone by storing File to be stored inside an Arc and keep track of the curent location of cursor in the reader

TODO

As mentioned above, this also loses performance against small inputs. I think a fully async approach with the async-executor crate might be a much cleaner approach than trying to scale our rayon threadpools up and down according to the size of the input.

Pr0methean · 2024-05-01T23:26:22Z

Replaced with zip-rs/zip2#72.

cosmicexplorer added 11 commits September 18, 2023 02:00

and we got a benchmark

47d2c39

significantly faster pipelined extraction!!!

009c6dd

using a secondary rayon threadpool is SLOWER than manual threading!!

5d939b8

convert Vec<u8> to Bytes/BytesMut

ddd7cf5

add once_cell

e0afdd4

conver to rayon::join

ebb9148

reuse the file handle to set permissions

321ffca

clarify randomness in bench (means incompressibility)

24e76c7

move dir creation out of the hot path

1922204

avoid the need to reverse by hand

e4dbf6e

ok the pipelining works perfectly and lucidrously fast now

f4c11f1

cosmicexplorer mentioned this pull request Sep 21, 2023

Parallel extraction zip-rs/zip2#165

Open

cosmicexplorer mentioned this pull request Sep 30, 2023

prototype async API, with demonstrable perf improvements via benchmark #409

Closed

Pr0methean mentioned this pull request May 1, 2024

perf: Pipelined parallel extract zip-rs/zip2#72

Closed

Pr0methean closed this May 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Pipelined parallel extract #407

Pipelined parallel extract #407

Uh oh!

cosmicexplorer commented Sep 21, 2023 •

edited

Loading

Uh oh!

Pr0methean commented May 1, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Pipelined parallel extract #407

Pipelined parallel extract #407

Uh oh!

Conversation

cosmicexplorer commented Sep 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Upsides

Downsides

TODO

Uh oh!

Pr0methean commented May 1, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cosmicexplorer commented Sep 21, 2023 •

edited

Loading