Remove TODO about Replacing Sorting with Bucketing by macvincent · Pull Request #330 · facebookincubator/nimble

macvincent · 2025-11-20T20:36:03Z

Summary: We initially planned to improve the performance of the hard chunking stage by bucketing the streams by size (by most significant bit) instead of sorting them. However, after running benchmarks in D87571945. Sorting always performs better for large stream counts.

Differential Revision: D87573831

meta-codesync · 2025-11-20T20:36:10Z

@macvincent has exported this pull request. If you are a Meta employee, you can view the originating Diff in D87573831.

…ubator#325) Summary: When implementing the stream chunker, we anticipated that stream buffers after chunking will end up growing to the size that previously triggered chunking. As a tradeoff between minimizing reallocations (for performance) and actually releasing memory (to relieve memory pressure), we heuristically determine the new buffer capacity for each stream to be larger that required. The issue with this optimization is that it conflicts with the rest of our memory tracking logic since we now have retained memory in the memory pool that is not accounted for. We now know through local testing that disabling this optimization leads to better memory pressure relief. We performed local DISCO tests with and without this optimization for two tables and included the comparison in https://docs.google.com/spreadsheets/d/1kZvBwhVHZRyB7tg-qT2Et4V_7mDWyNr-VrtDzjG_FKY/view?gid=1209014630#gid=1209014630. It shows that for these two tables we save an average of **15%** improvement in average write memory. We also see **4%** write CPU improvement on average. Though we saw **-6%** regression in CPU for a single 256MB stripe experiment. These results indicate that more testing is required in order to understand its impact on a larger sample size. In this diff, we introduce a JK, [dwio/nimble_chunking:disable_memory_reallocation_optimization](https://www.internalfb.com/intern/justknobs/?name=dwio%2Fnimble_chunking#disable_memory_reallocation_optimization), that will be enabled just for DISCO experiments. This will help us understand the full impact of this optimization and whether it should be retained. Differential Revision: D87494427

…#330) Summary: We initially planned to improve the performance of the hard chunking stage by bucketing the streams by size (by most significant bit) instead of sorting them. However, after running benchmarks in D87571945. Sorting always performs better for large stream counts. Differential Revision: D87573831

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Nov 20, 2025

meta-codesync bot added fb-exported meta-exported labels Nov 20, 2025

macvincent added 2 commits November 20, 2025 12:38

macvincent force-pushed the export-D87573831 branch from 9b475ed to 9c6749a Compare November 20, 2025 20:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove TODO about Replacing Sorting with Bucketing#330

Remove TODO about Replacing Sorting with Bucketing#330
macvincent wants to merge 2 commits intofacebookincubator:mainfrom
macvincent:export-D87573831

macvincent commented Nov 20, 2025

Uh oh!

meta-codesync bot commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

macvincent commented Nov 20, 2025

Uh oh!

meta-codesync bot commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant