
Conversation

@max-hoffman (Contributor) commented Dec 19, 2024

ByteBuffer is a class that lets us avoid most allocations when spooling values to the wire. Notably, the object is responsible for doubling the backing array when appropriate, and a Grow(n int) interface is necessary to track when this should happen. Letting the runtime do all of this would be preferable, but the runtime doubles based on the slice's capacity, not the full backing array, and the refactors required to make that workable are more invasive. We pay for two mallocs on doubling, because the first one is never big enough. Failing to call Grow after writing, or growing by a length smaller than the bytes actually written, will stomp previously written memory.

As long as we track bytes used through the Grow interface, this works smoothly and shaves ~30% off table scans.

perf here: dolthub/dolt#8693

@max-hoffman max-hoffman changed the title session write buffer reuse wire write buffer Dec 19, 2024
@max-hoffman max-hoffman changed the title reuse wire write buffer pool wire write buffer Dec 19, 2024
@zachmu (Member) left a comment

Idea is sound, see comments


type ByteBuffer struct {
Member:

This idea has merit and is surely better than allocating a buffer for each request but the way you're managing the memory is suboptimal. Also good to use the same backing array for multiple values in a request.

In the main use of this object in the handler, you're getting a zero-length slice (which has some larger backing array) and then calling append on it byte by byte. This will grow the backing array in some cases, but it's not being done under your deliberate control. Rather, you're then calling Double if the backing array is low on space after the appends have already happened.

Basically: in these methods, you are referring to the len of the byte slice, when your concern is usually the cap. It's fine to let append happen byte by byte as long as it isn't doubling the backing array too often; the doubling is the expensive bit.
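The len-versus-cap point can be seen in a short sketch (values are illustrative):

```go
package main

import "fmt"

func main() {
	// A zero-length slice over a larger backing array: len is 0,
	// but appends are cheap until len would exceed cap.
	s := make([]byte, 0, 8)
	s = append(s, 1, 2, 3)      // writes in place; no reallocation
	fmt.Println(len(s), cap(s)) // 3 8

	// Only an append that pushes len past cap triggers the
	// expensive part: allocating a larger backing array.
}
```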

I think this would probably work slightly better if you scrapped the explicit capacity management altogether and let the Go runtime manage it automatically. Either that, or always manage it explicitly yourself, i.e. before you serialize a value with all those append calls. But it's not clear to me that manual management is actually any better if you use the same strategy the Go runtime does (double once we're full).

@max-hoffman (Contributor, Author):

I agree with all of this, but there are two caveats that limit our ability to let the runtime handle this for us. (1) The runtime chooses doubling based on the cap of the slice, not the full backing array, so for the current setup the doubled array is usually actually smaller than the original backing array. (2) Doubled arrays are not reference-swapped; we need a handle to the new buffer to know when to grow.

I'm not aware of a way to override the default runtime growth behavior to ignore the slice cap and instead double based on the backing array's cap. So ByteBuffer still does the doubling, with a Grow(n int) interface to track when it should happen. We pay for two mallocs on doubling, because the first one is never big enough. Failing to call Grow after writing, or growing by less than the bytes actually written, will stomp previously written memory.
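Caveat (1) can be demonstrated directly: the runtime sizes the new array from the slice's own cap, which for a slice taken near the end of a large buffer is small. An illustrative sketch:

```go
package main

import "fmt"

func main() {
	backing := make([]byte, 1024)

	// A slice taken near the end of the backing array only "sees"
	// the 24-byte tail: len 0, cap 24.
	s := backing[1000:1000]

	// Appending 32 bytes exceeds that cap, so the runtime allocates
	// a new array sized from the old cap and the needed length,
	// far smaller than the original 1024-byte backing array.
	s = append(s, make([]byte, 32)...)
	fmt.Println(cap(s) < 1024) // true
}
```

Note also that the reallocation happens inside `append`, so nothing else holding `backing` ever learns about the new array; that is caveat (2).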

@zachmu (Member) left a comment

LGTM, although a few tests of using the Get() / Grow() sequence across some boundaries would be great.

// are responsible for accurately reporting which bytes
// they expect to be protected.
func (b *ByteBuffer) Grow(n int) {
if b.i+n > len(b.buf) {
Member:

Actually, is there an off-by-one error here? A test at the boundary would be good.

@max-hoffman (Contributor, Author) commented Dec 20, 2024:

Yeah, maybe not an error, but it's definitely better to preemptively double when we know the next Grow will exceed the buffer.
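The boundary case being discussed can be pinned down with a minimal stand-in that mirrors the quoted `b.i+n > len(b.buf)` check (hypothetical, not the merged code):

```go
package main

import "fmt"

// buffer mirrors the condition quoted in the review: reserve n
// more bytes, doubling only when the reservation would overflow.
type buffer struct {
	buf []byte
	i   int
}

func (b *buffer) grow(n int) {
	if b.i+n > len(b.buf) {
		next := make([]byte, 2*(b.i+n))
		copy(next, b.buf[:b.i])
		b.buf = next
	}
	b.i += n
}

func main() {
	b := &buffer{buf: make([]byte, 8)}
	b.grow(8) // i+n == len(buf): exact fit, no doubling
	fmt.Println(len(b.buf)) // 8
	b.grow(1) // overflows: doubles to 2*(8+1)
	fmt.Println(len(b.buf)) // 18
}
```

At the exact boundary (`i+n == len(buf)`) the strict `>` does not double and the buffer is left completely full; that is the case worth asserting in a test.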

@max-hoffman max-hoffman merged commit 999a371 into main Dec 20, 2024
8 checks passed
@max-hoffman max-hoffman deleted the max/session-write-buffer branch December 20, 2024 23:27