
Conversation

@sebthom (Contributor) commented Oct 25, 2025

  • Add MessageJsonHandler.serialize(Message, OutputStream, Charset)
  • Serialize into ByteArrayOutputStream and write via writeTo(output)
  • Remove String.getBytes(...) and toByteArray() clone
  • Cache Charset instead of using encoding String

No breaking changes: existing constructors retained; new overloads are additive.
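
A minimal, self-contained sketch of the intended write path, using a plain JSON string as a stand-in for the serialized Message and a hypothetical helper class (the real change lives in MessageJsonHandler/StreamMessageConsumer and may differ in detail):

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class StreamWriteSketch {

    // Cache the Charset once instead of resolving it from an encoding String per message.
    private static final Charset CHARSET = StandardCharsets.UTF_8;

    /** Frames and writes one message without String.getBytes(...) or a toByteArray() clone. */
    static void consume(String jsonPayload, OutputStream output) throws IOException {
        // Serialize straight into a ByteArrayOutputStream; in lsp4j this would go through the
        // proposed MessageJsonHandler.serialize(Message, OutputStream, Charset) overload.
        ByteArrayOutputStream content = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(content, CHARSET)) {
            writer.write(jsonPayload); // stand-in for Gson writing the Message
        }

        output.write(("Content-Length: " + content.size() + "\r\n\r\n")
                .getBytes(StandardCharsets.US_ASCII));
        content.writeTo(output); // copies the internal buffer directly, no toByteArray() clone
        output.flush();
    }

    public static void main(String[] args) throws IOException {
        consume("{\"jsonrpc\":\"2.0\",\"method\":\"initialized\"}", System.out);
    }
}
```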

@sebthom changed the title from "perf: eliminate intermediate byte[] copies in StreamMessageConsume" to "perf: eliminate intermediate byte[] copies in StreamMessageConsumer" on Oct 25, 2025
@sebthom force-pushed the StreamMessageConsumer branch from e3c9abe to 4462bbd on November 8, 2025 20:19
@pisv (Contributor) commented Nov 12, 2025

@sebthom Many thanks for all your contributions to the project.

In general, for performance-related improvements I'd like to see more detail about the issue being addressed, including realistic benchmarks and the actual before-and-after measurements.

Sometimes a small amount of micro-optimization can make a huge difference. However, it is important to have evidence that we are optimizing an actual bottleneck. Otherwise, the code can end up harder to maintain, and we may well find that we've either missed the real bottleneck or that our micro-optimizations are harming performance instead of helping.

Again, these general notes apply to all performance-related improvements.
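
Purely as an illustration of what such a micro-benchmark could look like, here is a JMH sketch comparing the two encoding paths (the payload size, method names, and harness wiring are assumptions, not measurements from this PR):

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.Setup;
import org.openjdk.jmh.annotations.State;

@State(Scope.Thread)
public class ConsumeBenchmark {

    String json;
    OutputStream sink;

    @Setup
    public void setup() {
        json = "{\"jsonrpc\":\"2.0\",\"result\":\"" + "x".repeat(16_384) + "\"}";
        sink = OutputStream.nullOutputStream(); // discard the bytes; we only measure the copies
    }

    @Benchmark
    public void viaStringGetBytes() throws IOException {
        byte[] bytes = json.getBytes(StandardCharsets.UTF_8); // old path: String -> byte[]
        sink.write(bytes);
    }

    @Benchmark
    public void viaByteArrayOutputStream() throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream(); // new path: stream encoder
        try (Writer writer = new OutputStreamWriter(buffer, StandardCharsets.UTF_8)) {
            writer.write(json);
        }
        buffer.writeTo(sink);
    }
}
```

Running it with JMH's GC profiler (`-prof gc`) would also report allocation rates, which is the relevant metric for the GC-pressure claim.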

@sebthom (Contributor, Author) commented Nov 12, 2025

I don't see how to provide realistic benchmarks here. What exactly would the criteria be, and which tools would you accept? These PRs address issues like #815 (The current parsing is memory inefficient). These improvements (similar to #816) reduce CPU churn and GC pressure.

@jonahgraham what is your opinion?

@pisv (Contributor) commented Nov 12, 2025

> I don't see how to provide realistic benchmarks.

OK. But have you measured the actual increase in performance somehow?

@pisv (Contributor) commented Nov 12, 2025

In this particular case, the benefit is not that obvious when taking a deeper look at the code.

StreamMessageConsumer.consume before the change:

  • A byte-array is created for a StringWriter (AbstractStringBuilder.value)
  • It is then copied in StringWriter.toString (but note that StringBuffer.toString is annotated with @HotSpotIntrinsicCandidate, so it must be efficient, I guess)
  • A byte-array is created as the result of String.getBytes. (The bulk-encoding to UTF-8, which is the only encoding supported right now in LSP, is a special case, and must be quite efficient)

StreamMessageConsumer.consume after the change:

  • A byte-array is created for a ByteArrayOutputStream (buf)
  • A byte-buffer is allocated for a StreamEncoder of the OutputStreamWriter
  • The StreamEncoder creates a new char array and wraps it into a new CharBuffer in each write call

As we can see, the two implementations are quite different. Which one would be more efficient, and to what extent? I don't know without actually measuring it. But I do know that the implementation before the change looks more straightforward and readable to me; the content is written in exactly the same way as the header. This is just an example to illustrate my general point.
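
For concreteness, a minimal, self-contained sketch of the two write paths just described, with comments marking the allocations listed above (a plain JSON string stands in for the serialized Message; header framing and locking are omitted, and this is not the actual lsp4j code):

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.OutputStreamWriter;
import java.io.StringWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;

public class WritePathComparison {

    /** Pre-change path: StringWriter -> String -> byte[]. */
    static void writeViaString(String json, OutputStream output) throws IOException {
        StringWriter sw = new StringWriter();    // buffer inside the StringBuffer (AbstractStringBuilder.value)
        sw.write(json);                          // stand-in for Gson serializing the Message
        String content = sw.toString();          // copies the buffer into a new String
        byte[] bytes = content.getBytes(StandardCharsets.UTF_8); // bulk-encodes into a new byte array
        output.write(bytes);
    }

    /** Post-change path: ByteArrayOutputStream + OutputStreamWriter -> writeTo. */
    static void writeViaStream(String json, OutputStream output) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();              // byte array (buf)
        try (Writer writer = new OutputStreamWriter(buf, StandardCharsets.UTF_8)) { // StreamEncoder allocates its byte buffer
            writer.write(json);                  // each write goes through the encoder's CharBuffer handling
        }
        buf.writeTo(output);                     // copies buf directly to output, no toByteArray() clone
    }

    public static void main(String[] args) throws IOException {
        String json = "{\"jsonrpc\":\"2.0\",\"method\":\"initialized\"}";
        writeViaString(json, OutputStream.nullOutputStream());
        writeViaStream(json, OutputStream.nullOutputStream());
    }
}
```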
