
perf: streaming TxHash and block-level scratch buffer for deserialization#106

Merged
icellan merged 2 commits into master from perf/block-deser-and-txhash-optimization on Feb 27, 2026

Conversation


@icellan icellan commented Feb 27, 2026

Summary

Two optimizations targeting block deserialization and transaction hashing, measured against a real 3.64 GB testnet block (28,672 txs, block 1681787).

1. Streaming TxHash (eliminates per-tx buffer allocation)

MsgTx.TxHash() previously serialized the entire transaction into a bytes.Buffer and then hashed the buffer. For large transactions (100+ KB), this allocated a buffer proportional to the tx size just to compute a hash.

Fix: Write transaction data directly to sha256.New() (which implements io.Writer) instead of an intermediate buffer. The double-SHA256 is computed as sha256(sha256(tx)) using a stack-allocated [32]byte for the intermediate hash.

Also adds a cachedHash *chainhash.Hash field to MsgTx that lazily caches the computed hash, invalidated by AddTxIn()/AddTxOut(). Applied the same streaming approach to MsgExtendedTx.TxHash().
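A minimal sketch of the caching scheme, assuming a heavily simplified struct (the real MsgTx has many fields and serializes itself rather than holding raw bytes):

```go
package main

import (
	"crypto/sha256"
	"fmt"
)

type hash [32]byte

type msgTx struct {
	raw        []byte // stand-in for the tx's serialized fields
	cachedHash *hash  // nil until the first TxHash() call
}

// TxHash lazily computes and caches the double-SHA256 of the tx.
func (tx *msgTx) TxHash() hash {
	if tx.cachedHash != nil {
		return *tx.cachedHash // cache hit: no hashing, no allocation
	}
	first := sha256.Sum256(tx.raw)
	h := hash(sha256.Sum256(first[:]))
	tx.cachedHash = &h
	return h
}

// Mutators invalidate the cache so a stale hash is never returned.
func (tx *msgTx) AddTxIn(in []byte) {
	tx.raw = append(tx.raw, in...)
	tx.cachedHash = nil
}

func main() {
	tx := &msgTx{raw: []byte{0x01}}
	h1 := tx.TxHash()
	tx.AddTxIn([]byte{0x02}) // invalidates the cache
	fmt.Println(h1 != tx.TxHash())
}
```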

Result: MsgTx.TxHash dropped from 3,758 MB to 0 MB in the allocation profile.

2. Block-level scratch buffer for deserialization (eliminates temporary script allocations)

During MsgTx.Bsvdecode, each script is read into a temporary buffer via scriptFreeList.Borrow(), then copied into a per-tx contiguous buffer. Scripts larger than 512 bytes bypass the pool and get a fresh make([]byte, size) allocation. For a 3.64 GB block, this produced ~3.7 GB of temporary allocations that were immediately discarded after copying.

Fix: Add bsvdecodeWithScratch() method that reads all scripts into a shared scratch buffer. The buffer is passed from MsgBlock.Bsvdecode and reused across all transactions:

  • Reset to len=0 for each tx (capacity preserved)
  • Grows only when a tx has more total script data than any previous tx
  • After reading, scripts are copied into the exact-size per-tx contiguous buffer (unchanged)
  • The scratch buffer's capacity stabilizes after the first few txs
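The scratch-buffer lifecycle above can be sketched as below. Names and shapes are illustrative (the real decoder reads scripts from an `io.Reader` inside `bsvdecodeWithScratch`), but the reuse, growth, and copy-out steps match the description:

```go
package main

import "fmt"

// decodeBlockScripts reuses one scratch buffer across all txs in a block.
// It grows only when a tx needs more script bytes than any previous tx.
func decodeBlockScripts(txScripts [][][]byte) [][]byte {
	var scratch []byte
	var perTx [][]byte
	for _, scripts := range txScripts {
		scratch = scratch[:0] // reset length, keep capacity
		total := 0
		for _, s := range scripts {
			total += len(s)
		}
		if cap(scratch) < total {
			scratch = make([]byte, 0, total) // grow only on a new maximum
		}
		for _, s := range scripts {
			scratch = append(scratch, s...) // "read" into the shared scratch
		}
		// Copy into an exact-size contiguous buffer that outlives the loop,
		// as the real decoder does for each tx.
		contiguous := make([]byte, len(scratch))
		copy(contiguous, scratch)
		perTx = append(perTx, contiguous)
	}
	return perTx
}

func main() {
	out := decodeBlockScripts([][][]byte{
		{[]byte("abc"), []byte("de")},
		{[]byte("x")},
	})
	fmt.Printf("%q\n", out)
}
```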

Also pre-allocates MsgTx structs contiguously in MsgBlock.Bsvdecode (make([]MsgTx, txCount)) instead of individual stack-escape allocations per tx.
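The pre-allocation amounts to the following pattern (with a stand-in struct for wire.MsgTx): one backing slice allocation replaces txCount individual heap allocations that would each escape per loop iteration.

```go
package main

import "fmt"

type msgTx struct{ index int } // stand-in for wire.MsgTx

// preallocTxs allocates all tx structs in one contiguous backing slice
// and hands out pointers into it, instead of new(msgTx) per iteration.
func preallocTxs(txCount int) []*msgTx {
	backing := make([]msgTx, txCount)
	txs := make([]*msgTx, txCount)
	for i := range backing {
		backing[i].index = i // decode into &backing[i] in the real code
		txs[i] = &backing[i]
	}
	return txs
}

func main() {
	txs := preallocTxs(4)
	fmt.Println(txs[0].index, txs[3].index)
}
```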

Single-tx Bsvdecode (non-block path) remains unchanged, using the existing script pool.

Result: Block deserialization allocations dropped from 7,514 MB to 3,790 MB (-49.6%). The remaining 3,716 MB is the irreducible per-tx contiguous script buffer (the actual script data that must live somewhere).

Test changes

  • msg_block_test.go: Added clearBlockTxCaches helper to clear cachedHash before reflect.DeepEqual comparisons
  • msg_tx_test.go: Clear cachedHash before reflect.DeepEqual comparisons
  • msg_extended_tx_test.go: No test changes needed (MsgExtendedTx has no cached hash field)

Test plan

  • All existing go-wire tests pass (go test -count=1 -short ./...)
  • Verified against 3.64 GB testnet block in teranode HandleBlockDirect test
  • TxHash correctness verified by existing TestTxTxHash and TestExtendedTxTxHash tests
  • Block serialization/deserialization roundtrip verified by existing TestBlockSerialize* tests

…tion

TxHash optimization:
- Write tx data directly to sha256.New() instead of serializing into
  an intermediate bytes.Buffer, eliminating a per-tx buffer allocation
  proportional to the transaction size.
- Cache computed hash in MsgTx.cachedHash field, invalidated by
  AddTxIn/AddTxOut. Applied to both MsgTx and MsgExtendedTx.

Block deserialization optimization:
- Add bsvdecodeWithScratch method that reads all scripts into a shared
  scratch buffer (reused across transactions) instead of allocating a
  fresh buffer per large script via scriptFreeList.Borrow.
- MsgBlock.Bsvdecode passes a shared scratch buffer to each tx,
  growing it only when a script larger than any previous one is
  encountered. After each tx, the buffer is reset (len=0) but
  capacity is preserved.
- Pre-allocate MsgTx structs contiguously in MsgBlock.Bsvdecode
  (one slice instead of per-tx heap allocations).

For a 3.64 GB testnet block (28,672 txs):
- Block deserialization: 7,514 MB → 3,790 MB (-49.6%)
- TxHash: 3,758 MB → 0 MB (eliminated from allocation profile)
@icellan icellan requested a review from mrz1836 as a code owner February 27, 2026 13:29
@github-actions github-actions bot added the size/L Large change (201–500 lines) label Feb 27, 2026
@github-actions github-actions bot added the performance Performance improvements or optimizations label Feb 27, 2026
Pre-declare scriptLen variable to avoid shadowing err from outer scope
in the output script reading loop.
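The shadowing pitfall this commit fixes looks roughly like the following: using `:=` inside the loop would create a new `err` scoped to the loop body, shadowing the outer one; pre-declaring the variables lets the loop assign with plain `=`. The function and values here are illustrative, not the library's code.

```go
package main

import "fmt"

// readScripts pre-declares scriptLen and err so the loop assigns to the
// outer variables with =, avoiding a shadowed err inside the loop body.
func readScripts(n int) (uint64, error) {
	var err error
	var scriptLen uint64
	readVarInt := func() (uint64, error) { return 3, nil } // stand-in reader
	for i := 0; i < n; i++ {
		scriptLen, err = readVarInt() // plain =, no new err is created
		if err != nil {
			return 0, err
		}
	}
	return scriptLen, nil
}

func main() {
	l, err := readScripts(2)
	fmt.Println(l, err)
}
```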
@icellan icellan merged commit 9aa279c into master Feb 27, 2026
44 checks passed
@github-actions github-actions bot deleted the perf/block-deser-and-txhash-optimization branch February 27, 2026 13:40
icellan added a commit to bsv-blockchain/teranode that referenced this pull request Feb 27, 2026
Replace subtreeData.Serialize() + storer.Write(bytes) with
subtreeData.WriteTransactionsToWriter(storer, 0, length), streaming
transaction data directly to the blob store FileStorer pipe instead
of serializing into a large intermediate byte slice.

The FileStorer already implements io.Writer via a pipe connected to
SetFromReader, and the file store guarantees atomic writes via
temp file + rename.

Add HandleBlockDirect memory profiling test for large testnet blocks.

Depends on:
- bsv-blockchain/go-bt#117 (SerializeTo)
- bsv-blockchain/go-subtree (streaming WriteTransactionsToWriter)
- bsv-blockchain/go-wire#106 (TxHash + block deserialization)
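The streaming pattern that commit describes, in miniature: the producer writes into one end of an io.Pipe while the consumer reads from the other, so no large intermediate byte slice is built. FileStorer and WriteTransactionsToWriter are the real names; this pipe wiring is an illustrative stand-in.

```go
package main

import (
	"fmt"
	"io"
	"strings"
)

// streamThrough pushes data through an io.Pipe: the writer side plays the
// role of WriteTransactionsToWriter, the reader side the SetFromReader
// consumer inside the blob store.
func streamThrough(data string) string {
	pr, pw := io.Pipe()
	go func() {
		io.Copy(pw, strings.NewReader(data)) // producer: stream tx data
		pw.Close()                           // signal EOF to the reader
	}()
	var sb strings.Builder
	io.Copy(&sb, pr) // consumer: read as the data arrives
	return sb.String()
}

func main() {
	fmt.Println(streamThrough("tx-data"))
}
```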
@icellan icellan restored the perf/block-deser-and-txhash-optimization branch February 27, 2026 14:08