Optimize Integer <-> ByteString conversions #7439
Conversation
zliu41
left a comment
This is quite low level stuff, but I've gone through the logic and it all checks out.
```haskell
    LittleEndian -> pure ()
    BigEndian -> reverseBuffer ptr desiredLength

goSmallLE :: Ptr Word8 -> Int -> Int# -> IO ()
goSmallLE ptr offset remaining#
```
Add a bang to `offset`? Ditto for all the other `Int` arguments.
Good catch, no idea how I missed that.
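For context, the kind of strictness annotation being suggested looks like the sketch below. `sumTo` is a hypothetical stand-in for the PR's loops, not code from this change; the point is just that a bang on each `Int` argument lets GHC keep it unboxed instead of accumulating thunks across iterations:

```haskell
{-# LANGUAGE BangPatterns #-}

-- Hypothetical example (not from the PR). Without the bangs, `acc`
-- would build up a chain of thunks across recursive calls; with them,
-- GHC can unbox both arguments and keep them in registers.
sumTo :: Int -> Int
sumTo n = go 0 1
  where
    go :: Int -> Int -> Int
    go !acc !i
      | i > n = acc
      | otherwise = go (acc + i) (i + 1)

main :: IO ()
main = print (sumTo 100)  -- 5050
```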
```haskell
copyByteArrayToAddr ptr (ByteArray ba#) 0 minLength
case requestedByteOrder of
  LittleEndian -> pure ()
  BigEndian -> reverseBuffer ptr desiredLength
```
Any idea how much overhead `reverseBuffer` has?
The short answer: not very much.
The slightly longer answer: In the 'big endian' case, we have to do a reverse copy. We can do this in one of two ways:
- Manually perform a reverse copy by reading bytes from the source, then writing them to the destination with flipped indices; or
- Copy normally, then reverse in-place (which is what I chose to do).
A reverse copy is always going to have overheads compared to copyByteArrayToAddr, for several reasons. Some of these are inherent: the way caches work on current-day CPUs means that, in general, traversing an array in ascending order of indices will be much faster than in descending order, for example. However, in our case, the main reason is that copyByteArrayToAddr is implemented using memcpy, which the runtime calls without an FFI penalty. Among other things (also for reasons of caching), for arrays that aren't significantly larger than a memory page (8KiB for all the platforms we care about), copyByteArrayToAddr might as well be a constant-time operation. Nothing I could implement would even come close to that.
Reversing in-place, while still carrying a penalty, is less bad, for two reasons:
- The number of loop iterations needed to reverse an array in-place is only half of what a reverse copy would require. In line with what I mentioned above about `copyByteArrayToAddr`'s performance, we actually win as a result.
- The implementation of an in-place reversal, especially when accelerated with loop sectioning, is much less confusing than that of a reverse copy: reverse copies require 'reading behind yourself', which is written weirdly, and if you want to loop-section, you have to worry about byte order as well.
Unsurprisingly, I found reversing in-place was slightly faster, or at least no worse at the upper end of what I benchmarked with. When compared with the 'little endian' case, it's about a factor of 3 overhead at worst.
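The in-place approach can be sketched as follows. This is a simplified illustration, not the PR's `reverseBuffer`: it omits the loop sectioning mentioned above and just shows the core idea of swapping from both ends, which is where the halved iteration count comes from:

```haskell
{-# LANGUAGE BangPatterns #-}

import Data.Word (Word8)
import Foreign.Marshal.Array (peekArray, withArray)
import Foreign.Ptr (Ptr)
import Foreign.Storable (peekElemOff, pokeElemOff)

-- Simplified sketch of an in-place byte reversal: swap the bytes at
-- both ends and walk the indices toward the middle, so only len/2
-- iterations are needed. The real reverseBuffer additionally uses
-- loop sectioning; this version shows only the scalar loop.
reverseBufferSketch :: Ptr Word8 -> Int -> IO ()
reverseBufferSketch ptr len = go 0 (len - 1)
  where
    go !i !j
      | i >= j = pure ()
      | otherwise = do
          x <- peekElemOff ptr i
          y <- peekElemOff ptr j
          pokeElemOff ptr i y
          pokeElemOff ptr j x
          go (i + 1) (j - 1)

main :: IO ()
main = do
  out <- withArray [1, 2, 3, 4, 5 :: Word8] $ \p -> do
    reverseBufferSketch p 5
    peekArray 5 p
  print out  -- [5,4,3,2,1]
```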
I don't understand all of the details yet, but I'm going to merge this to get it out of the way and update the costs. I'll come back with a pencil and paper and check the logic of the conversions after that.
kwxm
left a comment
I'm going to come back and work through the details, but let's merge it.
@kwxm - if you need any assistance understanding what's going on here, I'm happy to explain.
This optimizes the conversions between `Integer` and `ByteString`, as we can now rely on `Integer` internals, which lets us use direct copying instead of digit-by-digit extraction and reconstruction.

I did not changelog this, as this change is designed to be invisible at the API level. This will require these operations to be re-costed, as this change means we should see an algorithmic improvement (from $\Theta(n^2)$ to $\Theta(n)$).
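To illustrate the asymptotic claim: a digit-by-digit extraction in the style this PR replaces looks roughly like the hypothetical sketch below (not taken from the old implementation). Each `quotRem` on an n-limb `Integer` costs $\Theta(n)$, and it runs $\Theta(n)$ times, hence $\Theta(n^2)$ overall; copying limbs directly out of the `Integer`'s internal representation is instead a single $\Theta(n)$ pass.

```haskell
import Data.Word (Word8)

-- Hypothetical digit-by-digit conversion in the style the PR replaces
-- (non-negative inputs only, for simplicity). Each quotRem over an
-- n-limb Integer is Theta(n), and we perform Theta(n) of them,
-- giving Theta(n^2) overall.
toBytesLE :: Integer -> [Word8]
toBytesLE 0 = []
toBytesLE n = fromIntegral r : toBytesLE q
  where
    (q, r) = n `quotRem` 256

main :: IO ()
main = print (toBytesLE 0x01020304)  -- little-endian: [4,3,2,1]
```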