SIMD-0503: Static Sysvars#503

Open
deanmlittle wants to merge 4 commits into solana-foundation:main from blueshift-gg:static-sysvars

Conversation

@deanmlittle
Contributor

No description provided.

@simd-bot

simd-bot bot commented Mar 26, 2026

Hello deanmlittle! Welcome to the SIMD process. By opening this PR you are affirming that your SIMD has been thoroughly discussed and vetted in the SIMD discussion section. The SIMD PR section should only be used to submit a final technical specification for review. If your design / idea still needs discussion, please close this PR and create a new discussion here.

This PR requires the following approvals before it can be merged:

Once all requirements are met, you can merge this PR by commenting /merge.

@deanmlittle changed the title from "Static sysvars" to "SIMD-0503: Static Sysvars" on Mar 26, 2026
@ptaffet-jump
Contributor

I don't think it's a good idea to overload memory translation like this. This leads to some strange behavior where

```asm
lddw r3, 0x494df715 // SOL_RENT_SYSVAR
ldxdw r3, [r3+0]
```

loads from a sysvar, but

```asm
lddw r3, 0x494df714 // SOL_RENT_SYSVAR - 1
add  r3, 1
ldxdw r3, [r3+0]
```

OR

```asm
lddw r3, 0x494df714 // SOL_RENT_SYSVAR - 1
ldxdw r3, [r3+1]
```

gives a different value (if I'm understanding correctly). An ISA where these snippets do not have identical behavior seems cursed to me.

I'm less opposed to putting these sysvars at a known, fixed, appropriately aligned location in an existing read-only data segment, but it doesn't seem better enough than passing in the account to justify a change.
Can't you just require the sysvar accounts to be passed in first, and then use pre-computed offsets into the input section? Checking the pubkey takes like 16 CUs, no?
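A toy model (illustrative only — the key constant and the translation rule below are my reading of the proposal, not the VM's actual implementation) of why the snippets diverge: translation keys on the exact base register value, so a base one off from the magic constant faults even when base plus offset names the same effective address.

```rust
// Hypothetical key value taken from the snippets in this thread.
const SOL_RENT_SYSVAR: u64 = 0x494d_f715;

// Toy translation rule: an exact match on the key maps the relative offset
// into the sysvar's bytes; any other low-32-bit address stays unmapped.
fn translate_load(sysvar_bytes: &[u8], base: u64, offset: i64) -> Result<u64, &'static str> {
    if base == SOL_RENT_SYSVAR {
        // Bounds-checked read of 8 bytes at the relative offset.
        let off = usize::try_from(offset).map_err(|_| "negative offset")?;
        let end = off.checked_add(8).ok_or("out of bounds")?;
        let chunk = sysvar_bytes.get(off..end).ok_or("out of bounds")?;
        Ok(u64::from_le_bytes(chunk.try_into().unwrap()))
    } else if base < 0x1_0000_0000 {
        Err("memory access violation")
    } else {
        Err("normal translation not modeled here")
    }
}

fn main() {
    let rent = 42u64.to_le_bytes(); // stand-in for the first field of Rent
    // lddw r3, KEY; ldxdw r3, [r3+0]  -> reads the sysvar
    assert_eq!(translate_load(&rent, SOL_RENT_SYSVAR, 0), Ok(42));
    // lddw r3, KEY - 1; ldxdw r3, [r3+1]  -> same effective address, but the
    // base register does not match the key, so it faults instead of aliasing.
    assert!(translate_load(&rent, SOL_RENT_SYSVAR - 1, 1).is_err());
}
```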

@deanmlittle
Contributor Author

deanmlittle commented Mar 26, 2026

You have perfectly demonstrated why this technique is safe and doesn't interfere with any existing APIs or instructions. It simply makes dereferencing certain 32-bit addresses that currently point to nothing resolve to a global variable instead of raising a memory access error.

I am not sure you have considered the performance benefits. We can load all sysvars once per slot for less than the cost of loading in the largest one just once, and consume them N times within a slot.

Beyond this, it makes writing onchain programs incredibly ergonomic and performant. There really are no downsides and some rather extreme performance and devex benefits. I am yet to see a developer whose mind wasn't blown by how easy it could be for them to write programs with this method.

@alnoki

alnoki commented Mar 26, 2026

I am yet to see a developer whose mind wasn't blown by how easy it could be for them to write programs with this method

+1, please land this

@ptaffet-jump
Contributor

How do you access the other fields of the sysvar? E.g. the epoch value from the clock sysvar?

Don't forget that making address translation in the VM more expensive makes every program slower, even if that's not measurable in CUs.

@deanmlittle
Contributor Author

deanmlittle commented Mar 27, 2026

How do you access the other fields of the sysvar? E.g. the epoch value from the clock sysvar?

It's actually super simple. As you've demonstrated, all existing behavior remains unmodified. The only change is that a previously invalid dereference to this address is now valid. Due to the fact that:

  • We are utilizing an address in the 32-bit range (extended to 64 bits with upper 32 bits masked out in this case, as our wonderful Rust/LLVM/BPF doesn't currently have a good way to produce ALU32 instructions without inline assembly)
  • All dereferences for any address under 0x0100000000 are currently invalid
  • Write (stx) instructions remain unchanged and continue to fail as they do now
  • All dereferences (ldx) require a relative offset
  • The translated address is bound-checked

We simply treat any dereference to this address as the base, and its offset as a relative offset to that address.

In other words, if 0xff395088u64 is the start address of our Clock, then:

```asm
lddw r3, 0xff395088 // SOL_CLOCK_SYSVAR
ldxdw r4, [r3+0x00] // load current slot into r4
ldxdw r5, [r3+0x20] // load current timestamp into r5
```

By virtue of none of our existing sysvars having a length in excess of i16::MAX (32,767 bytes), relative offsets from a known base address are sufficient to encapsulate all existing sysvar account values.
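For concreteness, the offsets in the snippet line up with the Rust SDK's Clock field order; a sketch (the repr(C) layout here is my assumption for illustration — every field is 8 bytes, so there is no padding):

```rust
// Clock layout implied by the offsets above (field order per the Rust SDK).
#[repr(C)]
struct Clock {
    slot: u64,                  // +0x00
    epoch_start_timestamp: i64, // +0x08
    epoch: u64,                 // +0x10
    leader_schedule_epoch: u64, // +0x18
    unix_timestamp: i64,        // +0x20
}

fn main() {
    assert_eq!(core::mem::offset_of!(Clock, slot), 0x00);
    assert_eq!(core::mem::offset_of!(Clock, unix_timestamp), 0x20);
    // Well under i16::MAX, so a 16-bit relative offset reaches every field.
    assert!(core::mem::size_of::<Clock>() <= i16::MAX as usize);
}
```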

Don't forget that making address translation in the VM more expensive makes every program slower, even if that's not measurable in CUs.

This only happens globally once per slot. The overhead per transaction is a single fat pointer to the sysvar cache. Absolutely negligible considering how much we already copy just to process a transaction. Consider that by comparison today, a single sysvar account load in a single transaction (of which there are MANY more than one per slot) requires serializing >10kb of data, the majority of which is never even used in the VM, whereas the total aggregate of all sysvars today is <30kb – less than the cost of serializing our single largest sysvar account one time, or our smallest sysvar account 3 times in any given slot.
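The arithmetic can be made concrete. The sizes below are my own assumptions for illustration (a 512-entry cap on SlotHashes with 40-byte entries and an 8-byte length prefix), not figures stated in this thread:

```rust
// Assumed serialized size of SlotHashes, the largest sysvar:
// 8-byte vector length + 512 entries of (u64 slot, 32-byte hash).
fn slot_hashes_size() -> usize {
    8 + 512 * (8 + 32)
}

fn main() {
    let largest = slot_hashes_size();
    assert_eq!(largest, 20_488); // ~20 KiB
    // Per the comment above, all sysvars together are < 30 KiB, so copying
    // everything once per slot costs less than serializing just this one
    // sysvar into two separate transactions.
    assert!(30 * 1024 < 2 * largest);
}
```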

Contributor

@Lichtso Lichtso left a comment


By opening this PR you are affirming that your SIMD has been thoroughly discussed and vetted in the SIMD discussion section.

Did I miss anything?


## Summary

Leverage existing static linking infrastructure of JIT compilation to enable
Contributor

@Lichtso Lichtso Mar 27, 2026


static linking

Nit on terminology: I know we called them "static syscalls" but they are not static linking (that would imply we would inline their implementation into the SBPF ELF at or before deployment), instead they are still dynamic linking but relocation-less.

existing ... infrastructure

The infrastructure for "static syscalls" is for syscalls only, as the name implies. Linking of (in this case readonly) data is a different process and there is no existing infrastructure to utilize.

JIT compilation

I know we have used JIT-only mode on MNB in Agave for years but with the next version we could have tiered compilation (Interpreter & JIT hybrid). Also Firedancer is using an Interpreter.


1. Invoke a specific getter syscall
2. Invoke the get_sysvar syscall, or
3. Include a Sysvar account in their program.
Contributor


Obligatory https://xkcd.com/927/.

That said, yes we could improve upon sysvars and the sysvar syscalls are not great.

other sysvar accounts, such as the murmur hash of `0xff395088` for `Clock`.

Ergo, we can safely and performantly expose any available global variable to
the VM without the overhead of additional account loads or syscall invocation.
Contributor


Again, this thinks about the JIT only, which is not protocol, it is one possible implementation. What about an interpreter? It would have to do this in address translation on every memory access.

Also, while maybe not that impactful, it would also make the JIT implementation more complex and thus compilation slower as this is a form of macro-op fusion which requires state to be tracked between the instructions. Currently we can compile every instruction independently.

The downsides of all of these approaches are threefold:

1. Sysvar values are globals that are always available to the validator, but
aren't exposed as globals during execution. This is a clunky anti-pattern
Contributor

@Lichtso Lichtso Mar 27, 2026


Sysvar accounts are already loaded for all transactions even if their message does not mention the sysvar key.

Scratch that, it only applies to the sysvar cache but it should be easy to expand to transaction accounts too.

resulting in degraded developer experience.
2. Invoking a syscall to access a global requires both a stack allocation and
halting execution – an immense amount of overhead just to read a value that
is already readily available.
Contributor


Agreed, sysvar syscalls are overkill for reading constant data.

halting execution – an immense amount of overhead just to read a value that
is already readily available.
3. Having to pass in an account to access a global results in a 10kb penalty
to data serialization and has negative implications for composability.
Contributor


Additionally, depending on the design of the entrypoint a program must also find the instruction account of the sysvar, but that will be fixed by SIMD-0449.

10kb penalty to data serialization

That part will be gone after direct mapping and the CU charging adjustments of SIMD-0452, which I still have to rewrite.

negative implications for composability

I think that is the remaining actual downside of sysvar accounts: They waste key space in the instruction invocation.

## Alternatives Considered

- Leverage JIT intrinsics to provide similar syscall functionality.
- Don't improve the existing design of sysvars/syscalls.
Contributor


I think taking another look at solving this problem using sysvar accounts is worthwhile. The gap to bridge there might be relatively small (see https://github.com/solana-foundation/solana-improvement-documents/pull/503/changes#r3002020193).

@Lichtso
Contributor

Lichtso commented Mar 27, 2026

All dereferences for any address under 0x0100000000 are currently invalid

In SIMD-0189 we moved the readonly-data down into the 32 bit address range as most 64 bit load immediate (lddw) only want to load addresses to that and can thus be shrunken to 32 bit load immediate.

So, the murmur hash and references to readonly-data could collide.

@deanmlittle
Contributor Author

deanmlittle commented Mar 27, 2026

All dereferences for any address under 0x0100000000 are currently invalid

In SIMD-0189 we moved the readonly-data down into the 32 bit address range as most 64 bit load immediate (lddw) only want to load addresses to that and can thus be shrunken to 32 bit load immediate.

So, the murmur hash and references to readonly-data could collide.

It's always a fun time discovering new and exciting reasons why the runtime has continued to trend from "functional" to "borderline unusable" via SIMD trivia. I can guarantee you that devs would unanimously prefer to maintain the viability of the method laid out in this SIMD over curing you of your irrational hatred of lddw. That should be a sign to anyone who takes their end customers seriously to revert the change.


```asm
lddw r3, 0x494df715 // SOL_RENT_SYSVAR
ldxdw r3, [r3+0]
```
Contributor


This would benefit from explaining that sysvars are mapped in as byte slices starting at the murmur hash as address. Otherwise it invites the interpretation that all sysvars are supposed to be accessed via a single u64 value, which would limit them to 8 bytes.
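A sketch of that byte-slice reading model (illustrative; `read_u64` and the offsets are my own framing, with the field order taken from the Rust SDK's Rent: u64 lamports_per_byte_year, f64 exemption_threshold, u8 burn_percent):

```rust
// Each sysvar is exposed as a bounds-checked byte slice based at its hash
// address, so any field is reachable with a relative offset -- the sysvar is
// not limited to a single 8-byte value at the base.
fn read_u64(sysvar: &[u8], offset: usize) -> Option<u64> {
    let bytes = sysvar.get(offset..offset.checked_add(8)?)?;
    Some(u64::from_le_bytes(bytes.try_into().ok()?))
}

fn main() {
    // Hand-built 17-byte Rent image with example values.
    let mut rent = [0u8; 17];
    rent[..8].copy_from_slice(&3_480u64.to_le_bytes()); // lamports_per_byte_year
    rent[8..16].copy_from_slice(&2.0f64.to_le_bytes()); // exemption_threshold
    rent[16] = 50;                                      // burn_percent
    assert_eq!(read_u64(&rent, 0), Some(3_480));
    // Reads past the slice fail the bounds check instead of aliasing into
    // neighboring memory.
    assert_eq!(read_u64(&rent, 16), None);
}
```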

1. We must ensure no intra-slot mutability of any exposed globals.
2. We must ensure static sysvar pointers remain synchronized with SysvarCache
at the slot boundary.
3. Despite being 32-bit hashes, it is important that we cast to u64 first as
Contributor


Alternatively one could also only use the 31 LSBs of the hash / mask out the 1 MSB and always load a i32 immediate value.
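A sketch of the sign-extension hazard behind this suggestion, using the hash value quoted earlier in the thread (purely illustrative):

```rust
// A 32-bit immediate with its top bit set sign-extends when widened to 64
// bits, so the register no longer holds the intended low-32-bit address.
// Masking the hash to 31 bits keeps the top bit clear and sidesteps this.
const CLOCK_HASH: u32 = 0xff39_5088; // hypothetical key from the thread

fn main() {
    // Sign-extending widen (what an i32 immediate load does):
    assert_eq!(CLOCK_HASH as i32 as i64 as u64, 0xffff_ffff_ff39_5088);
    // Zero-extending widen (what the proposal needs):
    assert_eq!(CLOCK_HASH as u64, 0x0000_0000_ff39_5088);
    // 31-bit variant: sign- and zero-extension now agree.
    let masked = CLOCK_HASH & 0x7fff_ffff;
    assert_eq!(masked as i32 as i64 as u64, masked as u64);
}
```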

@LucasSte
Contributor

We are utilizing an address in the 32-bit range (extended to 64 bits with upper 32 bits masked out in this case, as our wonderful Rust/LLVM/BPF doesn't currently have a good way to produce ALU32 instructions without inline assembly)

BPF upstream has the +alu32 target feature which achieves that. If generating LDDW, instead of mov32, is a problem, please open an issue in our repo, and we'll fix that.

@LucasSte
Contributor

It's always a fun time discovering new and exciting reasons why the runtime has continued to trend from "functional" to "borderline unusable" via SIMD trivia.

I don't think this is an appropriate comment for this discussion. You have read and approved the SIMD yourself: #189 (review).

@Lichtso
Contributor

Lichtso commented Mar 27, 2026

It's always a fun time discovering new and exciting reasons why the runtime has continued to trend from "functional" to "borderline unusable" via SIMD trivia.

First of all, you approved the SIMD yourself, it did not pass you by unnoticed. You simply, like anybody else, did not have the foresight that there could be more interactions with future ideas.

I can guarantee you that devs would unanimously prefer to maintain the viability of the method laid out in this SIMD

Devs would prefer any method that achieves the same low-cost direct access to sysvars, independent of whether it is this proposal or some alternative.

over curing you of your irrational hatred of lddw.

Which you seem to be infected with too, reciting this very proposal:

"this would be more ideal, as it would save 8 bytes of binary size."

That should be a sign to anyone who takes their end customers seriously to revert the change.

What if I told you there is an alternative which achieves the same interface and CU costs but does not need to revert SBPFv3?

@deanmlittle
Contributor Author

deanmlittle commented Mar 27, 2026

We are utilizing an address in the 32-bit range (extended to 64 bits with upper 32 bits masked out in this case, as our wonderful Rust/LLVM/BPF doesn't currently have a good way to produce ALU32 instructions without inline assembly)

BPF upstream has the +alu32 target feature which achieves that. If generating LDDW, instead of mov32, is a problem, please open an issue in our repo, and we'll fix that.

Yeah, we can do it with upstream already too. The issue is that if we tried to force 32 bits, the feature being safe becomes inherently tied to a compiler version, which doesn't sound great. If v3 was a feature branch until the feature gates activated and only shipped to master when it was fully baked, it would have been fine to assume alu32. In the absence of that, lddw works just fine here. The main issue is just managing sign extension.

@deanmlittle
Contributor Author

First of all, you approved the SIMD yourself, it did not pass you by unnoticed. You simply, like anybody else, did not have the foresight that there could be more interactions with future ideas.

It has now become apparent why you wanted to sneak this unrelated design choice into the header restrictions. I wish I had understood your motivation more clearly at the time instead of rubber stamping it after ripping out all of the bloat.

What if I told you there is an alternative which achieves the same interface and CU costs but does not need to revert SBPFv3?

I'd say you're about to shill me on a new memory address prefix in the upper 32-bits, locking us into 64-bit targets for good. Sad if it has to come to that. Also, was absolutely not suggesting to revert all of V3, just the silly 32-bit mapping change. Ironically, in your quest to kill lddw, you've just made it more useful 🙃
