What is the need for having a tag bit to have capabilities? #549

SaiVK · 2025-06-27T13:58:28Z

SaiVK
Jun 27, 2025

Hello
I have a couple of design questions:

Why does the CHERI model require tag bits?
Can we achieve the same security guarantees without having a tag bit?

I went through the cheri-iot doc, which mentions that tag bits prevent tampering with capabilities. Can we achieve this even without the tag bits?
One case I could think of is a struct containing both a pointer and an array. If intra-struct overflow is not prevented, then the capability could easily be corrupted in the absence of tag bits. But we could just convert the array to a pointer and allocate the array separately, which would prevent capability tampering via intra-object overflows. Is there any subtle security aspect that I am missing w.r.t. tag bits?

Thanks
-Sai

Answered by davidchisnall

Jun 27, 2025


davidchisnall · 2025-06-27T14:16:39Z

davidchisnall
Jun 27, 2025
Maintainer

The tag bit is how you tell the difference between a pointer and some other data. The point of the tag bit is to ensure that the rest of the metadata (bounds, permissions, and so on) is trustworthy. It is an attestation from the hardware that says that, if you see a value with the tag bit set, it is a valid pointer that has been derived from some pointer at least equally powerful. Without it, you can just do two data writes and now have a pointer with arbitrary bounds or permissions.
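To make the "two data writes" point concrete, here is a minimal C sketch. It assumes a hypothetical untagged fat pointer made of a 32-bit address and a 32-bit metadata word; this is not the real CHERIoT encoding, and `untagged_ptr`, `forge`, `load_ptr`, and `META_EVERYTHING` are names invented for illustration. The forged value is bit-for-bit indistinguishable from a legitimate one, because nothing records how it was created.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical untagged fat pointer: NOT the real CHERIoT encoding. */
typedef struct {
    uint32_t address;  /* where the pointer points */
    uint32_t metadata; /* stand-in for bounds and permissions */
} untagged_ptr;

/* Stand-in for "all permissions, whole address space". */
#define META_EVERYTHING 0xFFFFFFFFu

/* The attack: two ordinary 32-bit data writes into raw memory. */
void forge(uint32_t *raw, uint32_t target)
{
    raw[0] = target;          /* write 1: the address */
    raw[1] = META_EVERYTHING; /* write 2: the metadata */
}

/* Reinterpret raw memory as a pointer, as a pointer load would. */
untagged_ptr load_ptr(const uint32_t *raw)
{
    untagged_ptr p;
    /* The model relies on the struct being exactly two 32-bit words. */
    assert(sizeof(untagged_ptr) == 2 * sizeof(uint32_t));
    memcpy(&p, raw, sizeof p);
    return p;
}
```

Calling `forge(raw, target)` on any writable `uint32_t raw[2]` and then `load_ptr(raw)` yields a "pointer" to `target` with full rights. With a tag bit, the two data writes would leave the tag clear, so the hardware would refuse to use the result as a pointer.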

In other systems, there are (broadly) two alternatives:

You can sign pointers using some cryptographic primitive. For this to be secure, you need to make them a lot bigger. For example, Arm's PAC can use 24 bits for the signature. That sounds like a lot: each guess has a 1 in 2^24 chance of being correct, so brute force needs about 2^23 attempts on average. But you can do your guesses in speculation and use timing side channels, so you can trivially forge PAC pointers if you already have arbitrary code execution. Most of the places PAC is used are trying to prevent you from getting arbitrary code execution, but that isn't sufficient as a building block for compartmentalisation.

The other alternative is to restrict where capabilities can be stored, storing them in special tables. This doesn't work well for C, because your C pointer now has to be an index into a table. This means temporal safety is hard because you don't have a way of differentiating integers that are indexes into a capability table from integers that are just integers. You can remove access to a specific object, but you can't prevent use-after-free from the same compartment (and sharing becomes a lot harder).
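The temporal-safety problem with capability tables can be sketched in a few lines of C. This is a toy model under stated assumptions: a fixed four-slot table, first-free-slot reuse, and objects identified by an integer ID. The names `table_alloc`, `table_free`, and `table_lookup` are hypothetical and do not correspond to any real system's API.

```c
#include <assert.h>
#include <stdbool.h>

#define TABLE_SLOTS 4

/* One slot of a hypothetical per-compartment capability table. */
typedef struct {
    bool in_use;
    int  object_id; /* stand-in for the real capability */
} table_entry;

static table_entry table[TABLE_SLOTS];

/* Install an object and return its handle: a plain integer index. */
int table_alloc(int object_id)
{
    for (int i = 0; i < TABLE_SLOTS; i++) {
        if (!table[i].in_use) {
            table[i].in_use = true;
            table[i].object_id = object_id;
            return i;
        }
    }
    return -1; /* table full */
}

/* Release a slot. Copies of the handle held elsewhere are untouched,
 * because they are ordinary integers that cannot be found and revoked. */
void table_free(int handle)
{
    assert(handle >= 0 && handle < TABLE_SLOTS);
    table[handle].in_use = false;
}

/* Resolve a handle; returns -1 if the slot is empty or out of range. */
int table_lookup(int handle)
{
    if (handle < 0 || handle >= TABLE_SLOTS || !table[handle].in_use) {
        return -1;
    }
    return table[handle].object_id;
}
```

If a handle is freed and the slot is reused, the stale integer silently resolves to the new object. Nothing distinguishes the stale copy from any other integer, so it cannot be invalidated the way a tagged pointer can.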

(Aside: There's only one I in CHERIoT)

0 replies

SaiVK · 2025-06-27T15:00:17Z

SaiVK
Jun 27, 2025
Author

Thanks David, I understood the gist. I have a few follow up questions:

Without it, you can just do two data writes and now have a pointer with arbitrary bounds or permissions.

How realistic is this attack? Generally, direct manipulation of pointers, like casting from int to pointer, occurs in driver code, low-level systems code, or assembly code. And the compiler can emit warnings, so that we can sort of inspect those specific functions alone and ensure that pointer metadata is correct and not corrupted. In the remaining places, it is just pointer arithmetic, where overflows can't corrupt a capability, since the metadata is in the upper 32 bits.

Also, I am not sure how CHERI instruments (inline)assembly code and handling of int to pointer casts? Based on my understanding, the assembly code is manually rewritten using CHERI purecap instructions.
Why does CHERIoT not support hybrid-cap mode? Was it due to the hardware overheads? Let's say I want to run some legacy library for which I might not be able to recompile. In that case, having support for normal load/store would be helpful, right?

1 reply

davidchisnall Jun 27, 2025
Maintainer

How realistic is this attack?

That depends a lot on the code and the threat model. We want CHERI (in general, and CHERIoT in particular) to be usable for software compartmentalisation. Part of our threat model is that you have arbitrary code provided by a third party and need to enforce memory safety when you run it in a compartment.

Even if you don't want memory safety as a building block for compartmentalisation, it's still quite a common attack vector. You have things like unions of pointers and integers, buffers with imprecise bounds with pointers after them, type-erased things with pointers, and so on. Lots of ways of tricking something into overwriting an integer with a pointer.

And the compiler can emit warnings, so that we can sort of inspect those specific functions alone and ensure that pointer metadata is correct and not corrupted.

That requires two things:

Your compiler to be in the TCB (we try to avoid that)
You to be willing to make large changes to your code (we try really hard to avoid that)

In the network stack that we're using (completely unmodified), for example, it stores a pointer in the header to the packet buffer. This whole thing is passed around as a char*. You find the header by subtracting from the start and casting. That's not necessarily how I would have written the code, but if you're going to require people to rewrite their code in a different style, now you have added a lot of friction. A load of idioms that are really important to work for low-level code depend on this kind of thing. And if you don't have memory safety for the most important bits of your systems code, what's the point? Everything else will be broken as a result of the bugs in the core parts.
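For readers unfamiliar with the idiom, here is a minimal C sketch of a header stored immediately before a payload and recovered by subtracting and casting. The type and function names (`packet_header`, `packet_alloc`, `packet_header_of`, `packet_free`) are invented for illustration and are not taken from the network stack mentioned above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical header that lives immediately before the payload. */
typedef struct {
    void  *owner;  /* a pointer stored alongside raw packet bytes */
    size_t length; /* payload size in bytes */
} packet_header;

/* Allocate header plus payload; callers only ever see the payload. */
char *packet_alloc(size_t length, void *owner)
{
    packet_header *header = malloc(sizeof *header + length);
    if (header == NULL) {
        return NULL;
    }
    header->owner = owner;
    header->length = length;
    return (char *)(header + 1);
}

/* Recover the header from the char* by subtracting and casting. */
packet_header *packet_header_of(char *payload)
{
    assert(payload != NULL);
    return (packet_header *)(payload - sizeof(packet_header));
}

/* Free the whole allocation through the payload pointer. */
void packet_free(char *payload)
{
    if (payload != NULL) {
        free(packet_header_of(payload));
    }
}
```

Here a real pointer (`owner`) sits in the same allocation as bytes that arrive from the network, and the code reaches it through a `char*`. A compiler warning on every such cast would flag exactly the code that most needs to keep working unmodified.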

Also, I am not sure how CHERI instruments (inline)assembly code and handling of int to pointer casts? Based on my understanding, the assembly code is manually rewritten using CHERI purecap instructions.

CHERI doesn't instrument any assembly. CHERI loads, stores, and jump instructions all take a capability (pointer) in a register as the base. These instructions then perform the relevant (bounds, permissions, sealing) checks.
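As a rough software model of what such a load does, consider the C sketch below. The struct layout, the names (`model_cap`, `model_load_byte`, `cap_result`), and the order of the checks are assumptions made for illustration; real hardware uses a compressed encoding and its own fault priorities.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

typedef enum {
    CAP_OK,
    CAP_FAULT_TAG,
    CAP_FAULT_SEAL,
    CAP_FAULT_PERMISSION,
    CAP_FAULT_BOUNDS
} cap_result;

/* Simplified, uncompressed capability: NOT the real encoding. */
typedef struct {
    bool     tag;      /* set only by the (modelled) hardware */
    bool     sealed;   /* sealed capabilities cannot be dereferenced */
    bool     can_load; /* stand-in for the load permission */
    uint32_t base;     /* lowest accessible address */
    uint32_t length;   /* number of accessible bytes from base */
    uint32_t address;  /* current cursor */
} model_cap;

/* Model of a one-byte load through a capability register. */
cap_result model_load_byte(const model_cap *cap, const uint8_t *memory,
                           size_t memory_size, uint8_t *out)
{
    if (!cap->tag) {
        return CAP_FAULT_TAG;
    }
    if (cap->sealed) {
        return CAP_FAULT_SEAL;
    }
    if (!cap->can_load) {
        return CAP_FAULT_PERMISSION;
    }
    if (cap->address < cap->base ||
        cap->address - cap->base >= cap->length) {
        return CAP_FAULT_BOUNDS;
    }
    /* Model invariant: valid capabilities stay inside simulated memory. */
    assert(cap->address < memory_size);
    *out = memory[cap->address];
    return CAP_OK;
}
```

The point of the model is that every check lives in the load itself. Assembly code gets the same checks as compiled C, because there is no other way to reach memory than through a capability operand.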

Why does CHERIoT not support hybrid-cap mode?

It's a lot of complexity in hardware for no benefit in our use case.

Let's say I want to run some legacy library for which I might not be able to recompile.

Realistically, how many RV32E binary-only libraries for embedded systems exist? If we were able to run unmodified Arm (AArch32) binaries that might be different (there are a bunch of embedded ARMv6 or earlier binaries where people no longer have access to the source code), but there are not a lot of RV32E things where the source code is unavailable. Do you have RV32E binaries that you'd want to link into a CHERIoT compartment?

Answer selected by SaiVK

SaiVK · 2025-06-27T16:28:32Z

SaiVK
Jun 27, 2025
Author

Thanks a lot David. I understand the model better now. On legacy library support, I forgot that CHERIoT is targeted at RISC-V. And yes, it makes sense: since RISC-V is a recent architecture, supporting purecap only is a good tradeoff.

Thanks David for taking time and explaining it in detail.

-Sai

0 replies

adam-3bian · 2025-06-28T09:01:37Z

adam-3bian
Jun 28, 2025

On how realistic this attack is: in embedded systems, you often see mixes of pointers and integers, the use of unions, struct overlays and data passed around as void* or char*. In those cases, it’s easy for a value that looks like a pointer to be forged with just two memory writes, especially if metadata like bounds is stored inline.

As for whether a compiler can warn about this, I guess it can! But only if you control all the code, can afford to inspect every warning and include the compiler in your TCB.

Hybrid capability mode makes sense when there’s a legacy base to support. But if your libraries are open source, hybrid support adds complexity and recompiling is probably the better solution from the point of view of security and performance.

One extra note on tag bits. They're stored in a hidden metadata plane that software cannot write directly, and any ordinary data store to a location clears its tag. So copying a pointer byte by byte copies the bits but not the tag, and the copy is no longer a valid pointer. Only capability load and store instructions preserve the tag. That's one of the key ways CHERI enforces non-forgeability even during low-level memory handling.
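A small C model may help show the difference between the two kinds of copy. It assumes 8-byte capability granules with one tag bit each, kept in a separate array to stand in for the hidden metadata plane; the names (`tagged_memory`, `store_byte`, `copy_capability`, `copy_bytewise`) are hypothetical.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define GRANULES      8 /* capability-sized slots in the model */
#define GRANULE_BYTES 8 /* assumed capability width in bytes */

/* Data bytes plus one out-of-band tag bit per granule. */
typedef struct {
    uint8_t bytes[GRANULES][GRANULE_BYTES];
    bool    tags[GRANULES];
} tagged_memory;

/* Ordinary data store: writes the byte and clears the granule's tag. */
void store_byte(tagged_memory *m, size_t granule, size_t offset,
                uint8_t value)
{
    assert(granule < GRANULES && offset < GRANULE_BYTES);
    m->bytes[granule][offset] = value;
    m->tags[granule] = false;
}

/* Capability-width copy: moves the bits and the tag together. */
void copy_capability(tagged_memory *m, size_t dst, size_t src)
{
    assert(dst < GRANULES && src < GRANULES);
    memmove(m->bytes[dst], m->bytes[src], GRANULE_BYTES);
    m->tags[dst] = m->tags[src];
}

/* Byte-by-byte copy built from data stores: same bits, tag cleared. */
void copy_bytewise(tagged_memory *m, size_t dst, size_t src)
{
    assert(dst < GRANULES && src < GRANULES);
    for (size_t i = 0; i < GRANULE_BYTES; i++) {
        store_byte(m, dst, i, m->bytes[src][i]);
    }
}
```

After `copy_bytewise`, the destination holds identical bytes with the tag clear, so it is just data. As I understand it, this is why memcpy implementations on CHERI systems use capability-width loads and stores where alignment allows, so that pointers inside copied structures stay valid.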

0 replies
