Support Layer Deltas #494

vraiti · 2025-11-25T18:15:21Z

Introduction

This PR is mainly a rebase of a very old PR from the containers/image repository by @alexlarsson: containers/image#902. It also adds a small amount of extra debug logging to make the optimizations of this PR easily observable.

Rebased PR Info

The matter of network-efficient container updates has been gaining attention from developers in edge computing as container-based technologies to manage edge device applications have gained traction (e.g. Microshift, flightctl):

So, I was investigating how we might support this need for efficient updates, when I came across Alex's PR to containers/image. While it's far from a perfect solution (garbage collection and race conditions were two big concerns that came up in the discussion), the "best" solution would likely be an extension of the OCI spec, which would also likely require much stronger proven demand to justify. After some discussion with engineers from these BU mentioned above as well as @mheon, we decided that rebasing Alex's PR may be a suitable first step.

As all of the logic needed for this PR was already implemented, just in a different codebase, this seemed like an appropriate task for an LLM. In accordance with Red Hat's AI policy, I've annotated the commits as AI generated. While I've done my best to review the generated code, the low-level details of this repository are unfamiliar to me, so I can't conclusively say on my own whether or not this PR is doing all the right things. However, according to my testing everything seems to be working as expected. A demo of the layer delta patch using Skopeo can be found in this repository: https://github.com/vraiti/oci-deltas.

Original PR Info

NOTE: This was mostly copied from containers/image#902. Only change was to the sample manifest media type and layer annotation names.

Deltas are a way to avoid downloading a full copy if a layer tar file
if you have a previous version of the layer available locally. In
testing these deltas have been shown to be around 10x to 100x smaller
than the .tar.gz files for typical Linux base images.

In the typical client-side case we have some previous version of the
image stored in container-storage somewhere, which means that we have
an uncompressed files available, but not the actual tarball
(compressed or not).

This means we can use github.com/alexlarsson/tar-diff which takes
two tar files and produces a delta file which when applied on the
untar:ed content of the first tarfile produces the (bitwise identical)
content of the uncompressed second tarfile. It just happens that the
uncompressed tarfile is exactly what we need to reproduce, because that
is how the layers are refered to in the image config (the DiffIDs).

How this works is that we use OCI artifacts to store, for each regular
image a manifest with information about the available deltas for the
image. This image looks like a regular manifest, except each layer
contains a tar-diff (as a blob) an uses the existing annotations key
to record which DiffIDs the layer applies to.

For example, a manifest would look like this:

{
  "schemaVersion": 2,
  "config": {
    "mediaType": "application/vnd.oci.image.config.v1+json",
    "digest": "sha256:ca3d163bab055381827226140568f3bef7eaac187cebd76878e0b63e9e442356",
    "size": 3
  },
  "layers": [
    {
      "mediaType": "application/vnd.tar-diff",
      "digest":
"sha256:49402288de20a465616174a38aca4746f46be2c3f9519fe4d14fc7f83f44a32a",
      "size": 7059734,
      "annotations": {
          "io.github.containers.delta.from":
"sha256:b9137868142acd7ce4d62216e2b03e63e9800e2b647bf682492d3e9c5e66277c",
          "io.github.containers.delta.to":
"sha256:c88d2d437799c2879fded33ee358429e1eb954968a25f3153e2e0e26fef7ef28"
      }
    }
  ]
}

The config blob is just an json file containing "{}". Ideally it
should not be of type application/vnd.oci.image.config.v1+json,
because that is reserved for docker-style images. However, as
explained in oras-project/oras#129, docker hub
doesn't currently support any other type. For registries that support
OCI artifacts we should instead use some other type so that tooling
can know that this is not a regular image.

The way we attach the delta manifest to the image is that we store it
in the same repo and a tag name based on the manifest digest like
"delta-${shortid}".

The delta layers record which DiffID they apply to, which is what we
want to use to look up the pre-existing layers to use as delta source
material, and it is what the delta apply will generate. This means
however that using the deltas only works if we're allowed to
substitute blobs, but this doesn't seem to be an issue in the typical
case.

podmanbot · 2025-11-25T18:16:55Z

✅ A new PR has been created in buildah to vendor these changes: containers/buildah#6535

mtrmac

https://github.com/containers/container-libs/blob/main/CONTRIBUTING.md#sign-your-prs please, we can’t even look at PRs with unclear copyright status.

mtrmac · 2025-11-25T18:27:15Z

we can’t even look at PRs with unclear copyright status.

… so I’m expressing no opinion on the goal or desirability of the PR at this point.

vraiti · 2025-11-25T18:37:20Z

@mtrmac done

Signed-off-by: Kyounghoon Jang <[email protected]>

vraiti · 2025-11-25T18:50:26Z

@mtrmac ah sorry I misread. Thought that meant cryptographic signing. Commits are actually signed off now

packit-as-a-service · 2025-11-25T18:59:59Z

Packit jobs failed. @containers/packit-build please check.

Generated-By: Claude Code Signed-off-by: Vance <[email protected]>

mheon · 2025-11-25T19:23:08Z

So I imagine y'all will want a test build of Podman with this patch to validate we're seeing the benefits we expect from deltas?

mtrmac

Fair warning, without reading this ~at all:

If this should be included, it would require a lot of restructuring, starting with the external API concerns mentioned in the original PR (those are now possible to solve but tedious)
Given the existence of zstd:chunked nowadays, and unlocked staging of layers on pull, the benefits would have to be very compelling for us to add and maintain yet another pull path.

mtrmac · 2025-11-25T19:29:52Z

common/libnetwork/cni/network.go

vraiti · 2025-11-25T19:43:59Z

So I imagine y'all will want a test build of Podman with this patch to validate we're seeing the benefits we expect from deltas?

@alexlarsson was gracious enough to also create a patch of Skopeo with a new generate-delta command. I've rebased that as well: https://github.com/vraiti/skopeo.

I've made a quick demo of a synthetic example (1-byte edit to large file) here: https://github.com/vraiti/oci-deltas. The result was a 400 KiB patch to update a 1.2 GiB layer.

giuseppe · 2025-11-25T20:36:11Z

I've made a quick demo of a synthetic example (1-byte edit to large file) here: https://github.com/vraiti/oci-deltas. The result was a 400 KiB patch to update a 1.2 GiB layer.

how much do you get with zstd:chunked on the same test?

mheon · 2025-11-25T20:46:23Z

Do we have real-world numbers on AI model updates?

giuseppe · 2025-11-25T20:49:40Z

Do we have real-world numbers on AI model updates?

How so? Do we expect any benefit with AI models with deltas?

mtrmac · 2025-11-25T20:53:32Z

I read that to mean “a 1-byte edit is not representative of anything”.

vraiti · 2025-11-26T15:55:31Z

@giuseppe Just tested, zstd:chunked reports 2 MiB for a 1-byte patch. Also large, but orders of magnitude smaller than I had thought it would be. From documentation and discussion I was under the impression that zstd:chunked did not patch files, only reusing exactly matching files. Does it use something like rolling checksum chunking? That would certainly make it much better for edge computing applications than I had thought.

I will run some additional experiments on more representative workloads (namely, the Openshift upgrades that caused us to lose a Tesco business opportunity) to see whether zstd:chunked is effective there as well.

giuseppe · 2025-11-26T16:00:18Z

@giuseppe Just tested, zstd:chunked reports 2 MiB for a 1-byte patch. Also large, but orders of magnitude smaller than I had thought it would be. From documentation and discussion I was under the impression that zstd:chunked did not patch files, only reusing exactly matching files. Does it use something like rolling checksum chunking? That would certainly make it much better for edge computing applications than I had thought.

I will run some additional experiments on more representative workloads (namely, the Openshift upgrades that caused us to lose a Tesco business opportunity) to see whether zstd:chunked is effective there as well.

yes, zstd:chunked uses a rolling checksum to split a file into multiple chunks.

github-actions bot added the image Related to "image" package label Nov 25, 2025

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

bb3369e

podmanbot mentioned this pull request Nov 25, 2025

Sync: Support Layer Deltas containers/buildah#6535

Draft

mtrmac requested changes Nov 25, 2025

View reviewed changes

vraiti force-pushed the main branch from e005d15 to 4bf6987 Compare November 25, 2025 18:27

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

31e7b59

vraiti force-pushed the main branch from 4bf6987 to d678e11 Compare November 25, 2025 18:34

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

56aa72c

Add a DefaultNetwork field to NetworkInfo

e5d9c8c

Signed-off-by: Kyounghoon Jang <[email protected]>

vraiti force-pushed the main branch from d678e11 to 1544afc Compare November 25, 2025 18:47

github-actions bot added the common Related to "common" package label Nov 25, 2025

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

a248a71

vraiti force-pushed the main branch from 1544afc to 955d215 Compare November 25, 2025 18:58

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

ab31f71

vraiti force-pushed the main branch from 955d215 to 4090f0a Compare November 25, 2025 18:59

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

f53098e

Add delta manifest and delta layer support

8cf69fd

Generated-By: Claude Code Signed-off-by: Vance <[email protected]>

vraiti force-pushed the main branch from 4090f0a to 8cf69fd Compare November 25, 2025 19:02

podmanbot pushed a commit to podmanbot/buildah that referenced this pull request Nov 25, 2025

dnm: Vendor changes from containers/container-libs#494

1a51bd3

mtrmac requested changes Nov 25, 2025

View reviewed changes

common/libnetwork/cni/network.go

Copy link

Contributor

mtrmac Nov 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

???!

Support Layer Deltas #494

Are you sure you want to change the base?

Support Layer Deltas #494

Conversation

vraiti commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Introduction

Rebased PR Info

Original PR Info

Uh oh!

podmanbot commented Nov 25, 2025

Uh oh!

mtrmac left a comment

Choose a reason for hiding this comment

Uh oh!

mtrmac commented Nov 25, 2025

Uh oh!

vraiti commented Nov 25, 2025

Uh oh!

vraiti commented Nov 25, 2025

Uh oh!

packit-as-a-service bot commented Nov 25, 2025

Uh oh!

mheon commented Nov 25, 2025

Uh oh!

mtrmac left a comment

Choose a reason for hiding this comment

Uh oh!

mtrmac Nov 25, 2025

Choose a reason for hiding this comment

Uh oh!

vraiti commented Nov 25, 2025

Uh oh!

giuseppe commented Nov 25, 2025

Uh oh!

mheon commented Nov 25, 2025

Uh oh!

giuseppe commented Nov 25, 2025

Uh oh!

mtrmac commented Nov 25, 2025

Uh oh!

vraiti commented Nov 26, 2025

Uh oh!

giuseppe commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

vraiti commented Nov 25, 2025 •

edited

Loading