[df] Enable snapshotting RNTuple cardinality cols #20820

enirolf · 2026-01-08T10:11:29Z

RNTuple's cardinality fields are projected read-only fields, and currently an exception is thrown when a user tries to snapshot fields of this type to a new RNTuple.

To prevent this from happening, with this PR, such fields are instead converted into non-projected fields of the inner ROOT::RNTupleCardinality<SizeT> field (either std::uint32_t or std::int64_t) before they are added to the model of the new RNTuple. A warning is shown to the user when this happens.

A follow-up/alternative approach is to preserve the projection when creating the model for the output RNTuple. However, this comes with the caveat that the source fields must be included in the output RNTuple. This becomes an issue for cardinality fields of collections of anonymous records (i.e., as is the case for NanoAODs, see paragraph below), since the RNTuple data source here only exposes the inner fields and not the collection field itself, because there is no straightforward way to represent the anonymous record in memory.

A notable scenario is the current implementation of CMS NanoAOD, which in the TTree format contain leaflist arrays. When converting to RNTuple these leaflist arrays, e.g. created via tree.Branch("jet_pt", &jet_pt, "jet_pt[njets]"), the RNTupleImporter creates an anonymous collection record, where jet_pt becomes a true collection field, and njets is a projected field of type RNTupleCardinality. As such, currently RDataFrame is not capable of writing out RNTuple NanoAOD data via Snapshot that preserves the column names for both the collection payload and also the size of the collections. We want to be able to preserve the complete NanoAOD schema.

github-actions · 2026-01-08T12:08:39Z

Test Results

22 files 22 suites 3d 20h 56m 38s ⏱️
3 792 tests 3 792 ✅ 0 💤 0 ❌
80 337 runs 80 337 ✅ 0 💤 0 ❌

Results for commit b70d77b.

[df] Enable snapshotting RNTuple cardinality cols

b70d77b

enirolf requested review from hahnjo, jblomer, pcanal and silverweed January 8, 2026 10:11

enirolf self-assigned this Jan 8, 2026

enirolf added the in:RDataFrame label Jan 8, 2026

enirolf requested review from martamaja10 and vepadulano as code owners January 8, 2026 10:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[df] Enable snapshotting RNTuple cardinality cols #20820

[df] Enable snapshotting RNTuple cardinality cols #20820

enirolf commented Jan 8, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[df] Enable snapshotting RNTuple cardinality cols #20820

Are you sure you want to change the base?

[df] Enable snapshotting RNTuple cardinality cols #20820

Conversation

enirolf commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Jan 8, 2026

Test Results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

enirolf commented Jan 8, 2026 •

edited

Loading