[ntuple] add streaming vector tutorial (v2) #19748
Conversation
Branch updated: acbebf5 → 311a905
// so that the entire vector never needs to stay in memory.
// Note that we don't need to implement loading chunks of data explicitly. Simply by asking for a single vector element
// at every iteration step, the RNTuple views will take care of keeping only the currently required data pages
// in memory.
It would be helpful here to say how one can tune the maximum amount of memory used by the StreamingVector
I think the important point is to turn off the cluster cache, otherwise we will load entire clusters anyway. Beyond that, the memory consumption should be that of a page, which is determined by the input file. The reading code doesn't have much control over it...
Test Results: 21 files, 21 suites, 3d 17h 35m 53s ⏱️. For more details on these failures, see this check. Results for commit 311a905.
Thanks for the other fixes to RNTupleLocalRange and RNTupleCollectionView; we may consider backporting these to the next release of v6.36. Some comments inline for the ntpl016_streaming_vector.C tutorial.
constexpr char const *kNTupleName = "ntpl";
constexpr char const *kFieldName = "LargeVector";
constexpr unsigned int kNEvents = 10;
constexpr unsigned int kVectorSize = 1000000;
The alma9 modules_off (runtime_cxxmodules=Off) build is slightly unhappy about this line, because TGeometry.h defines const Int_t kVectorSize = 3; I guess we have to rename the constant here...
#include <vector>
#include <utility>

constexpr char const *kFileName = "ntpl015_streaming_vector.root";
Suggested change:
- constexpr char const *kFileName = "ntpl015_streaming_vector.root";
+ constexpr char const *kFileName = "ntpl016_streaming_vector.root";
// A lightweight iterator used in StreamingVectorView::begin() and StreamingVectorView::end().
// Used to iterate over the elements of an RNTuple on-disk vector for a certain entry.
// Dereferencing the iterator returns the corresponding value of the item view.
class iterator {
Should probably have the usual iterator using definitions, for std::iterator_traits.
Replaces #17139, with the comments in that PR incorporated.