
Conversation

@michaeljmarshall
Member

What is the issue

Fixes: https://github.com/riptano/cndb/issues/16350

What does this PR fix and why was it fixed

ChronicleMap gives us several lower-level APIs that avoid deserializing keys/values and the allocation that comes with them. Two points are key: first, iteration becomes very expensive as these maps grow, so we want to avoid it where possible; second, the typical map iteration methods deserialize the key and the value eagerly, and since the key is typically a high-dimensional vector, avoiding that deserialization is valuable. This change:

  • Removes unnecessary iteration by leveraging the fact that compaction is additive
  • Replaces forEach with forEachEntry, which gives better semantics: the key and value are only deserialized on demand
  • Updates the maybeAddVector method to avoid serializing the vector key twice by using the queryContext; the ChronicleMap#put method uses this pattern internally (a sketch of both patterns follows this list)
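
For illustration, here is a minimal sketch of the two patterns above against ChronicleMap's public API. The float[] key, the Postings value class, and the collectValues/maybeAdd methods are stand-ins invented for this example; the real code uses the index's own key/value types and marshallers.

    import net.openhft.chronicle.map.ChronicleMap;
    import net.openhft.chronicle.map.ExternalMapQueryContext;
    import net.openhft.chronicle.map.MapEntry;

    import java.util.ArrayList;
    import java.util.List;

    public class ChronicleMapPatternsSketch
    {
        // Placeholder value type standing in for the real postings class.
        static class Postings
        {
            final int rowId;
            Postings(int rowId) { this.rowId = rowId; }
        }

        // Lazy iteration: forEachEntry hands us MapEntry views whose key and value
        // are only deserialized when get() is called, so we can read the value
        // without ever materializing the (large) vector key.
        static List<Postings> collectValues(ChronicleMap<float[], Postings> map)
        {
            List<Postings> values = new ArrayList<>((int) map.longSize());
            map.forEachEntry(entry -> values.add(entry.value().get()));
            return values;
        }

        // "Put if absent" that serializes the key only once: queryContext(key)
        // serializes the key up front, and wrapValueAsData passes the value to the
        // absent-entry insert without serializing the key again. ChronicleMap#put
        // uses this same pattern internally.
        static void maybeAdd(ChronicleMap<float[], Postings> map, float[] vector, int rowId)
        {
            try (ExternalMapQueryContext<float[], Postings, ?> ctx = map.queryContext(vector))
            {
                ctx.updateLock().lock();
                MapEntry<float[], Postings> entry = ctx.entry();
                if (entry == null)
                    ctx.absentEntry().doInsert(ctx.wrapValueAsData(new Postings(rowId)));
                // else: the key is already present; the real code updates the
                // existing postings instead of inserting.
            }
        }
    }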

I added two sets of benchmarks; however, VectorCompactionBench doesn't seem to register the benefit of the ChronicleMap changes, likely because ChronicleMap's cost is small relative to graph construction. I am leaving VectorCompactionBench in place since it is still useful.

…cation

Results from my machine:

     [java] Benchmark                                 (dupeVectorFactor)  Mode  Cnt     Score    Error  Units
     [java] VectorCompactionBench.compactVectorIndex                   0  avgt    5   588.260 ± 50.314  ms/op
     [java] VectorCompactionBench.compactVectorIndex               0.009  avgt    5   577.705 ± 43.717  ms/op
     [java] VectorCompactionBench.compactVectorIndex               0.999  avgt    5   640.866 ± 14.616  ms/op
     [java] VectorCompactionBench.compactVectorIndex                  10  avgt    5  1253.919 ± 27.370  ms/op
     [java] Benchmark                                 (dupeVectorFactor)  Mode  Cnt     Score    Error  Units
     [java] VectorCompactionBench.compactVectorIndex                   0  avgt    5   576.955 ± 21.165  ms/op
     [java] VectorCompactionBench.compactVectorIndex               0.009  avgt    5   562.493 ± 25.208  ms/op
     [java] VectorCompactionBench.compactVectorIndex               0.999  avgt    5   641.528 ± 10.381  ms/op
     [java] VectorCompactionBench.compactVectorIndex                  10  avgt    5  1270.086 ± 15.054  ms/op
This drops the dimension from each key and adds a config that ensures ChronicleMap knows the keys are a fixed size (a sketch of that configuration follows the results below).

     [java] Benchmark                                 (dupeVectorFactor)  Mode  Cnt     Score    Error  Units
     [java] VectorCompactionBench.compactVectorIndex                   0  avgt    5   594.055 ± 88.097  ms/op
     [java] VectorCompactionBench.compactVectorIndex               0.009  avgt    5   564.272 ± 20.759  ms/op
     [java] VectorCompactionBench.compactVectorIndex               0.999  avgt    5   629.680 ± 16.790  ms/op
     [java] VectorCompactionBench.compactVectorIndex                  10  avgt    5  1207.623 ± 17.931  ms/op
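
For reference, a minimal sketch of configuring a fixed key size through the builder. The byte[] key/value types, the sizes, and the entry count are placeholders for this example; the real map uses the project's own types and marshallers, and the dimension comes from the index configuration.

    import net.openhft.chronicle.map.ChronicleMap;

    public class FixedKeySizeMapSketch
    {
        public static void main(String[] args)
        {
            int dimension = 768;                    // assumed vector dimension
            int keyBytes = dimension * Float.BYTES; // every key serializes to the same size

            // constantKeySizeBySample tells ChronicleMap that all keys have the same
            // serialized size, so it does not need to track a per-key length.
            try (ChronicleMap<byte[], byte[]> postings = ChronicleMap
                    .of(byte[].class, byte[].class)
                    .constantKeySizeBySample(new byte[keyBytes])
                    .averageValue(new byte[64])
                    .entries(1_000_000)
                    .create())
            {
                postings.put(new byte[keyBytes], new byte[] { 1, 2, 3 });
            }
        }
    }
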
Benchmark results before the change:

     [java] Benchmark                                                   (dimension)  (numVectors)  Mode  Cnt      Score     Error  Units
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping            768        100000  avgt    5    271.569 ±   3.473  ms/op
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping            768       1000000  avgt    5   5452.393 ± 227.905  ms/op
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping           1536        100000  avgt    5   1392.607 ±  30.388  ms/op
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping           1536       1000000  avgt    5  11496.696 ± 345.886  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany          768        100000  avgt    5    242.049 ±  20.708  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany          768       1000000  avgt    5   2365.691 ±  84.173  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany         1536        100000  avgt    5    265.395 ±   4.167  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany         1536       1000000  avgt    5   3641.557 ± 130.649  ms/op

After the change:

     [java] Benchmark                                                   (dimension)  (numVectors)  Mode  Cnt    Score    Error  Units
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping            768        100000  avgt    5    5.721 ±  1.727  ms/op
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping            768       1000000  avgt    5  124.536 ± 22.464  ms/op
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping           1536        100000  avgt    5    5.662 ±  0.610  ms/op
     [java] V5VectorPostingsWriterBench.createGenericIdentityMapping           1536       1000000  avgt    5  122.671 ±  3.343  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany          768        100000  avgt    5    5.364 ±  1.194  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany          768       1000000  avgt    5  119.449 ±  4.809  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany         1536        100000  avgt    5    5.379 ±  0.552  ms/op
     [java] V5VectorPostingsWriterBench.describeForCompactionOneToMany         1536       1000000  avgt    5  121.293 ±  3.040  ms/op
@michaeljmarshall self-assigned this Jan 8, 2026

github-actions bot commented Jan 8, 2026

Checklist before you submit for review

  • This PR adheres to the Definition of Done
  • Make sure there is a PR in the CNDB project updating the Converged Cassandra version
  • Use NoSpamLogger for log lines that may appear frequently in the logs
  • Verify test results on Butler
  • Test coverage for new/modified code is > 80%
  • Proper code formatting
  • Proper title for each commit starting with the project-issue number, like CNDB-1234
  • Each commit has a meaningful description
  • Each commit is not very long and contains related changes
  • Renames, moves and reformatting are in distinct commits
  • All new files should contain the DataStax copyright header instead of the Apache License one


@eolivelli left a comment


LGTM

I have one minor comment about "var", not a big deal

 public boolean isEmpty()
 {
-    return postingsMap.values().stream().allMatch(VectorPostings::isEmpty);
+    return rowsAdded == 0;


Nice trick!

int ordinal = useSyntheticOrdinals ? nextOrdinal++ : segmentRowId;
maxOrdinal = ordinal; // always increasing
var postings = new CompactionVectorPostings(ordinal, segmentRowId);
var data = postingsQueryContext.wrapValueAsData(postings);


I would prefer not to use "var" here, because it is not really clear from the context what the type is; we are using this unusual QueryContext from ChronicleMap.

Member Author


Agreed, and data didn't make it any clearer :)


Side note: the Cassandra community decided to use var only in tests, so maybe it will be easier when porting or rebasing if we stick to that rule in our fork too?

var trainingVectors = new ArrayList<VectorFloat<?>>(postingsMap.size());
var vectorsByOrdinal = new Int2ObjectHashMap<VectorFloat<?>>();
postingsMap.forEachEntry(entry -> {
// TODO can we skip this copy?


Are you willing to fix this in this PR? Otherwise, let's open a ticket and link it here.

Member Author


I started looking into this last night. There are some cases where we modify an existing object (this is the using != null case in the Marshaller). I am pretty sure, but not positive, that those are just cases where the key was still in memory and not yet flushed to disk. I plan to check today.
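
For readers unfamiliar with the pattern being discussed: ChronicleMap's reader-side marshallers receive an optional using instance to refill instead of allocating a new object. A minimal illustrative sketch of that shape (a hypothetical readVector helper, not the project's actual Marshaller):

    import net.openhft.chronicle.bytes.Bytes;

    public class ReuseOnReadSketch
    {
        // Mirrors the shape of a ChronicleMap reader's read(in, using) contract:
        // when a non-null 'using' instance is supplied, refill it in place;
        // otherwise allocate a fresh array.
        static float[] readVector(Bytes<?> in, float[] using, int dimension)
        {
            float[] target = (using != null && using.length == dimension) ? using : new float[dimension];
            for (int i = 0; i < dimension; i++)
                target[i] = in.readFloat();
            return target;
        }
    }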

@cassci-bot

❌ Build ds-cassandra-pr-gate/PR-2189 rejected by Butler


2 regressions found
See build details here


Found 2 new test failures

Test Explanation Runs Upstream
o.a.c.index.sai.QueryContextTest.testWideTableWithStatics[dc] (compression) REGRESSION 🔴🔵 0 / 21
o.a.c.index.sai.cql.VectorCompaction100dTest.testZeroOrOneToManyCompaction[dc false] REGRESSION 🔴 0 / 21

Found 2 known test failures
