CASSANDRA-20476 & CASSANDRA-20736 Handle CMS member addresses changing concurrently by beobal · Pull Request #4613 · apache/cassandra

beobal · 2026-02-13T18:09:56Z

Changing broadcast address has always been supported, but it requires the node to inform the CMS of the change at startup. If a majority of the CMS members attempt to do this concurrently, they have no way to establish the quorum required to make those metadata changes, leading to a deadlocked startup.
This is addressed by the combination of 2 patchsets:

CASSANDRA-20736 modifies ClusterMetadata to represent the CMS membership as a set of node ids, rather than addresses.
CASSANDRA-20476 introduces a protocol for nodes starting up to discover the current address for CMS members if they have changed while that node was down. The node can then construct a temporary address lookup which it uses to establish contact with CMS members and update/get the latest agreed ClusterMetadata. When the starting node is itself a CMS member, this lookup enables it to form a consensus group with the other members so that address changes can be durably committed & disseminated.

… remove these

…sions with TCM

…change

…ed before allowing startup to proceed

…og processing is suspended

krummas · 2026-02-16T07:55:40Z

src/java/org/apache/cassandra/tcm/CMSMembership.java

+
+    public ImmutableSet<NodeId> joiningMembers()
+    {
+        return ImmutableSet.copyOf(joiningMembers);


nit; BTreeSet is already immutable, we probably don't need to copy it

krummas · 2026-02-16T07:56:06Z

src/java/org/apache/cassandra/tcm/CMSMembership.java

+
+    /**
+     * Used to derive a CMSMembership when deserializing a ClusterMetadata instance written with a metadata version
+     * prior to V7. At that time, CMS membership was always inferred from the data placements of the distributed


"prior to V9" I think?

krummas · 2026-02-16T07:56:45Z

src/java/org/apache/cassandra/tcm/CMSLookup.java

+    }
+
+    private final Map<NodeId, Pair<InetAddressAndPort, InetAddressAndPort>> overrides;
+    private final BiMap<InetAddressAndPort, InetAddressAndPort> addressMap;


addressMap is only used in the toString method

krummas · 2026-02-16T07:58:04Z

src/java/org/apache/cassandra/tcm/CMSLookup.java

+        return new InitialBuilder(metadata);
+    }
+
+    private final Map<NodeId, Pair<InetAddressAndPort, InetAddressAndPort>> overrides;


this should probably be an ImmutableMap for clarity?

and if we make InitialBuilder and rebuild below build immutablemaps we can avoid the copying

krummas · 2026-02-16T07:58:18Z

src/java/org/apache/cassandra/tcm/CMSLookup.java

+        return state == State.ACTIVE;
+    }
+
+    public InetAddressAndPort getAddressOverride(NodeId id)


krummas · 2026-02-16T08:12:31Z

src/java/org/apache/cassandra/tcm/ClusterMetadata.java

+            else
+            {
+                // This cluster did not previously upgrade from a gossip based version (i.e. pre-6.0) but did at some point
+                // run a version prior to MetadataVersion.V7 where we started to encode CMS membership directly. This


krummas · 2026-02-16T08:13:43Z

src/java/org/apache/cassandra/tcm/ClusterMetadata.java

+                    // so we can derive the CMSMembership using the data placement and directory.
+                    DataPlacement placement = placements.get(metadataKs.params.replication);
+                    cmsMembership = CMSMembership.reconstruct(placement, dir);
+                    placements = placements.unbuild().without(metadataKs.params.replication).build();


I think this is unnecessary - we do the same thing directly after the if stmt

krummas · 2026-02-16T08:14:48Z

src/java/org/apache/cassandra/tcm/Startup.java

+        int currentRound = 0;
+        long roundTimeNanos = Math.min(TimeUnit.SECONDS.toNanos(4),
+                                       DatabaseDescriptor.getDiscoveryTimeout(TimeUnit.NANOSECONDS) / maxRounds);
+        // TODO a non-CMS node only needs to be able to contact a single CMS member to commit its STARTUP


should we fix this? It feels like we'll most often discover the full CMS if its up

and if it is not yet up, it might be better to wait here before trying to commit Startup?

krummas · 2026-02-16T08:15:11Z

src/java/org/apache/cassandra/tcm/Startup.java

+
+        int maxRounds = 5;
+        int currentRound = 0;
+        long roundTimeNanos = Math.min(TimeUnit.SECONDS.toNanos(4),


is 4s enough here? Should we add another "discover survey" config setting?

krummas · 2026-02-16T08:15:36Z

src/java/org/apache/cassandra/tcm/Startup.java

+        Map<NodeId, InetAddressAndPort> confirmedCMS = new HashMap<>();
+
+        Set<InetAddressAndPort> candidates = new HashSet<>(previousCMS.values());
+        candidates.add(newAddress);


any reason we don't add the seeds to candidates here? Feels like it could save us a discovery round

beobal and others added 23 commits February 2, 2026 11:40

[CASSANDRA-20736] Add CMS membership directly to ClusterMetadata

27743d2

[CASSANDRA-20736] Init singleton CMS cluster with restarts

6f1b7de

[CASSANDRA-20736] Update CMS reconfiguration

c7871bd

[CASSANDRA-20736] Update Startup transformation

8a68f97

[CASSANDRA-20736] Update legacy CMS membership transformations - TODO…

307e878

… remove these

[CASSANDRA-20736] New MetadataKey for CMS membership

5db6f87

[CASSANDRA-20736] Support for upgrades from gossip & from earlier ver…

5a9ccbf

…sions with TCM

[CASSANDRA-20736] Update CancelCMSReconfiguration

4c012b1

[CASSANDRA-20736] Properly set lastModified on DataPlacements

c9faf3f

[CASSANDRA-20736] AtomicLongProcessor can always accept commits

12a2f55

[CASSANDRA-20736] Separate MetaStrategy placements from others

36d62e7

[CASSANDRA-20736] Make DataPlacements private on ClusterMetadata

cb9ebbb

[CASSANDRA-20736] Test fixes

aaa2049

[CASSANDRA-20736] Rework CMS initialization

01b2350

[CASSANDRA-20476] Add dtest for CMS rediscovery

51e1c6c

[CASSANDRA-20476] Prep for CMSLookup

d346661

[CASSANDRA-20476] Introduce CMSLookup

bed6f56

[CASSANDRA-20476] Perform rediscovery of CMS at startup if addresses …

7807fd5

…change

[CASSANDRA-20476] Attempt to wait for all address changes to be enact…

ba2256c

…ed before allowing startup to proceed

[CASSANDRA-20476] Some minor logging additions

76e9f1d

[CASSANDRA-20476] Make sure to start messaging service

214261f

[CASSANDRA-20476] Don't attempt to catch up from peers or CMS while l…

a52733a

…og processing is suspended

Make shadow gossip round parameters configurable for testing

dc2fab7

beobal requested review from krummas and removed request for krummas February 13, 2026 18:10

beobal changed the title ~~CASSANDRA-20476 & CASSANDRA-20736 Handle all CMS member addresses changing concurrently~~ CASSANDRA-20476 & CASSANDRA-20736 Handle CMS member addresses changing concurrently Feb 13, 2026

krummas requested changes Feb 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CASSANDRA-20476 & CASSANDRA-20736 Handle CMS member addresses changing concurrently #4613

CASSANDRA-20476 & CASSANDRA-20736 Handle CMS member addresses changing concurrently #4613
beobal wants to merge 23 commits intoapache:trunkfrom
beobal:samt/CASSANDRA-20476

beobal commented Feb 13, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

krummas Feb 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

beobal commented Feb 13, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants