This commit addresses TODOs left after introducing network partitions
and node restarts to kvnemesis:
- All fault mode tests run with 4 nodes to ensure that liveness tests
can always maintain a majority quorum of replicas on nodes 1 and 2.
If the tests ran with 5 nodes, we would need to take extra care to
ensure 3 replicas are healthy for all RF=5 ranges (e.g. the system
ranges).
- Splits are now allowed in both partition and restart liveness modes.
Previously, if r3 was unavailable, the range ID allocator would get
stuck retrying the increment operation that generates a new range ID.
This commit ensures all system ranges are available in liveness
mode. Moreover, for the restart liveness variant, some error handling
tweaks were needed to make sure the split is retried if it races with
a node shutdown.
- A few operations are re-enabled across the two partition test
variants: lease transfers and delete range. It's not clear what
changed, but these no longer cause the tests to fail (or at least
failures are hard to repro). The original theory for why they could
fail may not have been quite right. Will investigate more if they fail
in CI.
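
For reference, the quorum arithmetic behind the node-count choice in the
first bullet can be sketched as below. This is an illustrative snippet
only, not code from kvnemesis; the helper names `quorum` and `hasQuorum`
are hypothetical.

```go
package main

import "fmt"

// quorum returns the minimum number of healthy replicas needed for a
// majority at replication factor rf. Hypothetical helper for illustration.
func quorum(rf int) int {
	return rf/2 + 1
}

// hasQuorum reports whether the given number of healthy replicas forms a
// majority at replication factor rf. Hypothetical helper for illustration.
func hasQuorum(rf, healthy int) bool {
	return healthy >= quorum(rf)
}

func main() {
	// Compare the majority requirement at RF=3 and RF=5.
	for _, rf := range []int{3, 5} {
		fmt.Printf("RF=%d needs %d healthy replicas; 2 healthy is enough: %t\n",
			rf, quorum(rf), hasQuorum(rf, 2))
	}
}
```

Two healthy replicas form a majority at RF=3 but not at RF=5, which is
why an RF=5 range would need a third healthy replica.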
Part of: #64828
Part of: #114814
Release note: None