Commit eaf1d17
committed
Improve flaky chainsaw test for service failover
One chainsaw test consists in abruptly cutting one galera node away from the
galera cluster and verify that the active endpoint moves to one of the
remaining two galera instances.
In doing so, we currently kill -9 the target mysqld server. By design,
this can take by default up to 15s for the remaining galera nodes to
acknowlege the node went away and react to that. This is a problem for
the test as if the pod comes back online before the 15s, the galera
cluster won't move the endpoint and the test will fail.
To prevent flaky result in the unit test, use the STOP signal instead
of the KILL signal. This doesn't kill the pod, and by default galera
will mark the node as not responding after 3s, and switch the endpoint.
This achieves the same result, which is to make sure that an unexpected
disconnection still trigger a endpoint switch.1 parent 0a36c2e commit eaf1d17
1 file changed
+1
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
91 | 91 | | |
92 | 92 | | |
93 | 93 | | |
94 | | - | |
| 94 | + | |
95 | 95 | | |
96 | 96 | | |
97 | 97 | | |
| |||
0 commit comments