Skip to content

Commit 7de94ab

Browse files
committed
Explain success vs rapid in alternatives
Signed-off-by: Laura Lorenz <[email protected]>
1 parent 64fe4cd commit 7de94ab

File tree

2 files changed

+18
-0
lines changed

2 files changed

+18
-0
lines changed

keps/sig-node/4603-tune-crashloopbackoff/README.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1231,6 +1231,24 @@ those updated with `FastOnSuccess` would get truer fast restart behavior.
12311231
However, then it becomes impossible for a workload to opt into both
12321232
`restartPolicy: FastOnSuccess` and `restartPolicy: Rapid`.
12331233

1234+
##### Related: `Succeeded` vs `Rapid`ly failing: who's getting the better deal?
1235+
1236+
When both a flat rate `Succeeded` and a `Rapid` implementation were combined in
1237+
this proposal, depending on the variation of the initial value, the first few
1238+
restarts of a failed container would be faster than a successful container,
1239+
which at first look seems backwards.
1240+
1241+
!["A graph showing the intersection of delay curves between a linear rate for
1242+
success and an exponential rate for rapid
1243+
failures"](successvsrapidwhenfailed.png "success vs rapid CrashLoopBackoff
1244+
dcay")
1245+
1246+
However, based on the use cases, this is still correct because the goal of
1247+
restarting failed containers is to take maximum advantage of quickly recoverable
1248+
situations, while the goal of restarting successful containers is only to get
1249+
them to run again sometime and not penalize them with longer waits later when
1250+
they've behaving as expected.
1251+
12341252
### Front loaded decay with interval
12351253
In an effort
12361254
to anticipate API server stability ahead of the experiential data we can collect
14.4 KB
Loading

0 commit comments

Comments
 (0)