Fix Test SharedClusterSnapshotRestoreIT.testDataFileFailureDuringRestore (#80515) (#124282)
This is a test/assertion-only issue. We were removing the tracking of a
shard restore after invoking the listener for the restore. The whole
mechanism around `onGoingRestores`, though, is only used to wait for the
blobstore to go idle during node shutdown.
The problem with removing the tracking for the shard after resolving the
listener is that if the restore is retried very quickly (for example,
because of a reroute), then we have a race where it is retried before the
failed restore is removed from `onGoingRestores`.
Fixed by removing the tracking before resolving the listener, which is
more correct anyway since we are done with the blobstore at this point.
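
The ordering described above can be illustrated with a minimal sketch. This is
not the actual Elasticsearch code: `RestoreTracker`, `startRestore`,
`finishRestore` and `isIdle` are hypothetical names, and the shard is
identified by a plain string. The point it shows is that the shard is dropped
from the tracked set before the listener runs, so a listener that retries
right away does not find a stale entry.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Consumer;

// Hypothetical stand-in for the tracking described above.
final class RestoreTracker {
    // Shards whose restore is currently using the blobstore.
    private final Set<String> ongoingRestores = ConcurrentHashMap.newKeySet();

    // Starts tracking a shard restore; fails if one is still tracked.
    void startRestore(String shardId) {
        if (!ongoingRestores.add(shardId)) {
            throw new IllegalStateException("restore already tracked for " + shardId);
        }
    }

    // Completes a restore. 'failure' is null on success.
    void finishRestore(String shardId, Exception failure, Consumer<Exception> listener) {
        // Remove the tracking first: the blobstore work is done at this point.
        ongoingRestores.remove(shardId);
        // Only then resolve the listener, which may retry the restore immediately.
        listener.accept(failure);
    }

    // True when no restore is tracked, e.g. what a shutdown would wait for.
    boolean isIdle() {
        return ongoingRestores.isEmpty();
    }
}
```

With the two statements in `finishRestore` swapped, a listener that calls
`startRestore` for the same shard would hit the `IllegalStateException` in this
sketch, which mirrors the race described above.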
closes #80477
(cherry picked from commit ea93bdb)
Co-authored-by: Armin Braun <[email protected]>