-
Notifications
You must be signed in to change notification settings - Fork 25.5k
Remove FailedToCommitClusterStateException Check #135846
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Remove FailedToCommitClusterStateException Check #135846
Conversation
Removes a check on `FailedToCommitClusterStateException` with a message "node closed" inside QueryableBuiltInRolesSynchronizer .isExpectedFailure since this exception was changed to a NotMasterException inside elastic#135008 Relates to: ES-13061
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM once the other changes have landed.
@DaveCTurner The pre-requisite PR for this was #135008 which is merged. It's not blocked by any other changes |
Ah right I thought this was blocked on #135706 too but I guess not. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Pinging @elastic/es-distributed-coordination (Team:Distributed Coordination) |
This is the fourth part of a series of PRs fixing how the
FailedToCommitClusterStateException
is used in ElasticSearch. As per #135017,FailedToCommitClusterStateException
is defined as:Currently,
FailedToCommitClusterStateException
is used as a 'catch-all' exception thrown at multiple places throughout theCoordinator
andMasterService
during the publication process. Semantically however, it doesn't make sense to throw this exception before the cluster state update is actually sent over the wire, since at this point, we know for certain that the cluster state update failed.FailedToCommitClusterStateException
is intended to display ambiguity.This work is a pre-requisite to #134213.
Changes
Removes a check on
FailedToCommitClusterStateException
with a message "node closed" insideQueryableBuiltInRolesSynchronizer.isExpectedFailure
sinceFailedToCommitClusterStateException
s with this error message were replaced withNotMasterExceptions
inside #135008, andNotMasterExceptions
is already included inside this OR expressionAs a note, this is not blocking and is not blocked by, any other change
Next Steps
The goal of this work is to fix up all erroneously used
FailedToCommitClusterStateException
.Done:
FailedToCommitClusterStateException
thrown insideMasterService.Batch.onResponse()
when draining the queue after the threadpool has shut down - Change FailedToCommitClusterStateException to NotMasterException #135008FailedToCommitClusterStateException
toNotMasterException
during the pre-publication process: Changes FailedToCommitClusterStateException to NotMasterException #135548FailedToCommitClusterStateExceptions
prior to the publication of the cluster state update over the wire withFailedToPublishClusterStateException
: Replace pre publication failed to commit cluster state exceptions #135706Todo:
FailedToCommitClusterStateException
exception insideMasterService.BatchingTaskQueue.submitTask
, (here) with aNotMasterException
.FailedToCommitClusterStateException
that need to be updatedRelates to: ES-13061