-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Wait on shard failures #14252
Labels
:Distributed Indexing/Distributed
A catch all label for anything in the Distributed Area. Please avoid if you can.
>enhancement
Meta
release highlight
resiliency
v5.0.0-alpha1
Comments
+1 |
sounds good to me too |
This was referenced Jan 3, 2016
jasontedor
added a commit
that referenced
this issue
Jan 17, 2016
Wait for new master when failing shard Relates #14252
will this one resovled issue:7572? |
Merged
clintongormley
added
:Distributed Indexing/Distributed
A catch all label for anything in the Distributed Area. Please avoid if you can.
and removed
:Cluster
labels
Feb 13, 2018
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
:Distributed Indexing/Distributed
A catch all label for anything in the Distributed Area. Please avoid if you can.
>enhancement
Meta
release highlight
resiliency
v5.0.0-alpha1
Currently when executing an action (e.g., bulk, delete, or indexing operations) on all shards, if an exception occurs while executing the action on a replica shard we send a shard failure message to the master. However, we do not wait for the master to acknowledge this message and do not handle failures in sending this message to the master. This is problematic because it means that we will acknowledge the action and this can result in losing writes. For example, in a situation where a primary is isolated from the master and its replicas, the following sequence of events can occur:
In this case, the replica will not have the write that was acknowledged to the client and this amounts to data loss.
Instead, if we waited on the master to acknowledge the shard failures we would never have acknowledged the write to the client in this case.
The text was updated successfully, but these errors were encountered: