Verify AllocationIDs in replication actions #20320

bleskes · 2016-09-04T18:58:05Z

Replicated operation consist of a routing action (the original), which is in charge of sending the operation to the primary shard, a primary action which executes the operation on the resolved primary and replica actions which performs the operation on a specific replica. This commit adds the targeted shard's allocation id to the primary and replica actions and makes sure that those match the shard the actions end up executing on.

This helps preventing extremely rare failure mode where a shard moves off a node and back to it, all between an action is sent and the time it's processed.

For example:

Primary action is sent to a relocating primary on node A.
The primary finishes relocation to node B and start relocating back.
The relocation back gets to the phase and opens up the target engine, on the original node, node A.
The primary action is executed on the target engine before the relocation finishes, at which the shard copy on node B is still the official primary - i.e., it is executed on the wrong primary.

s1monw · 2016-09-05T15:27:59Z

core/src/main/java/org/elasticsearch/action/support/replication/TransportReplicationAction.java

+
+        RequestWithAllocationID(Supplier<R> requestSupplier) {
+            request = requestSupplier.get();
+            allocationId = null;


what is the invariant that this can be null?

oh that is for deserialization :( bummer can you document it?

yeah :(. maybe we should add a variant of registerRequestHandler which takes a function of a stream and returns a request.

s1monw · 2016-09-05T15:33:51Z

left some suggestions LGTM in general

jasontedor · 2016-09-05T15:44:44Z

.../test/java/org/elasticsearch/action/support/replication/TransportReplicationActionTests.java

+            ActionListener<Releasable> callback = (ActionListener<Releasable>) invocation.getArguments()[1];
+            final long primaryTerm = indexShard.getPrimaryTerm();
+            if (term < primaryTerm) {
+                throw new IllegalArgumentException(LoggerMessageFormat.format("{} operation term [{}] is too old (current [{}])",


Can we not add another use of LoggerMessageFormat; I'd like to remove it?

I copied from another place - do you have a decent suggestion for an alternative (the obvious one is chaining strings)

String.format(Locale.ROOT, "%s operation term [%d] is too old (current [%d])", shardId, term, primaryTerm)

jasontedor · 2016-09-05T16:08:10Z

.../test/java/org/elasticsearch/action/support/replication/TransportReplicationActionTests.java

+        }
+    }
+
+    /** test that a replica request is reject if it arrives at a shard with a wrong allocation id */


Nit: reject -> rejected

jasontedor · 2016-09-05T16:11:24Z

I left a few comments, but it looks good.

jasontedor · 2016-09-05T18:47:11Z

core/src/main/java/org/elasticsearch/action/support/replication/TransportReplicationAction.java

+            allocationId = null;
+        }
+
+        RequestWithAllocationID(R request, String allocationId) {


I guess this constructor parameter should be targetAllocationID too to be consistent with my other suggestions.

bleskes · 2016-09-05T18:52:38Z

thx @s1monw , @jasontedor . I pushed a commit addressing all comments

s1monw · 2016-09-05T19:15:52Z

I don't get why we call these properties targetAllocationID? what's wrong with allocationId?

bleskes · 2016-09-05T19:17:09Z

I don't get why we call these properties targetAllocationID? what's wrong with allocationId?

Because @jasontedor asked.

s1monw · 2016-09-05T19:18:52Z

Because @jasontedor asked.

can we keep it simple?

s1monw · 2016-09-06T08:27:37Z

@bleskes I don't wanna be in the way for such an improvement just because of some internal debatable naming. lets get it in!

bleskes · 2016-09-06T12:33:02Z

Thx @s1monw and @jasontedor for the review.

Replicated operation consist of a routing action (the original), which is in charge of sending the operation to the primary shard, a primary action which executes the operation on the resolved primary and replica actions which performs the operation on a specific replica. This commit adds the targeted shard's allocation id to the primary and replica actions and makes sure that those match the shard the actions end up executing on. This helps preventing extremely rare failure mode where a shard moves off a node and back to it, all between an action is sent and the time it's processed. For example: 1) Primary action is sent to a relocating primary on node A. 2) The primary finishes relocation to node B and start relocating back. 3) The relocation back gets to the phase and opens up the target engine, on the original node, node A. 4) The primary action is executed on the target engine before the relocation finishes, at which the shard copy on node B is still the official primary - i.e., it is executed on the wrong primary.

bleskes added 11 commits September 2, 2016 15:29

add aid to replication requests

132745d

finx IndicesRequestIT

ca27290

Merge remote-tracking branch 'upstream/master' into replication_with_aid

5a6fe29

TransportReplicationActionTests with mocks

ebfebab

Merge remote-tracking branch 'upstream/master' into replication_with_aid

9144f54

add unit tests

f66da90

better error message

767c01c

better error message when shard is not assigned locally

a8fc4be

fix testReplicaActionRejectsWrongAid

e722c95

meh

712a87c

Merge remote-tracking branch 'upstream/master' into replication_with_aid

c8a090d

bleskes added >enhancement resiliency :Core/Infra/Core Core issues without another label v5.0.0-beta1 labels Sep 4, 2016

fix retry on replica

22e4f84

s1monw reviewed Sep 5, 2016
View reviewed changes

jasontedor reviewed Sep 5, 2016
View reviewed changes

feedback

4a63c09

one more rename

83ac99b

bleskes added 4 commits September 6, 2016 14:08

Merge remote-tracking branch 'upstream/master' into replication_with_aid

67151e0

line length

c178ad3

be concrete

c199657

locale

edfdff2

bleskes merged commit c56cd46 into elastic:master Sep 6, 2016

bleskes deleted the replication_with_aid branch September 6, 2016 12:32

clintongormley added :Allocation and removed :Core/Infra/Core Core issues without another label labels Sep 7, 2016

lcawl added :Distributed Indexing/Distributed A catch all label for anything in the Distributed Area. Please avoid if you can. and removed :Allocation labels Feb 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verify AllocationIDs in replication actions #20320

Verify AllocationIDs in replication actions #20320

bleskes commented Sep 4, 2016

s1monw Sep 5, 2016

s1monw Sep 5, 2016

bleskes Sep 5, 2016

s1monw commented Sep 5, 2016

jasontedor Sep 5, 2016 •

edited

Loading

bleskes Sep 5, 2016

jasontedor Sep 5, 2016 •

edited

Loading

jasontedor Sep 5, 2016

jasontedor commented Sep 5, 2016

jasontedor Sep 5, 2016

bleskes commented Sep 5, 2016

s1monw commented Sep 5, 2016

bleskes commented Sep 5, 2016

s1monw commented Sep 5, 2016

s1monw commented Sep 6, 2016

bleskes commented Sep 6, 2016

Verify AllocationIDs in replication actions #20320

Verify AllocationIDs in replication actions #20320

Conversation

bleskes commented Sep 4, 2016

s1monw Sep 5, 2016

Choose a reason for hiding this comment

s1monw Sep 5, 2016

Choose a reason for hiding this comment

bleskes Sep 5, 2016

Choose a reason for hiding this comment

s1monw commented Sep 5, 2016

jasontedor Sep 5, 2016 • edited Loading

Choose a reason for hiding this comment

bleskes Sep 5, 2016

Choose a reason for hiding this comment

jasontedor Sep 5, 2016 • edited Loading

Choose a reason for hiding this comment

jasontedor Sep 5, 2016

Choose a reason for hiding this comment

jasontedor commented Sep 5, 2016

jasontedor Sep 5, 2016

Choose a reason for hiding this comment

bleskes commented Sep 5, 2016

s1monw commented Sep 5, 2016

bleskes commented Sep 5, 2016

s1monw commented Sep 5, 2016

s1monw commented Sep 6, 2016

bleskes commented Sep 6, 2016

jasontedor Sep 5, 2016 •

edited

Loading

jasontedor Sep 5, 2016 •

edited

Loading