`failover_promote` returns an error via `force_inconsistency: true` #1399

dokshina · 2021-04-29T12:49:23Z

Bug description

failover_promote returns an error via force_inconsistency: true. Reproduced in both stateboard cases (tarantool and etcd).
Vclockkeeper value in stateboard storage was changed between get_vclockkeeper and set_vclockkeeper calls in the force_inconsistency function.
That check should be on the server (stateboard or etcd), not the client.

cartridge/cartridge/stateboard-client.lua

Lines 113 to 118 in 5142624

    
           if keeper.instance_uuid == instance_uuid 
        
           and vclock == nil 
        
           then 
        
               -- No update needed 
        
               return true 
        
           end

Needs extra investigation to fix etcd error.

Steps to reproduce

1. The cluster with simple topology is deployed:

    core_1_replicaset:
      hosts:
        core-1:
      vars:
        replicaset_alias: core-1
        roles:
          - vshard-router
          - failover-coordinator

    storage_1_replicaset:
      hosts:
        storage-1-leader:
        storage-1-replica:
        storage-1-replica-2:
      vars:
        replicaset_alias: storage-1
        failover_priority:
          - storage-1-leader
          - storage-1-replica-2
          - storage-1-replica
        roles:
          - vshard-storage

    storage_2_replicaset:
      hosts:
        storage-2-leader:
        storage-2-replica:
      vars:
        replicaset_alias: storage-2
        failover_priority:
          - storage-2-leader
          - storage-2-replica
        roles:
          - vshard-storage

The failover is set to

    cartridge_failover_params:
      mode: stateful
      state_provider: stateboard
      stateboard_params:
        uri: vm1:4001
        password: secret-stateboard

Leaders are promoted to storage-1-replica and storage-2-replica with force_inconsistency: false.
Leaders are promoted to core-1, storage-1, storage-2 with force_inconsistency: true.

On step 3 the error is returned: Failed to promote leaders: Promotion succeeded, but inconsistency wasn't forced: Ordinal comparison failed (requested 5, current 7).

use test from Add stateful test for promote from ticket 1399 #1710
add fiber.sleep before

cartridge/cartridge/stateboard-client.lua

Line 122 in 869a2bd

'set_vclockkeeper', {
Profit

Actual behavior

Promotion succeeded, but inconsistency wasn't forced: Compare failed (101): [223 != 253]

cartridge/cartridge/etcd2-client.lua

Line 219 in 869a2bd

local resp, err = session.connection:request('PUT',

Promotion succeeded, but inconsistency wasn't forced: Ordinal comparison failed (requested 5, current 7)

cartridge/cartridge/stateboard-client.lua

Line 122 in 869a2bd

'set_vclockkeeper', {

Expected behavior

No error returned.

The text was updated successfully, but these errors were encountered:

rosik · 2021-04-29T12:54:28Z

Related to #1398

filonenko-mikhail · 2022-01-21T15:12:34Z

Already covered by tests in other pr #1682

opomuc · 2022-02-11T07:22:44Z

The issue is still reproduced. @filonenko-mikhail DM me for details, please

filonenko-mikhail · 2022-02-11T09:08:42Z

Please provide reproducer:

cartridge version
env
step to reproduce

yngvar-antonsson · 2022-02-17T10:07:24Z

Promotion succeeded, but inconsistency wasn't forced: Compare failed (101): [223 != 253]
It seems that the value by last index (prevIndex) was changed between get_vclockkeeper and set_vclockkeeper in

cartridge/cartridge/etcd2-client.lua

Line 219 in 869a2bd

local resp, err = session.connection:request('PUT',

the same in

cartridge/cartridge/stateboard-client.lua

Line 122 in 869a2bd

'set_vclockkeeper', {

I'll try to write a repro test

yngvar-antonsson · 2022-02-22T18:14:17Z

Great thanks to @rosik for help to investigate the problem!

rosik added the bug Something isn't working label Apr 29, 2021

rosik added teamS Scaling cartridge labels Jul 16, 2021

kyukhin added this to the wishlist milestone Aug 19, 2021

filonenko-mikhail removed this from the wishlist milestone Jan 12, 2022

filonenko-mikhail added teamX and removed teamS Scaling labels Jan 12, 2022

filonenko-mikhail mentioned this issue Jan 18, 2022

Add stateful test for promote from ticket 1399 #1710

Closed

1 task

filonenko-mikhail closed this as completed Jan 21, 2022

opomuc reopened this Feb 11, 2022

filonenko-mikhail added the 8sp label Feb 21, 2022

yngvar-antonsson added help wanted and removed help wanted labels Feb 22, 2022

filonenko-mikhail assigned yngvar-antonsson Mar 1, 2022

yngvar-antonsson mentioned this issue Mar 3, 2022

Fix failover_promote when vclockkeeper ordinal changed #1772

Merged

3 tasks

filonenko-mikhail closed this as completed in #1772 Mar 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`failover_promote` returns an error via `force_inconsistency: true` #1399

`failover_promote` returns an error via `force_inconsistency: true` #1399

dokshina commented Apr 29, 2021 •

edited by yngvar-antonsson

Loading

rosik commented Apr 29, 2021

filonenko-mikhail commented Jan 21, 2022 •

edited by yngvar-antonsson

Loading

opomuc commented Feb 11, 2022

filonenko-mikhail commented Feb 11, 2022

yngvar-antonsson commented Feb 17, 2022 •

edited

Loading

yngvar-antonsson commented Feb 22, 2022

failover_promote returns an error via force_inconsistency: true #1399

failover_promote returns an error via force_inconsistency: true #1399

Comments

dokshina commented Apr 29, 2021 • edited by yngvar-antonsson Loading

Bug description

Steps to reproduce

Actual behavior

Expected behavior

rosik commented Apr 29, 2021

filonenko-mikhail commented Jan 21, 2022 • edited by yngvar-antonsson Loading

opomuc commented Feb 11, 2022

filonenko-mikhail commented Feb 11, 2022

yngvar-antonsson commented Feb 17, 2022 • edited Loading

yngvar-antonsson commented Feb 22, 2022

`failover_promote` returns an error via `force_inconsistency: true` #1399

`failover_promote` returns an error via `force_inconsistency: true` #1399

dokshina commented Apr 29, 2021 •

edited by yngvar-antonsson

Loading

filonenko-mikhail commented Jan 21, 2022 •

edited by yngvar-antonsson

Loading

yngvar-antonsson commented Feb 17, 2022 •

edited

Loading