[2pt] Revise healthcheck predicates #1139
Comments
After a brief discussion with @olegrok and @mtrempoltsev, we agreed that RPC should consider 'suspects' as healthy.
I also think that OperationError shouldn't be considered unhealthy, even if that sounds silly. An RPC DoS and additional config applications are even worse, taking into account that OperationError is the result of
Dmitry Sharonov, [15.06.21 14:00] Dmitry Sharonov, [15.06.21 14:00] Yaroslav Dynnikov, [15.06.21 14:10] Yes, stateful failover isn't triggered by OperationError.
Nowadays there are several different health check predicates in Cartridge.
Eventual failover
cartridge/cartridge/failover.lua
Lines 150 to 161 in e5ca132
Stateful failover
cartridge/cartridge/roles/coordinator.lua
Lines 34 to 47 in e5ca132
RPC
cartridge/cartridge/rpc.lua
Lines 41 to 52 in e5ca132
I see two potential problems here:
get_active_leaders may return a suspect leader, and the RPC call would then prematurely return the error "No remotes with role %q available".
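To make the mismatch concrete, here is a minimal sketch in Python rather than Cartridge's Lua. It is only a model: the `Member` record, the three predicates, and the `active_leader` and `rpc_call` helpers are hypothetical names, and the exact conditions in the real `failover.lua`, `coordinator.lua` and `rpc.lua` are assumed, not quoted. It shows how a leader accepted by a suspect-tolerant failover check can be rejected by a stricter RPC check, and how the predicate proposed in the comments would avoid that.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Member:
    # Hypothetical stand-in for a membership record, not Cartridge's real structure.
    uuid: str
    status: str  # "alive", "suspect" or "dead"
    state: str   # e.g. "RolesConfigured" or "OperationError"


Predicate = Callable[[Member], bool]


def failover_healthy(m: Member) -> bool:
    # Assumed failover-side check: a suspect leader is kept.
    return m.status in ("alive", "suspect")


def rpc_healthy_strict(m: Member) -> bool:
    # Assumed RPC-side check: only alive members without OperationError.
    return m.status == "alive" and m.state != "OperationError"


def rpc_healthy_revised(m: Member) -> bool:
    # Proposal from the comments: suspects and OperationError both count as healthy.
    return m.status in ("alive", "suspect")


def active_leader(priority: List[Member], healthy: Predicate) -> Optional[Member]:
    # First member in failover priority order that passes the predicate.
    return next((m for m in priority if healthy(m)), None)


def rpc_call(role: str, leader: Optional[Member], healthy: Predicate) -> str:
    # Leader-only call: fails when the chosen leader is rejected by the RPC predicate.
    if leader is None or not healthy(leader):
        raise RuntimeError('No remotes with role "%s" available' % role)
    return leader.uuid


replicaset = [
    Member("uuid-1", "suspect", "RolesConfigured"),  # current leader, briefly unreachable
    Member("uuid-2", "alive", "RolesConfigured"),
]
leader = active_leader(replicaset, failover_healthy)

try:
    strict_result = rpc_call("myrole", leader, rpc_healthy_strict)
except RuntimeError as err:
    strict_result = str(err)

revised_result = rpc_call("myrole", leader, rpc_healthy_revised)
print(strict_result)
print(revised_result)
```

In this model the strict predicate fails the call even though an alive replica exists, because the failover side never moved leadership away from the suspect instance. Aligning the RPC predicate with the failover one removes that gap.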