Raft node failing to recover from non-graceful shutdown #807
Comments
Thanks @jimthematrix for the reference. We'll look into pulling it from upstream.
Hi @jimthematrix, the Raft node non-graceful shutdown issue is a separate one, and this fix will not solve it. We are looking at a different solution for Raft nodes failing to recover from a non-graceful shutdown, and I am raising a separate issue for that. Since the upstream issue was fixed for Clique consensus, I have tried to reproduce it with Clique and so far have not been able to. Did you test with Clique consensus as well, and if so, did you observe this issue?
Hi @vsmk98
Hi @jimmy-dg, if you are using Quorum version 2.2.5, this issue should not happen. If you are using an earlier version, please bring up Geth with
We'd like to advocate for this fix to be ported to Quorum:
ethereum/go-ethereum#19862
Reason: if a Quorum node experiences a non-graceful shutdown (the equivalent of kill -9), the persisted chain gets corrupted because the head has not been properly flushed from memory (a flush that is performed during the graceful shutdown procedure).
Version: Geth/v1.8.18-stable-ef256cb2(quorum-v2.2.3)/linux-amd64/go1.10.1
When the node came back up, it saw the corrupt head and reverted to the last validated block.
Later, it failed to rebuild the chain from that earlier block. (I tried the same non-graceful shutdown with an IBFT chain. It also had to revert to an earlier block, but it was able to reconcile and catch up to the head.)
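For readers less familiar with the failure mode, here is a toy Go sketch of the behaviour described above: blocks are persisted as they arrive, state is only flushed periodically, and after a simulated kill -9 the node rewinds its head to the newest block whose state actually reached disk. The type `toyChain`, its methods, and the flush-every-4th-block interval are all hypothetical and chosen for illustration; this is not Quorum's or go-ethereum's actual code.

```go
package main

import "fmt"

// toyChain is a hypothetical stand-in for a node's storage (not a Quorum type).
type toyChain struct {
	head        uint64          // number of the latest block written to disk
	stateOnDisk map[uint64]bool // blocks whose state has been flushed to disk
}

// newToyChain returns a chain whose genesis state (block 0) is already on disk.
func newToyChain() *toyChain {
	return &toyChain{stateOnDisk: map[uint64]bool{0: true}}
}

// importBlock persists block n. Its state stays in memory unless flush is true.
func (c *toyChain) importBlock(n uint64, flush bool) {
	c.head = n
	if flush {
		c.stateOnDisk[n] = true
	}
}

// rewindToLastState walks back from the head to the newest block whose state
// is on disk, makes it the new head, and returns its number.
func (c *toyChain) rewindToLastState() uint64 {
	for c.head > 0 && !c.stateOnDisk[c.head] {
		c.head--
	}
	return c.head
}

func main() {
	c := newToyChain()
	for n := uint64(1); n <= 10; n++ {
		c.importBlock(n, n%4 == 0) // assumption: state is flushed every 4th block
	}
	// Simulated kill -9: no shutdown hook runs, so the in-memory state for
	// blocks 9 and 10 is lost even though the blocks themselves are on disk.
	fmt.Println("head before recovery:", c.head)
	fmt.Println("head after recovery:", c.rewindToLastState())
}
```

In this model the node restarts at block 8 and must re-obtain blocks 9 and 10 to catch up, which corresponds to the rebuild step that reportedly succeeded on IBFT but failed on Raft.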