-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Parachain and Rococo relaychain randomly crash at 0.9.22 and 0.9.23 #5639
Comments
Seems like a panic in libp2p? Could be related to paritytech/substrate#11009? cc @kpp @tomaka |
If it relates to libp2p, in my local net, I ran 2 parachain nodes in different VMs, |
Seems like this is libp2p/rust-libp2p#2598. cc @mxinden |
Upstream bug report: libp2p/rust-libp2p#2599 |
Same issue when running the Substrate tutorial: https://docs.substrate.io/tutorials/v3/cumulus/start-relay/ Both relay chain validator and parachain node panic. |
I have the same issue after updating parachain to 0.9.26. @jasl have you managed to fix this and run parachain on higher versions, and is this only happened on Rococo connected parachains? |
This is a libp2p bug and not fix yet, so latest code affacts too, and not only affact Rococo relaychain, Polkadot & Kusama too, so IMO this issue should be a medium or high important |
According to nimiq/core-rs-albatross#732 (comment):
Do you observe similar things with the given conditions? |
For us it might not relate to memory, we have 16GB mem and 16GB swap the problem occured in this server more than 5 times, but latest 1 week is good I got a bunch of reports from our users, I think I may asking them |
Ah I forgot I've downgrade to polkadot 0.9.18 so it's good. |
Phala-Network/khala-parachain#150 (comment) here's an user replied
|
Hi everyone, can someone provide logs in the time range 10 seconds before the panic with debug level ( |
Cross-referencing libp2p/rust-yamux#137 here. Help testing would be very much appreciated. |
I can't stable reproduce the problem in short time, but I'll try to build a testnet to run few days |
How can we prove the issue is fixed? just run and watch no more crash? |
In case you no longer see any |
Hey guys, do you consider this bug fixed then? We are experiencing the same error message in
|
👋 It is not yet fixed. I prepared a dirty patch ("next-frame-error" branch) to validate my suspicion. According to the above testing, my suspicion is correct. Thus I created libp2p/rust-yamux#138 which is a patch I would consider ready for merge (i.e. not dirty). In case you want to help this process, you could run libp2p/rust-yamux#138 on one of your test networks via the below in your [patch.crates-io]
yamux = { git = 'https://github.com/mxinden/yamux', branch = "next-result-immediately } Once I release libp2p/rust-yamux#138, you only need to run |
Heads up, Please upgrade to the new version. Thanks for the help everyone! |
@ordian could you help to release a new polkadot v0.9.27 binary? |
As commented here, no need to create a new polkadot release: libp2p/rust-libp2p#2598 (comment) |
thanks @mxinden , I can confirm that applying the patch yesterday solved our problem, now we just updated yamux to |
polkadot-v0.9.23
branch) and Polkadotrococo-local
(as relaychain forpolkadot-launch
)khala-node
(connect to live net)polkadot-v0.9.22
polkadot-v0.9.20
and below has this issuepolkadot-v0.9.20
branch, restart works, the user not record any log)the crash log
The text was updated successfully, but these errors were encountered: