-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Channel ERROR: "failing link: unable to resolve fwd pkgs: bucket not found with error: internal error" #6593
Comments
The rebalancing scripts were working fine. I guess the main issue was trying to run too many instances in the same time, this killed my system (out of memory). |
I run LND 14.2. and yesterday, I opened a channel with Bcash who I'm told runs CLN. This got force closed remotely this morning. Looked at the logs. This stands out : ChannelLink(2a4829c56e036b97422be0c61d5a6d926a47317 86650a4b00ba55f1cc71a98b7:2): failing link: unable to update commitment: cannot add duplicate ChannelLink(2a4829c56e036b97422be0c61d5a6d926a4731786650a4b00ba55f1cc71a98b7:2): failing link: unable to complete dance with error: remote unresponsive [ERR] HSWC: ChannelLink(2a4829c56e036b97422be0c61d5a6d926a4731786650a4b00ba55f1cc71a98b7:2): failing link: unable to synchronize channel states: first message sent to sync should be ChannelReestablish, instead received: *lnwire.Error with error: unable to resume channel, recovery required [ERR] HSWC: ChannelLink(2a4829c56e036b97422be0c61d5a6d926a4731786650a4b00ba55f1cc71a98b7:2): failing link: unable to update commitment: cannot add duplicate keystone with error: internal error |
@indomitorum Your log looks like it's related to @C-Otto's issue, which should be resolved #6485 |
Is this the same issue as #6485? If so, it'll be resolved in 0.15. |
The reason for the error message seems to be something different in this case: |
This is not the same issue |
@ZoltanAB do you have more logs for this channel for several minutes before and after the above error? When did the node OOM? Relevant log categories would be HSWC, PEER, LNWL, CHDB. |
Does the log file contain any sensitive information? If not, I could send you the log file around that date and hour. Please advise. Thank you. |
Just an info from me regarding the severity of this issue: I have had 12 force closes due to this issue in the last seven days. That's a little more than 1% of my channels. |
It contains privacy-leaking information (channel points, etc) - which I don't need if you want to redact them out. I am eugene on the lnd slack |
Thank you for looking into this @Crypt-iQ. |
@Crypt-iQ can I use your email address (el.....l@gmail.com) to send you the generated log files? |
And FYI, today I had another similar FC around 02:10 AM GMT. Here is a graph of my load on the server for the last 24 hours: |
yup |
Did you get the same |
Yes. Sending you now. |
Just sent you the logs. Thank you. |
Thanks for the logs, I know why this happens. I'll start working on a fix |
Glad I could contribute a little.
…On Wed, Jun 8, 2022, 19:53 Eugene ***@***.***> wrote:
Thanks for the logs, I know why this happens. I'll start working on a fix
—
Reply to this email directly, view it on GitHub
<#6593 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ASBNXJV55776I4YQHTEOBO3VODFY7ANCNFSM5XJWSZIQ>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
@Crypt-iQ any update on this? Just had another FC due to this issue. Thank you, If needed, can send you more logs. |
This won't get into 0.15 since that is right around the corner and I want the fix to receive review w/o being subject to a deadline. I could provide a patch this week that you could apply to your node if you are comfortable, but you'd need to revert it first when upgrading to any other version |
preliminary fix is here #6642 - hopefully it survives review - it did fix my local repro case. I would recommend not patching this on your node until it receives adequate review or 0.15.1 is released |
Background
I run a CLN node and have experienced quite a couple of instances where my node force-closed a channel, due to the LND peer sending an
internal error
message.I finally had this error with a peer that was able to provide the relevant logs (@ZoltanAB)
LND environment
0.14.2-beta
Linux ipayblue-1 5.10.0-13-amd64 #1 SMP Debian 5.10.106-1 (2022-03-17) x86_64 GNU/Linux
rebalance-lnd
script(if that's relevant)
Steps to reproduce
Have a channel between LND / CLN that forwards HTLCs.
Expected behaviour
LND should not send an error.
Actual behaviour
LND sends an error.
Logs
LND Logs (peer A)
CLN logs (peer B)
Additional info
The LND node was heavily rebalancing and thus running into memory issues about 7 minutes before the event (no log entries up to
2022-05-29 21:40:02.124
).As you can tell from the graph, they stopped their rebalancing script a couple of hours after the crash.
The text was updated successfully, but these errors were encountered: