-
Notifications
You must be signed in to change notification settings - Fork 902
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No such ledger exists on Bookies but ledgermetadata exist #4287
Comments
@hamadodene It would be helpful to share Pulsar version & Bookkeeper version & possible customized Ensemble size (E), write quorum (Qw) and ack quorum (Qa) size. |
@lhotari Yes, we have pulsar 3.0.3, bk 4.16.4 |
@hamadodene noticed this in the output that you shared:
in Pulsar, the default is When you run Bookkeeper, do you use This might not be relevant in this context, but just just wondering if large ZNodes with low jute.maxbuffer value could result in inconsistencies. |
When running Bookkeeper with Pulsar's |
@hamadodene do you use offloading? I found issue apache/pulsar#21737 which could be related in that case. |
also apache/pulsar#15464 |
@lhotari We recently forced the This caused problems because during the update, the ledger Pulsar ZNodes were written with hierarchical layout, while other nodes were written with flat layout. Perhaps this caused the inconsistencies. However, Bookkeeper seemed to write without errors (at least it wrote the ZNodes); perhaps the missing ledgers in the logs are those written before we fixed the layout? The update was made from Pulsar 2.9.5 to 3.0.3 and Bookkeeper 4.14.4 to 4.16.4. |
I suggest to use that and let the clients automatically discover. |
@eolivelli our system is pretty old. In the znode After the pulsar upgrade we noticed that the ledgers for pulsar topics were created with the But the strange thing @hamadodene is reporting, is that pulsar was (apparently) able to publish messages on the topics, but could not read the messages because bk was throwing Is it possible that bk was creating the znode for the ledger (with |
BUG REPORT
Describe the bug
We have noticed a strange behavior in our Bookkeeper cluster in production. In summary, we are currently unable to access the data of some ledgers that should have been created by Bookkeeper and therefore should exist. When we try to find the ledger using the Bookkeeper CLI:
However, when we try to read the ledger using the CLI:
./bookkeeper readledger -ledgerid 15543
Note:
We also checked in the entry log files, and it really seems that ledger does not exist.
Furthermore, when that ledger was created by Apache Pulsar, Pulsar did not give any errors during writing. But when trying to read the ledger, Bookkeeper responded with "No such ledger exists on Bookies."
Do you have any information on what the problem might be or how we can debug this issue?
To Reproduce
We were unable to reproduce the issue.
Expected behavior
Given that the metadata exists, I expect the ledger to actually exist on Bookkeeper as well. We have not performed any ledger deletions on Bookkeeper.
Pulsar version: 3.0.3
Bookeeper version: 4.16.4
The text was updated successfully, but these errors were encountered: