Fix for cursor reset after topic reload #315
Conversation
cc @sschepens
```diff
@@ -240,7 +240,6 @@ protected void recoverFromLedger(final ManagedCursorInfo info, final VoidCallbac
         }

         // Read the last entry in the ledger
-        cursorLedger = lh;
```
So, if we don't initialize `cursorLedger` at recovery time, it will initially be null, and at that time the internal stats will not return the correct value for `cursorLedgerLastEntry`. Is there any specific reason not to initialize it at recovery time?
Very good point. I removed it because it is confusing to use the same variable to store two different kinds of ledger handle (one read-write and the other read-only).

Anyway, I think it still makes sense for the internal stats. In this case we would see:

```
cursorLedger = -1
cursorLedgerLastEntry = -1
```

and that is appropriate, since we are in the `NoLedger` state.

Below is the code that returns the `-1`:
```java
public long getCursorLedger() {
    LedgerHandle lh = cursorLedger;
    return lh != null ? lh.getId() : -1;
}

public long getCursorLedgerLastEntry() {
    LedgerHandle lh = cursorLedger;
    return lh != null ? lh.getLastAddConfirmed() : -1;
}
```
```diff
-        } else {
-            internalFlushPendingMarkDeletes();
-        }
+        internalFlushPendingMarkDeletes();
```
`internalFlushPendingMarkDeletes` just clears the previous `pendingMarkDeleteOps` and does not complete the callback of each `PendingMarkDeleteEntry`, and someone could be waiting for that callback. One example is `PersistentReplicator`, which only waits in order to do a debug log, which is fine; the concern is that, since there is a callback, one can wait on it.
Actually, that is covered 😉

Take a look at `internalMarkDelete()`; there's this code there:
```java
// Trigger the final callback after having (eventually) triggered the switching-ledger
// operation. This will ensure that no race condition will happen between the next
// mark-delete and the switching operation.
if (mdEntry.callbackGroup != null) {
    // Trigger the callback for every request in the group
    for (PendingMarkDeleteEntry e : mdEntry.callbackGroup) {
        e.callback.markDeleteComplete(e.ctx);
    }
} else {
    // Only trigger the callback for the current request
    mdEntry.callback.markDeleteComplete(mdEntry.ctx);
}
```
So, a single write triggers all the callbacks.
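The grouping behavior can be sketched as follows. This is a minimal standalone illustration, not the actual Pulsar classes: the `PendingEntry` class, the `Callback` interface, and the coalescing step are simplified stand-ins for `PendingMarkDeleteEntry` and its callback.

```java
import java.util.ArrayList;
import java.util.List;

public class MarkDeleteGroupingSketch {
    // Simplified callback interface, modeled after the mark-delete callback
    interface Callback {
        void markDeleteComplete(Object ctx);
    }

    // Simplified pending entry: a position, a callback, and an optional
    // group of earlier entries that were coalesced behind this one
    static class PendingEntry {
        final long position;
        final Callback callback;
        final Object ctx;
        List<PendingEntry> callbackGroup; // set when entries are coalesced

        PendingEntry(long position, Callback callback, Object ctx) {
            this.position = position;
            this.callback = callback;
            this.ctx = ctx;
        }
    }

    // After the single ledger write for the coalesced entry succeeds,
    // every waiting caller's callback is triggered
    static void completeAfterWrite(PendingEntry mdEntry) {
        if (mdEntry.callbackGroup != null) {
            for (PendingEntry e : mdEntry.callbackGroup) {
                e.callback.markDeleteComplete(e.ctx);
            }
        } else {
            mdEntry.callback.markDeleteComplete(mdEntry.ctx);
        }
    }

    public static void main(String[] args) {
        List<Long> completed = new ArrayList<>();
        Callback cb = ctx -> completed.add((Long) ctx);

        // Three requests arrive while a write is in flight; they are
        // grouped behind the last one
        PendingEntry last = new PendingEntry(3, cb, 3L);
        last.callbackGroup = new ArrayList<>();
        last.callbackGroup.add(new PendingEntry(1, cb, 1L));
        last.callbackGroup.add(new PendingEntry(2, cb, 2L));
        last.callbackGroup.add(last);

        completeAfterWrite(last); // one write, all three callbacks fire
        System.out.println(completed);
    }
}
```

This is why a caller such as `PersistentReplicator` still gets its callback even though only one physical write happens for the whole group.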
I see.
👍
Motivation
As reported in #309, the cursor reset operation will fail after a topic reload when no new mark-delete operation was issued, since the cursor ledger was set to a read-only ledger.
Modifications
Changed the reset logic to rely on `internalAsyncMarkDelete()`, which forces the creation of a new ledger when the cursor is in the `NoLedger` state.

Fixes #309