Fence writer zombies (breaking change) #255

fqtab · 2024-05-23T16:39:04Z

Implements writer zombie fencing by using consumer-side zombie fencing (rather than producer-side zombie fencing)
Removes unnecessary control-group-id
- Note: the Coordinator consumer group remains
Fixes documentation about consumer groups

Out of scope for this PR: Coordinator zombie fencing

fqtab · 2024-05-23T16:44:05Z

kafka-connect/src/main/java/io/tabular/iceberg/connect/channel/CommitterImpl.java

-    send(events, offsets, new ConsumerGroupMetadata(config.controlGroupId()));
-    send(ImmutableList.of(), offsets, new ConsumerGroupMetadata(config.connectGroupId()));
+    send(events, offsets, consumerGroupMetadata);


Notice how we commit offsets against only one consumer group now; the connect-<connector-name> consumer group.

We no longer commit source topic offsets to the config.controlGroupId and TBH I don't understand why we ever did since we could have always taken this approach (irrespective of zombie fencing) cc @bryanck if you can shed any light here as to why this was necessary in the past or if it was just an oversight.

fqtab · 2024-05-23T20:26:57Z

kafka-connect/src/main/java/io/tabular/iceberg/connect/channel/CommitterImpl.java

+    try {
+      groupMetadata = KafkaUtils.consumerGroupMetadata(context);
+    } catch (IllegalArgumentException e) {
+      LOG.warn("Could not extract ConsumerGroupMetadata from consumer inside Kafka Connect, falling back to simple ConsumerGroupMetadata which can result in duplicates from zombie tasks");
+      groupMetadata = new ConsumerGroupMetadata(config.connectGroupId());
+    }


We fetch the consumer-group-metadata via reflection from inside the Kafka Connect framework. This is technically unsafe as we are relying on private, implementation details. Hence I also implemented falling back to simple ConsumerGroupMetadata (which is basically what we were doing previously) and does not do zombie fencing.

Why not just fail?

tabmatfournier · 2024-05-23T21:08:37Z

kafka-connect/src/main/java/io/tabular/iceberg/connect/channel/KafkaUtils.java

+  private static final String WorkerSinkTaskContextClassName =
+          WorkerSinkTaskContext.class.getName();
+
+  @SuppressWarnings("unchecked")


worth a comment around using reflection to get at some very specific implementation detail stuff here but otherwise 👍

tabmatfournier · 2024-05-23T21:08:58Z

README.md

@@ -170,6 +172,105 @@ from the classpath are loaded. Next, if `iceberg.hadoop-conf-dir` is specified,
 are loaded from that location. Finally, any `iceberg.hadoop.*` properties from the sink config are
 applied. When merging these, the order of precedence is sink config > config dir > classpath.

+# Upgrade


Nice docs. Thanks for looking out for the users.

fqtab changed the title ~~Fence writer zombies~~ Fence writer zombies (breaking change) May 23, 2024

fqtab force-pushed the fence_writer_zombies branch 2 times, most recently from 36f5b85 to 0e62a0d Compare May 23, 2024 20:14

fqtab commented May 23, 2024

View reviewed changes

fqtab marked this pull request as ready for review May 23, 2024 20:28

tabmatfournier reviewed May 23, 2024

View reviewed changes

tabmatfournier approved these changes May 23, 2024

View reviewed changes

fqtab marked this pull request as draft May 31, 2024 18:56

fqaiser94 added 2 commits June 4, 2024 11:24

Fence writer zombies

80e541a

Should not need to rewind offsets anymore

682a5ad

fqtab force-pushed the fence_writer_zombies branch from 0e62a0d to 682a5ad Compare June 4, 2024 15:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fence writer zombies (breaking change) #255

Fence writer zombies (breaking change) #255

fqtab commented May 23, 2024 •

edited

Loading

fqtab May 23, 2024

fqtab May 23, 2024

tabmatfournier May 23, 2024

tabmatfournier May 23, 2024

tabmatfournier May 23, 2024

Fence writer zombies (breaking change) #255

Are you sure you want to change the base?

Fence writer zombies (breaking change) #255

Conversation

fqtab commented May 23, 2024 • edited Loading

fqtab May 23, 2024

Choose a reason for hiding this comment

fqtab May 23, 2024

Choose a reason for hiding this comment

tabmatfournier May 23, 2024

Choose a reason for hiding this comment

tabmatfournier May 23, 2024

Choose a reason for hiding this comment

tabmatfournier May 23, 2024

Choose a reason for hiding this comment

fqtab commented May 23, 2024 •

edited

Loading