You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#4442 addressed an issue where offsets where not committed when leader epoch was the default (-1).
We've seen a different corner case, where offsets are not committed if the leader epoch is zero. This happens when enable.partition.eof is set to true and the fetcher reaches _PARTITION_EOF, as can be seen in the attached logs
At 11:07:40.304, the first FETCHERR with _PARTITION_EOF is logged, and the subsequent stored offset's epoch is reset to 0. Also, stored offset > committed offset (96011 vs 96010)
The consumer consumed several messages afterwards. Once the offsets of the fetched messages went past the stored offset, leader epoch went back to its previous value (1738) and auto-commits were triggered
At 11:07:46.788, another FETCHERR/_PARTITION_EOF appears in the log, stored offset epoch goes back to 0, no more triggered auto-commits.
From that point on, this partition received very few messages (less than 1 per minute). Because each CONSUME returned a single message, we hit this pathological behavior where stored offsets kept growing by 1, lag continued to grow, and only recovered once the partition received enough traffic to, once again, move past the stored offset (this happened many hours later)
Description
#4442 addressed an issue where offsets where not committed when leader epoch was the default (-1).
We've seen a different corner case, where offsets are not committed if the leader epoch is zero. This happens when
enable.partition.eof
is set to true and the fetcher reaches_PARTITION_EOF
, as can be seen in the attached logsHow to reproduce
Client configuration:
Checklist
IMPORTANT: We will close issues where the checklist has not been completed.
Please provide the following information:
2.4.0
3.4.1
<REPLACE with e.g., message.timeout.ms=123, auto.reset.offset=earliest, ..>
Red Hat Enterprise Linux 8.4 (x64)
debug=..
as necessary) from librdkafkaThe text was updated successfully, but these errors were encountered: