
CORE-2321: cloud_storage: Fix remote_partition stuck reader #17805

Merged 4 commits into redpanda-data:dev on Apr 11, 2024

Conversation

@Lazin Lazin (Contributor) commented Apr 11, 2024

This PR adds a new fuzz test that reproduces the stuck-reader problem and adds a fix. The fix resets the reader when it has reached max_offset.

Fixes #17788

Backports Required

  • none - not a bug fix
  • none - this is a backport
  • none - issue does not exist in previous branches
  • none - papercut/not impactful enough to backport
  • v23.3.x
  • v23.2.x

Release Notes

Bug Fixes

  • Fix a problem in Tiered Storage that could potentially cause consumers to get stuck

Lazin added 2 commits April 11, 2024 15:19
The 'partition_record_batch_reader_impl' uses an internal
'remote_segment_batch_reader' to fetch data. When the internal reader
reaches 'max_offset' it returns an empty result, and
'partition_record_batch_reader_impl::do_load_slice' returns an empty
result in this case as well. The top level reader is driven by
'model::record_batch_reader', which has an internal async loop that
runs until EOS is reached. If the current slice is not empty the loop
invokes a user provided callback, then checks whether the stream
reached EOS (and exits if so), and then calls 'load_slice' on the
underlying stream (this is where 'do_load_slice' is invoked).

Since the low level segment reader has already reached max_offset it
will keep returning an empty value, and the top level record batch
reader will loop forever.

This commit breaks the loop by checking the current offset of the
segment reader whenever an empty result is returned. If the current
offset overshoots max_offset, the low level stream is reset and the
top level loop exits after that.
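
A minimal sketch of the idea (illustrative only; '_seg_reader',
'_config', 'read_some', 'current_offset' and 'reset' below are
placeholder names for this sketch, not the exact Redpanda identifiers):

// Sketch of the check added to the partition reader's do_load_slice.
ss::future<model::record_batch_reader::storage_t>
do_load_slice(model::timeout_clock::time_point deadline) {
    auto slice = co_await _seg_reader->read_some(deadline);
    if (slice.empty()
        && _seg_reader->current_offset() > _config.max_offset) {
        // The segment reader overshot max_offset; without a reset it
        // would keep returning empty slices and the record_batch_reader
        // loop would never observe end-of-stream.
        co_await _seg_reader->reset();
    }
    co_return std::move(slice);
}
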
Add a fuzz test that generates data with a lot of tx-fence batches and
scans the entire partition one batch at a time by setting max_bytes=1
and max_offset to 'reader_config.start_offset + 1'.
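
Roughly, the scan described above has the following shape (a sketch
only; 'make_reader' is a placeholder for the test helper and the
offset-advance step is simplified):

// Sketch of the one-batch-at-a-time scan used by the fuzz test.
storage::log_reader_config reader_config(
  partition_start, partition_end, ss::default_priority_class());
reader_config.max_bytes = 1; // at most one record batch per read
while (reader_config.start_offset <= partition_end) {
    reader_config.max_offset = model::next_offset(reader_config.start_offset);
    auto reader = make_reader(reader_config); // placeholder helper
    // Before the fix, this consume() could hang once the segment reader
    // overshot max_offset and kept returning empty slices.
    auto headers_read
      = reader.consume(test_consumer(), model::no_timeout).get();
    // Simplified advance; the real test derives the next start offset
    // from the headers it just read.
    reader_config.start_offset = model::next_offset(reader_config.max_offset);
}
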
Lazin added 2 commits April 11, 2024 15:32
The exception is no longer thrown anywhere, so it is safe to remove
the 'catch' block.
@dotnwat dotnwat changed the title from "cloud_storage: Fix remote_partition stuck reader" to "CORE-2321: cloud_storage: Fix remote_partition stuck reader" on Apr 11, 2024
Comment on lines 439 to 454
// The reader overshot the offset. If we do not reset
// the stream, the loop inside the
// record_batch_reader will keep calling this method
// again and again. We would be returning an empty
// result every time because the current offset is
// past the max allowed offset. Resetting the
// segment reader fixes the issue.
//
// The current reader can return an empty result in
// several cases:
// - we reached max_offset (covered here)
// - we reached end of stream (covered above, right
//   after the 'read_some' call)
//
// If we reached max_bytes then the result won't be
// empty. It will have at least one record batch.

Member:
Nice!

@dotnwat dotnwat merged commit 4ff87bc into redpanda-data:dev Apr 11, 2024
18 checks passed
@vbotbuildovich (Collaborator) commented:

/backport v23.3.x

@andijcr andijcr (Contributor) left a comment

looks good

Comment on lines +838 to +847
if (maybe_max_segments) {
config::shard_local_cfg()
.cloud_storage_max_materialized_segments_per_shard.set_value(
maybe_max_segments);
}
if (maybe_max_readers) {
config::shard_local_cfg()
.cloud_storage_max_segment_readers_per_shard.set_value(
maybe_max_readers);
}
Contributor:

maybe they should use scoped_config, to auto-reset at the end of the function

Contributor Author (Lazin):

yes, but it doesn't matter that much for fuzz tests
but I agree, we should do this eventually
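
For reference, a hedged sketch of the scoped_config variant discussed
above, assuming the test-side scoped_config helper exposes a
get(name).set_value(...) interface and restores the overridden
properties when it goes out of scope (an assumption, not something
shown in this PR):

// Sketch only: would replace the direct shard_local_cfg() writes
// quoted above, so the overrides revert automatically at the end of
// the test function.
scoped_config test_local_cfg;
if (maybe_max_segments) {
    test_local_cfg.get("cloud_storage_max_materialized_segments_per_shard")
      .set_value(maybe_max_segments);
}
if (maybe_max_readers) {
    test_local_cfg.get("cloud_storage_max_segment_readers_per_shard")
      .set_value(maybe_max_readers);
}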

Comment on lines +881 to +882
auto headers_read
= reader.consume(test_consumer(), model::no_timeout).get();
Contributor:

is this the point where it could get stuck, without this fix?

Contributor Author (Lazin):

yep


Successfully merging this pull request may close these issues.

cloud_storage: The remote partition reader can get stuck