fixes race condition in Searcher #1464

PSeitz · 2022-08-23T08:39:09Z

fixes #1461
fixes a race condition in Searcher, by avoiding repeated calls to open_segment_readers and passing them instead as argument

fulmicoton · 2022-08-23T09:08:41Z

@shikhar can you review #1464

fixes #1461 fixes a race condition in Searcher, by avoiding repeated calls to open_segment_readers and passing them instead as argument

src/core/searcher.rs

codecov-commenter · 2022-08-23T11:14:30Z

Codecov Report

Merging #1464 (227c252) into main (67d94f5) will decrease coverage by 0.00%.
The diff coverage is 95.65%.

@@            Coverage Diff             @@
##             main    #1464      +/-   ##
==========================================
- Coverage   94.07%   94.06%   -0.01%     
==========================================
  Files         238      239       +1     
  Lines       44564    44636      +72     
==========================================
+ Hits        41922    41989      +67     
- Misses       2642     2647       +5

Impacted Files	Coverage Δ
src/core/searcher.rs	`77.93% <85.71%> (+0.39%)`	⬆️
src/reader/mod.rs	`92.94% <100.00%> (-0.05%)`	⬇️
src/fastfield/serializer/mod.rs	`90.47% <0.00%> (-1.09%)`	⬇️
src/fastfield/writer.rs	`90.80% <0.00%> (-0.29%)`	⬇️
fastfield_codecs/src/lib.rs	`96.87% <0.00%> (-0.03%)`	⬇️
src/query/mod.rs	`100.00% <0.00%> (ø)`
src/query/query.rs	`85.71% <0.00%> (ø)`
src/query/boost_query.rs	`69.76% <0.00%> (ø)`
fastfield_codecs/src/bitpacked.rs	`100.00% <0.00%> (ø)`
src/query/disjunction_max_query.rs	`0.00% <0.00%> (ø)`
... and 14 more

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

shikhar

@PSeitz aside from clippy nits, LGTM

I did have one unrelated change I was sneaking into my PR, that I wanted to ask about.

Is Relaxed memory order safe here? Both a load and store are happening, so AcqRel seems safer to me. And this is by no means a hot path so being conservative seems fair.

tantivy/src/reader/mod.rs

Line 212 in 998b126

    
           let generation_id = searcher_generation_counter.fetch_add(1, atomic::Ordering::Relaxed);

shikhar · 2022-08-23T13:59:38Z

src/reader/mod.rs

@@ -204,7 +201,7 @@ impl InnerIndexReader {
        Ok(segment_readers)
    }

-    fn create_new_searcher_generation(
+    fn track_segment_readers_in_inventory(


The previous name seems better to me, since we are creating a SearcherGeneration and making sure to do it in a way that we track it int the inventory, and the inventory tracks SearcherGeneration not segment readers.

my issue with the previous name was that it's not clear that inventory is used to track and not the source of the generation

PSeitz · 2022-08-24T07:09:39Z

@PSeitz aside from clippy nits, LGTM

I did have one unrelated change I was sneaking into my PR, that I wanted to ask about.

Is Relaxed memory order safe here? Both a load and store are happening, so AcqRel seems safer to me. And this is by no means a hot path so being conservative seems fair.

tantivy/src/reader/mod.rs

Line 212 in 998b126

let generation_id = searcher_generation_counter.fetch_add(1, atomic::Ordering::Relaxed);

Yeah, I think there could be scenarios where simultaneous reloads are triggered at the same time from multiple threads.

fulmicoton · 2022-08-24T08:35:22Z

@shikhar The operation is still Atomic... I think we just want to ensure that the id delivered are unique, and Relaxed is sufficient for that. AcqRel stuff is about instruction reordering.

PSeitz · 2022-08-24T09:17:46Z

@shikhar I think we just want to ensure that the id delivered are unique, and Relaxed is sufficient for that... That being said it is not a hot part of the code. We can use SeqCst and not sweat it.

I think it could be an issue when two threads increment, but due to weak fencing, we get two times the same value back. If now one of those callers has a new segment in their list, we have the same id to different SearcherGeneration.

It's very unlikely though, and on many platforms Relaxed is already AcqRel.

adamreichold · 2022-08-24T09:45:39Z

I think it could be an issue when two threads increment, but due to weak fencing, we get two times the same value back.

The memory orderings mostly determine how other memory accesses are ordered w.r.t. the atomic accesses, they do not affect the atomicity of an operation like fetch_add, i.e. multiple threads doing fetch_add(1, Ordering::Relaxed) will never see identical values (ignoring overflow).

However, their accesses to other memory locations that are shared with other threads will not be ordered w.r.t. the ID generation, e.g. their accesses to searcher_generation_inventory are do not necessarily reflect their accesses to searcher_generation_counter, meaning that a thread with lower ID might see the stores to the inventory of a thread with a higher ID.

While I suspect that this is not an issue (this would have involve unsafe code if I understand things correctly), my personal choice would be Ordering::AcqRel as well just to capture any implicit dependencies which I do not see ATM. (I would suggest avoiding Ordering::SeqCst as mixing those two orderings can have unexpected effects and sticking to AcqRel is often preferable if global ordering is not required as it is simpler to reason about the explicit pairings between acquire and release operations to the same memory address.)

fulmicoton · 2022-08-24T12:14:54Z

It's very unlikely though, and on many platforms Relaxed is already AcqRel.

This is a very bad argument :). Also ARM is pretty popular.

we get two times the same value back

You cannot get two times the same value back... That's not what ordering is about.
The explanation here is ok:
https://en.cppreference.com/w/cpp/atomic/memory_order

...

Anyway let's defensively go for Ordering::AcqRel...

shikhar · 2022-08-24T15:02:22Z

Thanks folks, I need to learn more about memory orderings :)

fulmicoton requested a review from shikhar August 23, 2022 09:07

fixes race condition in Searcher

097a0b6

fixes #1461 fixes a race condition in Searcher, by avoiding repeated calls to open_segment_readers and passing them instead as argument

PSeitz force-pushed the searcher_race_condition branch from 667edff to 097a0b6 Compare August 23, 2022 09:13

fulmicoton reviewed Aug 23, 2022

View reviewed changes

src/core/searcher.rs Outdated Show resolved Hide resolved

debug_assert to assert

2f5b89c

shikhar reviewed Aug 23, 2022

View reviewed changes

shikhar mentioned this pull request Aug 23, 2022

Fix possibility of inconsistency between Searcher and SearcherGeneration #1462

Closed

use AcqRel

227c252

fulmicoton approved these changes Aug 24, 2022

View reviewed changes

fulmicoton merged commit bb01e99 into main Aug 24, 2022

fulmicoton deleted the searcher_race_condition branch August 24, 2022 12:17

This was referenced Jan 13, 2023

truncation comment PSeitz/tantivy#30

Closed

use stats PSeitz/tantivy#31

Closed

PSeitz mentioned this pull request Jan 31, 2023

update lz4 flex PSeitz/tantivy#33

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixes race condition in Searcher #1464

fixes race condition in Searcher #1464

PSeitz commented Aug 23, 2022

fulmicoton commented Aug 23, 2022

codecov-commenter commented Aug 23, 2022 •

edited

Loading

shikhar left a comment

shikhar Aug 23, 2022

PSeitz Aug 24, 2022

PSeitz commented Aug 24, 2022

fulmicoton commented Aug 24, 2022 •

edited

Loading

PSeitz commented Aug 24, 2022

adamreichold commented Aug 24, 2022 •

edited

Loading

fulmicoton commented Aug 24, 2022 •

edited

Loading

shikhar commented Aug 24, 2022

fixes race condition in Searcher #1464

fixes race condition in Searcher #1464

Conversation

PSeitz commented Aug 23, 2022

fulmicoton commented Aug 23, 2022

codecov-commenter commented Aug 23, 2022 • edited Loading

Codecov Report

shikhar left a comment

Choose a reason for hiding this comment

shikhar Aug 23, 2022

Choose a reason for hiding this comment

PSeitz Aug 24, 2022

Choose a reason for hiding this comment

PSeitz commented Aug 24, 2022

fulmicoton commented Aug 24, 2022 • edited Loading

PSeitz commented Aug 24, 2022

adamreichold commented Aug 24, 2022 • edited Loading

fulmicoton commented Aug 24, 2022 • edited Loading

shikhar commented Aug 24, 2022

codecov-commenter commented Aug 23, 2022 •

edited

Loading

fulmicoton commented Aug 24, 2022 •

edited

Loading

adamreichold commented Aug 24, 2022 •

edited

Loading

fulmicoton commented Aug 24, 2022 •

edited

Loading