`str::contains` is very slow for substring checks #25483

netvl · 2015-05-16T08:33:14Z

Originally found in SO question.

This Rust code

fn main(){

    let needle   = (0..100).map(|_| "b").collect::<String>();
    let heystack = (0..100_000).map(|_| "a").collect::<String>();

    println!("Data ready.");

    for _ in 0..1_000_000 {
        if heystack.contains( &needle ) {
            // Stuff...
        }
    }

}

Executes for up to three minutes on my machine (even with -C opt-level=3), while the equivalent Ruby or Python code:

needle   = 'b' * 100
heystack = 'a' * 100_000

puts 'Data ready.'

1_000_000.times do
    heystack.include? needle
end

finishes in several seconds. Java variant finishes almost instantly (but that seems to be due to JIT optimizations).

It looks like that pattern searching for substrings could be severely optimized.

The text was updated successfully, but these errors were encountered:

bluss · 2015-05-16T11:36:34Z

StrSearcher has a lot of panic branches that can probably avoided (slicing and unwrapping that will never happen).

ArtemGr · 2015-05-16T11:41:12Z

I wonder wether LLVM can move the comparison out of the loop and if not, why?

huonw · 2015-05-16T12:53:43Z

I think this is fundamentally algorithmic problem, since StrSearcher is performing a naive string search at the moment. Eliminating unnecessary panics etc should probably wait until that problem is resolved (I could be wrong).

bluss · 2015-05-16T13:22:42Z

Oh wow, I thought that was something that we had already fixed.

gereeter · 2015-05-16T18:37:53Z

The algorithmic problem was fixed, but this fix got disabled in the switch to the pattern API. The two-way search code is still present as dead code.

DanielKeep · 2015-05-17T12:41:52Z

For comparison, I just did a straight translation of a C implementation of BMH. Results for the test in the SO question were:

test tests::boyermoore_horspool_memmem::case_tasos  ... bench:      5241 ns/iter (+/- 222)
test tests::naive::case_tasos                       ... bench:   1567384 ns/iter (+/- 64776)
test tests::std_str_find::case_tasos                ... bench:    864885 ns/iter (+/- 45622)

Where naive is just a 100% naive char-by-char search.

alexcrichton · 2015-05-28T16:23:03Z

triage: I-nominated

bluss · 2015-05-28T19:56:52Z

@DanielKeep Can it be adapted to the pattern API?

DanielKeep · 2015-05-29T04:58:11Z

@bluss I don't see why not; the signature is just (&str, &str) -> Option<usize>, but given that there's already a (not being used) higher-performance searcher in the stdlib, it'd probably make more sense to restore that first and then consider further improvements independently. So long as what's there isn't pathologically bad, it'll still be a considerable improvement. :)

(Not that my BMH implementation is likely to be an improvement: it was literally just copy, pasted, and translated from the C reference code. Hell, it uses... indexing. I was in a hurry. :P)

Kimundi · 2015-05-29T10:09:52Z

Yeah, after implementing the pattern API I haven't found the time to reintegrate the old fast searcher code yet. Part of the issue why I haven't done it directly is that the fast string searcher only supports single ended search, while with the Pattern API you'd want double ended search.

One way to resolve this is to just hack the naive and the fast search together into a hybrid that uses the fast search for forward iteration (which is all that happened before anyway), and the naive for backwards iteration, but due to me not fully understanding the fast algorithm I was reluctant to do that as part of the original patch.

Wouldn't harm to just do this now and see if any bugs appear though...

alexcrichton · 2015-06-01T17:10:53Z

triage: P-medium

bluss · 2015-06-14T23:21:30Z

This is allegedly a very interesting improvement upon our two-way search algorithm. http://drops.dagstuhl.de/opus/volltexte/2011/3355/

@gereeter

To improve our substring search performance, revive the two way searcher and adapt it to the Pattern API. Fixes rust-lang#25483, a performance bug: that particular case now completes faster in optimized rust than in ruby (but they share the same order of magnitude). Much thanks to @gereeter who helped me understand the reverse case better and wrote the comment explaining `next_back` in the code. I had quickcheck to fuzz test forward and reverse searching thoroughly. The two way searcher implements both forward and reverse search, but not double ended search. The forward and reverse parts of the two way searcher are completely independent. The two way searcher algorithm has very small, constant space overhead, requiring no dynamic allocation. Our implementation is relatively fast, especially due to the `byteset` addition to the algorithm, which speeds up many no-match cases. A bad case for the two way algorithm is: ``` let haystack = (0..10_000).map(|_| "dac").collect::<String>(); let needle = (0..100).map(|_| "bac").collect::<String>()); ``` For this particular case, two way is not much faster than the naive implementation it replaces.

@gereeter

Update substring search to use the Two Way algorithm To improve our substring search performance, revive the two way searcher and adapt it to the Pattern API. Fixes #25483, a performance bug: that particular case now completes faster in optimized rust than in ruby (but they share the same order of magnitude). Many thanks to @gereeter who helped me understand the reverse case better and wrote the comment explaining `next_back` in the code. I had quickcheck to fuzz test forward and reverse searching thoroughly. The two way searcher implements both forward and reverse search, but not double ended search. The forward and reverse parts of the two way searcher are completely independent. The two way searcher algorithm has very small, constant space overhead, requiring no dynamic allocation. Our implementation is relatively fast, especially due to the `byteset` addition to the algorithm, which speeds up many no-match cases. A bad case for the two way algorithm is: ``` let haystack = (0..10_000).map(|_| "dac").collect::<String>(); let needle = (0..100).map(|_| "bac").collect::<String>()); ``` For this particular case, two way is not much faster than the naive implementation it replaces.

huonw added the I-slow Issue: Problems and improvements with respect to performance of generated code. label May 16, 2015

alexcrichton added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label May 28, 2015

rust-highfive added the I-nominated label May 28, 2015

rust-highfive added P-medium Medium priority and removed I-nominated labels Jun 1, 2015

bluss mentioned this issue Jun 15, 2015

Update substring search to use the Two Way algorithm #26327

Merged

bors closed this as completed in #26327 Jun 30, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`str::contains` is very slow for substring checks #25483

`str::contains` is very slow for substring checks #25483

netvl commented May 16, 2015

bluss commented May 16, 2015

ArtemGr commented May 16, 2015

huonw commented May 16, 2015

bluss commented May 16, 2015

gereeter commented May 16, 2015

DanielKeep commented May 17, 2015

alexcrichton commented May 28, 2015

bluss commented May 28, 2015

DanielKeep commented May 29, 2015

Kimundi commented May 29, 2015

alexcrichton commented Jun 1, 2015

bluss commented Jun 14, 2015

str::contains is very slow for substring checks #25483

str::contains is very slow for substring checks #25483

Comments

netvl commented May 16, 2015

bluss commented May 16, 2015

ArtemGr commented May 16, 2015

huonw commented May 16, 2015

bluss commented May 16, 2015

gereeter commented May 16, 2015

DanielKeep commented May 17, 2015

alexcrichton commented May 28, 2015

bluss commented May 28, 2015

DanielKeep commented May 29, 2015

Kimundi commented May 29, 2015

alexcrichton commented Jun 1, 2015

bluss commented Jun 14, 2015

`str::contains` is very slow for substring checks #25483

`str::contains` is very slow for substring checks #25483