Fix all ASAN issues in vectorscan #93

danlark1 · 2022-02-18T17:55:52Z

Closes #91
Closes #100

There were lots of problems in small ranges, I tried to fix them all by carefully reading the code.

Also adopt all changes we made to make it compile like setbit unsetting in tests.

We tested with all combinations HS_OPTIMIZE/DEBUG/HASWELL/SSE4.2/ARM/ASAN/MSAN/NOSANITIZERS. Haven't tested AVX512, PPC and SVE2

Resubmitting to a develop branch. I'll take a look at AVX512 failure from FAT_RUNTIME

markos · 2022-02-18T17:59:20Z

It would be great if you could also share how you enabled and tested ASAN in the builds, I would love to add this to our CI for all platforms even, it might take a bit longer, but we're in the process of adding more cores to our build farm so it will balance out.

danlark1 · 2022-02-18T18:37:50Z

-DSANITIZE=undefined
-DSANITIZE=address
-DSANITIZE=memory

to cmake command

Some might not work with the ifunc attribute so FAT_RUNTIME might be hard to test

And I tested only clang

danlark1 · 2022-02-18T19:40:49Z

I believe I fixed all tests :)

danlark1 · 2022-02-18T20:02:14Z

Overall feedback: I might guess I missed in a couple places some mask lowering because of zero byte inputs, however, in general, it took a while lot to comprehend and to match what was missing

Can you share what is the idea behind such major rewrites? I thought Vectorscan just replaced SIMD to portable ones and was very surprised to see ASAN issues and then I realized these major string search algos were rewritten.

Given hyperscan might be supported in the future, it makes new integrations much more complex and even if hyperscan decides not to support ARM for other reasons, merges will become exponentially complex. I saw just a couple optimizations which are missed in hyperscan and presented here. In the end we will have different bugs in two repos and from our view it makes just life more complex

Anyway, thanks for your work on portability, no issues with this, at least

markos · 2022-02-18T20:22:51Z

well, the idea with the refactoring is that original hyperscan is heavily tailored against Intel SIMD, using movemasks which are extremely expensive to emulate on Arm and Power. My initial fork was indeed just a SIMD port, but performance on Arm suffered heavily. After a lot of searching we decided we care only about API/ABI compatibility and would rather get better performance on other platforms even if it makes it much harder to integrate code back into Vectorscan, when Intel release a new version. In the unlikely event that Hyperscan decides to become more favourable against other architectures in the future, we will still support Vectorscan as I have found multiple places that performance would increase and code size reduced. I will just note that code size in the refactored places was reduced to almost 1/3rd of the original, and keeping more or less the same performance on Intel, but increasing performance on our Arm systems by almost 200% in some microbenchmarks. I could not do that with just SIMD intrinsic implementations without refactoring the algorithms themselves. Furthermore, SIMD engines like SVE2 require much heavier refactorings than even the current because of the major breakthroughs in the logic, especially head and tail of the loops, as they use predicates. The original code, where all the loop functions are implemented for each SIMD engine just does not scale.
Also, by being free from the limitations of the original project we are able to only keep relevant code -eg. Windows support was completely removed as unneeded and irrelevant. I'm actually pondering whether AVX512 support is actually beneficial at all, it makes the code much more complicated and so far the benefit in all but a few benchmarks has been negligible. But that's a decision for the future.
In short, expect more and heavier refactoring, however with the goal to always maintain API/ABI compatibility.

markos · 2022-02-18T20:42:35Z

Having said that, I should note that I will be adding ASAN to the CI, so such bugs should be caught early on next time. I do honestly appreciate the pointer for that and the PR of course.

danlark1 · 2022-02-18T20:59:02Z

Thanks

I have also some fuzzing targets, I will try to find all differences between hyperscan and Vectorscan in terms of output

Fix all ASAN issues in vectorscan

9af996b

Add sanitize options

b3e88e4

Fix a couple of tests

5f8729a

estebanpw mentioned this pull request Apr 12, 2022

Unexpected behavior in aarch64 #100

Closed

Palkovsky mentioned this pull request Apr 12, 2022

Different behavior on x64 and aarch64 #99

Closed

markos merged commit 5fa22e6 into VectorCamp:develop Apr 18, 2022

markos mentioned this pull request Apr 18, 2022

Sanitizers fail a lot, known? #91

Closed

deenp03 mentioned this pull request Nov 5, 2024

Correctness regression on x86 architecture #317

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix all ASAN issues in vectorscan #93

Fix all ASAN issues in vectorscan #93

danlark1 commented Feb 18, 2022 •

edited

Loading

markos commented Feb 18, 2022

danlark1 commented Feb 18, 2022 •

edited

Loading

danlark1 commented Feb 18, 2022

danlark1 commented Feb 18, 2022

markos commented Feb 18, 2022 •

edited

Loading

markos commented Feb 18, 2022

danlark1 commented Feb 18, 2022

Fix all ASAN issues in vectorscan #93

Fix all ASAN issues in vectorscan #93

Conversation

danlark1 commented Feb 18, 2022 • edited Loading

markos commented Feb 18, 2022

danlark1 commented Feb 18, 2022 • edited Loading

danlark1 commented Feb 18, 2022

danlark1 commented Feb 18, 2022

markos commented Feb 18, 2022 • edited Loading

markos commented Feb 18, 2022

danlark1 commented Feb 18, 2022

danlark1 commented Feb 18, 2022 •

edited

Loading

danlark1 commented Feb 18, 2022 •

edited

Loading

markos commented Feb 18, 2022 •

edited

Loading