Add support for NEON (128-bit wide SIMD for ARM) for 32-bit architectures #21
Labels
acceptance: go ahead
Reviewed, implementation can start
area: performance
Performance improvements
contribute: simd
Requires SIMD knowledge
help wanted
External contributions welcome
type: feature
New feature or request
Milestone
Is your feature request related to a problem? Please describe.
Currently we have SIMD acceleration for x86 only. ARM has its own standardised SIMD intrinsics set, called NEON. Supporting it (starting with 32-bit ARM) would be very beneficial.
Describe the solution you'd like
This should be coordinated with #14, since both should work on 32-bit architectures. Similar interfaces would be used, only the classifier implementations would be different (since they need different instruction sets).
Additional context
Find NEON intrinsics documentation here.
The text was updated successfully, but these errors were encountered: