Integer absolute value instructions #128

Maratyszcza · 2019-10-28T23:48:54Z

Introduction

Integer absolute value instructions are well-supported on the most popular architecture (on x86 since SSSE3, on ARM since the first version of NEON), and naturally complement floating-point absolute value instructions already existing in WebAssembly SIMD.

This PR introduce three new WebAssembly instructions for integer absolute value operations, i8x16.abs, i16x8.abs, and i32x4.abs, which operate on vectors of 8-bit, 16-bit, and 32-bit integers accordingly. 64-bit version is omitted due to lack of support in common SIMD instruction sets.

Mapping to Common Instruction Sets

This section illustrates how the new WebAssembly instructions can be lowered on common instruction sets. However, these patterns are provided only for convenience, compliant WebAssembly implementations do not have to follow the same code generation patterns.

x86/x86-64 processors with AVX instruction set

i8x16.abs
- y = i8x16.abs(x) is lowered to VPABSB xmm_y, xmm_x
i16x8.abs
- y = i16x8.abs(x) is lowered to VPABSW xmm_y, xmm_x
i32x4.abs
- y = i32x4.abs(x) is lowered to VPABSD xmm_y, xmm_x

x86/x86-64 processors with SSSE3 instruction set

i8x16.abs
- y = i8x16.abs(x) is lowered to PABSB xmm_y, xmm_x
i16x8.abs
- y = i16x8.abs(x) is lowered to PABSW xmm_y, xmm_x
i32x4.abs
- y = i32x4.abs(x) is lowered to PABSD xmm_y, xmm_x

x86/x86-64 processors with SSE2 instruction set

i8x16.abs
- x = i8x16.abs(x) is lowered to PXOR xmm_tmp, xmm_tmp + PSUBB xmm_tmp, xmm_x + PMINUB xmm_x, xmm_tmp
- y = i8x16.abs(x) is lowered to PXOR xmm_y, xmm_y + PSUBB xmm_y, xmm_x + PMINUB xmm_y, xmm_x
i16x8.abs
- x = i16x8.abs(x) is lowered to PXOR xmm_tmp, xmm_tmp + PSUBW xmm_tmp, xmm_x + PMAXSW xmm_x, xmm_tmp
- y = i16x8.abs(x) is lowered to PXOR xmm_y, xmm_y + PSUBW xmm_y, xmm_x + PMAXSW xmm_y, xmm_x
i32x4.abs
- y = i32x4.abs(x) is lowered to:
  - PXOR xmm_tmp, xmm_tmp
  - PCMPGT xmm_tmp, xmm_x
  - MOVDQA xmm_y, xmm_x
  - PXOR xmm_y, xmm_tmp
  - PSUBD xmm_y, xmm_tmp

ARM64 processors

i8x16.abs
- y = i8x16.abs(x) is lowered to ABS Vy.16B, Vx.16B
i16x8.abs
- y = i16x8.abs(x) is lowered to ABS Vy.8H, Vx.8H
i32x4.abs
- y = i32x4.abs(x) is lowered to ABS Vy.4S, Vx.4S

ARMv7 processors with NEON instruction set

i8x16.abs
- y = i8x16.abs(x) is lowered to VABS.S8 Qy, Qx
i16x8.abs
- y = i16x8.abs(x) is lowered to VABS.S16 Qy, Qx
i32x4.abs
- y = i32x4.abs(x) is lowered to VABS.S32 Qy, Qx

POWER processors with VMX (Altivec) instruction set

i8x16.abs
- y = i8x16.abs(x) is lowered to VXOR VRtmp, VRtmp, VRtmp + VSUBUBM VRtmp, VRtmp, VRx + VMAXSB VRy, VRx, VRtmp
i16x8.abs
- y = i16x8.abs(x) is lowered to VXOR VRtmp, VRtmp, VRtmp + VSUBUHM VRtmp, VRtmp, VRx + VMAXSH VRy, VRx, VRtmp
i32x4.abs
- y = i32x4.abs(x) is lowered to VXOR VRtmp, VRtmp, VRtmp + VSUBUWM VRtmp, VRtmp, VRx + VMAXSW VRy, VRx, VRtmp

MIPS processors with MSA instruction set

i8x16.abs
- y = i8x16.abs(x) is lowered to LDI.B Wtmp, 0 + ASUB_S.B Wy, Wx, Wtmp
i16x8.abs
- y = i16x8.abs(x) is lowered to LDI.H Wtmp, 0 + ASUB_S.H Wy, Wx, Wtmp
i32x4.abs
- y = i32x4.abs(x) is lowered to LDI.W Wtmp, 0 + ASUB_S.W Wy, Wx, Wtmp

Maratyszcza · 2020-01-21T07:11:34Z

The need for integer absolute value instructions was suggested by @jan-wassenberg in the recent SIMD WG sync, and in #176. As no one voiced either support nor critique for the proposal in this PR, I'd like to explicitly put it to vote:

In favor of including Integer Absolute Value ops in the current proposal, please respond with 👍
Against including Integer Absolute Value ops in the current proposal, please respond with 👎

dtig · 2020-01-21T19:56:51Z

Thanks @Maratyszcza for adding this poll, voted, but also explicitly in favor of adding this set of operations to the proposal as it benefits different sets of applications and maps to one instruction on most relevant platforms. As the votes look in favor, this will be merged after waiting for a reasonable time to vote, please respond here for any concerns/objections to including the integer value operations to the current SIMD proposal.

munrocket · 2020-01-23T13:07:46Z

Thank you for this PR.

dtig · 2020-02-07T21:23:38Z

@AlphaHot as the dissenting vote, would you like to share why you object to adding these operations to the proposal?

Summary: These were merged to the SIMD proposal in WebAssembly/simd#128. Depends on D76397 to avoid merge conflicts. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76399

Adds full support for the {i8x16,i16x8,i32x4}.abs instructions merged to the SIMD proposal in WebAssembly/simd#128 as well as the {i8x16,i16x8,i32x4}.bitmask instructions proposed in WebAssembly/simd#201.

Summary: These were merged to the SIMD proposal in WebAssembly/simd#128. Depends on D76397 to avoid merge conflicts. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76399

aqrit · 2021-02-19T13:15:28Z

The illustrated i32x4.abs lowering for SSE2 is incorrect.

(x - y) ^ y should have been (x ^ y) - y or (x + y) ^ y.

Maratyszcza · 2021-02-19T16:58:51Z

@aqrit Thanks for reporting. Fixed.

Maratyszcza mentioned this pull request Oct 29, 2019

SIMD Sync meeting 10/22/2019 Agenda #121

Closed

Maratyszcza force-pushed the integer-abs branch from 9412e76 to d85e2f5 Compare November 24, 2019 08:16

Maratyszcza force-pushed the integer-abs branch from d85e2f5 to d4e1f89 Compare January 14, 2020 18:30

Maratyszcza mentioned this pull request Jan 20, 2020

Proposal to add integer abs() #176

Closed

Maratyszcza force-pushed the integer-abs branch from d4e1f89 to 9b40fdd Compare January 21, 2020 07:03

ngzhian requested a review from dtig February 6, 2020 18:53

dtig approved these changes Feb 7, 2020

View reviewed changes

Integer absolute value instructions

e261fdd

Maratyszcza force-pushed the integer-abs branch from 9b40fdd to e261fdd Compare February 7, 2020 21:41

dtig merged commit 77e7fda into WebAssembly:master Feb 11, 2020

Honry mentioned this pull request Feb 14, 2020

[simd] Add support for integer abs ops WAVM/WAVM#258

Closed

tlively mentioned this pull request Mar 20, 2020

SIMD integer abs and bitmask instructions WebAssembly/binaryen#2703

Merged

binji mentioned this pull request Aug 31, 2020

Looking for i64.bswap in future impl WebAssembly/design#1334

Closed

Maratyszcza mentioned this pull request Sep 29, 2020

Implement _mm_abs_epi* with wasm_*_abs emscripten-core/emscripten#12372

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integer absolute value instructions #128

Integer absolute value instructions #128

Maratyszcza commented Oct 28, 2019 •

edited

Loading

Maratyszcza commented Jan 21, 2020

dtig commented Jan 21, 2020

munrocket commented Jan 23, 2020

dtig commented Feb 7, 2020

aqrit commented Feb 19, 2021

Maratyszcza commented Feb 19, 2021

Integer absolute value instructions #128

Integer absolute value instructions #128

Conversation

Maratyszcza commented Oct 28, 2019 • edited Loading

Introduction

Mapping to Common Instruction Sets

x86/x86-64 processors with AVX instruction set

x86/x86-64 processors with SSSE3 instruction set

x86/x86-64 processors with SSE2 instruction set

ARM64 processors

ARMv7 processors with NEON instruction set

POWER processors with VMX (Altivec) instruction set

MIPS processors with MSA instruction set

Maratyszcza commented Jan 21, 2020

dtig commented Jan 21, 2020

munrocket commented Jan 23, 2020

dtig commented Feb 7, 2020

aqrit commented Feb 19, 2021

Maratyszcza commented Feb 19, 2021

Maratyszcza commented Oct 28, 2019 •

edited

Loading