Leverage experimental Extended Multiplication WAsm SIMD instructions #1202

copybara-service · 2020-12-04T20:57:01Z

Leverage experimental Extended Multiplication WAsm SIMD instructions

PiperOrigin-RevId: 345735031

omnisip · 2020-12-06T01:46:25Z

src/qs8-dwconv/gen/up16x9-minmax-wasmsimd-mul16.c

+      const v128_t vq31prod01 = wasm_i64x2_shl(vprod01, 1);
+      const v128_t vq31prod23 = wasm_i64x2_add(vprod23, vprod23);
+      const v128_t vq31prod45 = wasm_i64x2_shl(vprod45, 1);
+      const v128_t vq31prod67 = wasm_i64x2_add(vprod67, vprod67);


@Maratyszcza What benefit do you get from using wasm_i64x2_shl(..., 1) over adding the value to itself? Is it because of the latency and the ALU?

Maratyszcza · 2020-12-06 (edited)

More even distribution of uops across execution ports and shorter average latency. Older x86 CPUs (e.g. pre-Nehalem and Atom uarchs) have high latency for [V]PADDQ, so it pays off to replace additions with shifts. However, CPUs often have fewer SIMD shift ports than SIMD ALU ports, so we don't want to replace all additions with shifts, which is why the code alternates between wasm_i64x2_shl and wasm_i64x2_add.
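For readers without a WAsm SIMD toolchain, here is a minimal scalar sketch of the pattern discussed above. It is not the XNNPACK code: the `i64x2` struct and the `i64x2_shl` / `i64x2_add` helpers are hypothetical stand-ins for `v128_t`, `wasm_i64x2_shl` and `wasm_i64x2_add`, and `double_products` is an illustrative name. It only shows that both forms double each 64-bit lane, and that the two forms are interleaved so the work can be split between shift and add units.

```c
#include <stdint.h>

/* Hypothetical scalar stand-in for a 128-bit vector holding two 64-bit lanes. */
typedef struct { int64_t lane[2]; } i64x2;

/* Stand-in for wasm_i64x2_shl: shift every lane left by `count` bits.
 * The shift is done on unsigned values so that overflow wraps instead of
 * being undefined behavior. */
static i64x2 i64x2_shl(i64x2 a, uint32_t count) {
  i64x2 r;
  for (int i = 0; i < 2; i++) {
    r.lane[i] = (int64_t) ((uint64_t) a.lane[i] << (count & 63));
  }
  return r;
}

/* Stand-in for wasm_i64x2_add: lane-wise wrapping addition. */
static i64x2 i64x2_add(i64x2 a, i64x2 b) {
  i64x2 r;
  for (int i = 0; i < 2; i++) {
    r.lane[i] = (int64_t) ((uint64_t) a.lane[i] + (uint64_t) b.lane[i]);
  }
  return r;
}

/* Doubles four product vectors, alternating shift and add in the same
 * order as the diff (shl, add, shl, add). Every output is 2 * input. */
static void double_products(const i64x2 in[4], i64x2 out[4]) {
  out[0] = i64x2_shl(in[0], 1);      /* goes to a shift unit */
  out[1] = i64x2_add(in[1], in[1]);  /* goes to an add unit  */
  out[2] = i64x2_shl(in[2], 1);
  out[3] = i64x2_add(in[3], in[3]);
}
```

Calling `double_products` on any four inputs gives the same result as using only shifts or only adds; the choice between the two affects instruction scheduling, not the values.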

Leverage experimental Extended Multiplication WAsm SIMD instructions (commit f63a54a)

google-cla bot added the cla: yes label Dec 4, 2020

Maratyszcza mentioned this pull request Dec 4, 2020

Extended multiplication instructions WebAssembly/simd#376

Merged

omnisip reviewed Dec 6, 2020

copybara-service bot closed this Sep 20, 2022

copybara-service bot deleted the test_345735031 branch September 20, 2022 17:43
