Optimize System.Collections.BitArray using arm64 intrinsics #33309

BruceForstall · 2020-03-06T22:30:39Z

This item tracks the conversion of the System.Collections.BitArray class to use arm64 intrinsics.

Related: #33308

Gnbrkm41 · 2020-03-13T14:51:11Z

I think I can put up a PR for this, but we seem to be lacking a couple of instructions that would be nice to have:

- Unloading values of single SIMD register where the vector contains a single element (e.g. Vector64 returned from AddAcross) to non-vector registers - vduph_laneq_s16 etc?
- Unloading values of single SIMD register to a memory location ("Store") - vst1q_s16 etc?

echesakov · 2020-03-13T17:12:04Z

> Unloading values of single SIMD register where the vector contains a single element (e.g. Vector64 returned from AddAcross) to non-vector registers - vduph_laneq_s16 etc?

I believe this should be supported via the ToScalar() method. cc @tannergooding to confirm.

> Unloading values of single SIMD register to a memory location ("Store") - vst1q_s16 etc?

I am working on these in #33461 and #33535.

Gnbrkm41 · 2020-03-13T17:55:56Z

Ah, I did not notice the ToScalar extension method 🙂. I did manage to work around it using a different approach, which is probably better.

It still would be nice to have those exposed as intrinsics for feature parity, though.

tannergooding · 2020-03-13T18:02:28Z

> It still would be nice to have those exposed as intrinsics for feature parity, though.

For some cases, there isn't a corresponding intrinsic on the native side. Where there is, we also expose it there (for example int Sse2.ConvertToInt32(Vector128<int> value) on the x86 side, which corresponds to MOVD reg/m32, xmm).
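
As a rough illustration of the two routes mentioned in this exchange, the sketch below reads a scalar out of a vector register with ToScalar() on arm64 and with Sse2.ConvertToInt32 on x86. The class and method names (ScalarExtractSketch, SumLanes, FirstLane) are hypothetical and not from the thread, and the code assumes .NET 5 or later, where these intrinsics are available. It is not the code from the eventual PR.

```csharp
using System;
using System.Runtime.Intrinsics;
using System.Runtime.Intrinsics.Arm;
using System.Runtime.Intrinsics.X86;

// Hypothetical helper class for illustration only.
public static class ScalarExtractSketch
{
    // Sums the four int lanes of a vector and returns the sum as a plain int.
    public static int SumLanes(Vector128<int> value)
    {
        if (AdvSimd.Arm64.IsSupported)
        {
            // AddAcross returns a Vector64<int> whose first element holds the sum;
            // ToScalar() reads that element out of the vector.
            return AdvSimd.Arm64.AddAcross(value).ToScalar();
        }

        // Portable fallback: add the lanes one at a time.
        int sum = 0;
        for (int i = 0; i < Vector128<int>.Count; i++)
        {
            sum += value.GetElement(i);
        }
        return sum;
    }

    // Returns the lowest lane of the vector.
    public static int FirstLane(Vector128<int> value)
    {
        if (Sse2.IsSupported)
        {
            // The x86 intrinsic named in the comment above (MOVD reg/m32, xmm).
            return Sse2.ConvertToInt32(value);
        }

        // Elsewhere, the cross-platform ToScalar() extension does the same job.
        return value.ToScalar();
    }
}
```

For example, `ScalarExtractSketch.SumLanes(Vector128.Create(1, 2, 3, 4))` takes the AddAcross path on an arm64 machine and the loop elsewhere. Both paths return the same value.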

echesakov · 2020-03-13T21:52:28Z

> It still would be nice to have those exposed as intrinsics for feature parity, though.

Similar functionality will be exposed as part of #24588; I am working on this at the moment.

echesakov · 2020-03-19T01:34:51Z

@Gnbrkm41 FYI, I just merged the Store intrinsics implementation (#33535), so you should be unblocked now.

Let me know if you need some help with running your PR on an arm64 device or collecting JIT disasm for this issue.
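
To show what the Store intrinsics make possible, here is a minimal sketch of a vectorised bitwise AND over two int arrays, the kind of loop a BitArray operation could use. It is not the code from the eventual PR. BitwiseAndSketch and AndInPlace are hypothetical names, and the code assumes .NET 5 or later with unsafe code enabled (AllowUnsafeBlocks).

```csharp
using System;
using System.Runtime.Intrinsics;
using System.Runtime.Intrinsics.Arm;

// Hypothetical helper class for illustration only.
public static class BitwiseAndSketch
{
    // ANDs each element of 'right' into the matching element of 'left'.
    public static unsafe void AndInPlace(int[] left, int[] right)
    {
        if (left.Length != right.Length)
        {
            throw new ArgumentException("Arrays must have the same length.");
        }

        int i = 0;

        if (AdvSimd.IsSupported)
        {
            fixed (int* pLeft = left, pRight = right)
            {
                // Process four ints (128 bits) per iteration.
                for (; i + Vector128<int>.Count <= left.Length; i += Vector128<int>.Count)
                {
                    Vector128<int> l = AdvSimd.LoadVector128(pLeft + i);
                    Vector128<int> r = AdvSimd.LoadVector128(pRight + i);

                    // Store writes the whole vector register back to memory.
                    AdvSimd.Store(pLeft + i, AdvSimd.And(l, r));
                }
            }
        }

        // Scalar loop: handles the tail, or everything when AdvSimd is unavailable.
        for (; i < left.Length; i++)
        {
            left[i] &= right[i];
        }
    }
}
```

The scalar loop doubles as the fallback on non-arm64 hardware, so the method produces the same result everywhere.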

BruceForstall added the arch-arm64 label Mar 6, 2020

BruceForstall added this to the 5.0 milestone Mar 6, 2020

BruceForstall assigned echesakov Mar 6, 2020

Dotnet-GitSync-Bot added the area-System.Collections and untriaged labels Mar 6, 2020

BruceForstall mentioned this issue Mar 11, 2020

Optimize library code using arm64 intrinsics #33308

Closed

Gnbrkm41 mentioned this issue Mar 19, 2020

Vectorise BitArray for ARM64 #33749

Merged

tannergooding closed this as completed in #33749 Mar 30, 2020

tannergooding removed the untriaged label Mar 30, 2020

tannergooding unassigned echesakov Mar 30, 2020

ghost locked as resolved and limited conversation to collaborators Dec 10, 2020
