Add ARM64 encodings for group IF_SVE_GQ_3A #98352

snickolls-arm · 2024-02-13T13:26:41Z

Adds encodings for bfcvtnt, fcvtlt, fcvtnt and fcvtxnt.

bfcvtnt z3.h, p0/m, z4.s
fcvtlt z0.d, p7/m, z1.s
fcvtlt z14.s, p7/m, z20.h
fcvtnt z18.h, p3/m, z9.s
fcvtnt z12.s, p3/m, z5.d
fcvtxnt z1.s, p2/m, z3.d

Contributing towards #94549.

ghost · 2024-02-13T13:26:50Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

Author:	snickolls-arm
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

ryujit-bot · 2024-02-13T15:13:18Z

Diff results for #98352

Throughput diffs

Throughput diffs for linux/arm64 ran on linux/x64

MinOpts (-0.01% to +0.00%)

Collection	PDIFF
libraries.pmi.linux.arm64.checked.mch	-0.01%
realworld.run.linux.arm64.checked.mch	-0.01%

Details here

kunalspathak · 2024-02-13T15:29:49Z

src/coreclr/jit/emitarm64.h

@@ -827,6 +827,10 @@ static insOpts optMakeArrangement(emitAttr datasize, emitAttr elemsize);
 //    For the given 'datasize' and 'opt' returns true if it specifies a valid vector register arrangement
 static bool isValidArrangement(emitAttr datasize, insOpts opt);

+// Expands an option that has different size operands (INS_OPTS_*_TO_*) into a pair of scalable options where
+// the first describes the size of the destination operand and the second describes the size of the source operand.
+std::pair<insOpts, insOpts> optExpandConversionPair(insOpts opt);


We do not use standard library facilities like std::pair. Can you instead change it to something like:

Suggested change

- std::pair<insOpts, insOpts> optExpandConversionPair(insOpts opt);
+ void optExpandConversionPair(insOpts opt, insOpts& dst, insOpts& src);
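
A minimal sketch of what the out-parameter form could look like, for illustration only. The `insOpts` enum below is a simplified stand-in defined just for this example (the real JIT enum has many more members, and the actual body in the PR is not shown in this thread). It assumes the naming convention implied by the instructions above, where `INS_OPTS_S_TO_H` means a `.S` source converted into a `.H` destination.

```cpp
#include <cassert>

// Simplified stand-in for the JIT's insOpts enum (assumption: only the
// members needed for this illustration are listed).
enum insOpts
{
    INS_OPTS_NONE,
    INS_OPTS_SCALABLE_H,
    INS_OPTS_SCALABLE_S,
    INS_OPTS_SCALABLE_D,
    INS_OPTS_H_TO_S,
    INS_OPTS_S_TO_H,
    INS_OPTS_S_TO_D,
    INS_OPTS_D_TO_S,
};

// Expands a conversion option into destination and source element sizes,
// returning both through reference parameters instead of std::pair.
void optExpandConversionPair(insOpts opt, insOpts& dst, insOpts& src)
{
    dst = INS_OPTS_NONE;
    src = INS_OPTS_NONE;

    switch (opt)
    {
        case INS_OPTS_H_TO_S:
            dst = INS_OPTS_SCALABLE_S;
            src = INS_OPTS_SCALABLE_H;
            break;
        case INS_OPTS_S_TO_H:
            dst = INS_OPTS_SCALABLE_H;
            src = INS_OPTS_SCALABLE_S;
            break;
        case INS_OPTS_S_TO_D:
            dst = INS_OPTS_SCALABLE_D;
            src = INS_OPTS_SCALABLE_S;
            break;
        case INS_OPTS_D_TO_S:
            dst = INS_OPTS_SCALABLE_S;
            src = INS_OPTS_SCALABLE_D;
            break;
        default:
            assert(!"Unexpected conversion option");
            break;
    }
}
```

A caller would declare two locals, for example `insOpts dst, src;`, and pass them by reference; for `fcvtnt z18.h, p3/m, z9.s` the option `INS_OPTS_S_TO_H` would yield a `.H` destination and a `.S` source.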

kunalspathak · 2024-02-13T15:38:13Z

src/coreclr/jit/emitarm64.cpp

+                case INS_sve_fcvtnt:
+                    if (id->idInsOpt() == INS_OPTS_S_TO_H)
+                    {
+                        code ^= (1 << 22 | 1 << 17);


Can you instead change the line below to:

runtime/src/coreclr/jit/instrsarm64sve.h

Line 1316 in eb0c20c

// FCVTNT <Zd>.S, <Pg>/M, <Zn>.D SVE_GQ_3A 0110010011001010 101gggnnnnnddddd 64CA A000

- INST2(fcvtnt, "fcvtnt", 0, IF_SVE_2BJ, 0x64CAA000, 0x650A3C00 )
-     // FCVTNT <Zd>.S, <Pg>/M, <Zn>.D SVE_GQ_3A 0110010011001010 101gggnnnnnddddd 64CA A000
+ INST2(fcvtnt, "fcvtnt", 0, IF_SVE_2BJ, 0x6488A000, 0x650A3C00 )
+     // FCVTNT <Zd>.H, <Pg>/M, <Zn>.S SVE_GQ_3A 0110010010001000 101gggnnnnnddddd 6488 A000

and then use |:

Suggested change

- code ^= (1 << 22 | 1 << 17);
+ code |= (1 << 22 | 1 << 17);
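
The arithmetic behind this suggestion can be checked directly. The sketch below uses only the two opcode values quoted above; the constant and function names are made up for this example, and which option the real emitter tests before setting the bits is not shown in this thread. With the base opcode changed to the `.H`/`.S` form (0x6488A000), bits 22 and 17 are both clear, so OR-ing them in produces the `.S`/`.D` form (0x64CAA000) without relying on XOR to toggle bits that happen to be set.

```cpp
#include <cstdint>

// Opcode values quoted in the review comment.
constexpr uint32_t FCVTNT_H_FROM_S = 0x6488A000; // FCVTNT <Zd>.H, <Pg>/M, <Zn>.S
constexpr uint32_t FCVTNT_S_FROM_D = 0x64CAA000; // FCVTNT <Zd>.S, <Pg>/M, <Zn>.D

// The two bits that distinguish the variants.
constexpr uint32_t SIZE_BITS = (1u << 22) | (1u << 17);

// With the new base, both bits start out clear...
static_assert((FCVTNT_H_FROM_S & SIZE_BITS) == 0, "base must have bits 22 and 17 clear");
// ...so OR-ing them in gives the other variant.
static_assert((FCVTNT_H_FROM_S | SIZE_BITS) == FCVTNT_S_FROM_D, "OR from the new base");
// The original code started from the other opcode and used XOR to clear them.
static_assert((FCVTNT_S_FROM_D ^ SIZE_BITS) == FCVTNT_H_FROM_S, "XOR from the old base");

// Hypothetical helper: pick the opcode starting from the all-bits-clear base.
constexpr uint32_t fcvtntOpcode(bool sourceIsDouble)
{
    return sourceIsDouble ? (FCVTNT_H_FROM_S | SIZE_BITS) : FCVTNT_H_FROM_S;
}
```

Starting from the encoding with the variable bits clear means each variant is formed by setting bits only, which is easier to read against the bit pattern in the instruction table comment.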

kunalspathak · 2024-02-13T15:40:00Z

src/coreclr/jit/emitarm64.cpp

+                case INS_sve_fcvtlt:
+                    if (id->idInsOpt() == INS_OPTS_H_TO_S)
+                    {
+                        code ^= (1 << 22 | 1 << 17);


Likewise for fcvtlt?

kunalspathak · 2024-02-13T15:40:36Z

src/coreclr/jit/emitarm64.cpp

+        case INS_sve_fcvtxnt:
+        case INS_sve_bfcvtnt:
+            assert(isVectorRegister(reg1));    // ddddd
+            assert(isPredicateRegister(reg2)); // ggg


Suggested change

- assert(isPredicateRegister(reg2)); // ggg
+ assert(isLowPredicateRegister(reg2)); // ggg

The asserts in the other places should also use isLowPredicateRegister().
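
To illustrate why the stricter assert matters: the governing predicate is encoded in the three-bit `ggg` field, which sits at bits 12:10 according to the `101gggnnnnnddddd` pattern quoted earlier, so only P0 through P7 can be represented. The helpers below are hypothetical stand-ins that work on plain register indices (0 to 15 for P0 to P15), not the emitter's real `regNumber` type or its actual functions.

```cpp
#include <cstdint>

// Hypothetical stand-ins operating on a plain predicate index (P0..P15 -> 0..15).
constexpr bool isPredicateIndex(int p)
{
    return p >= 0 && p <= 15;
}

constexpr bool isLowPredicateIndex(int p)
{
    return p >= 0 && p <= 7;
}

// Places a predicate index into the 3-bit 'ggg' field at bits 12:10.
// Masking silently drops the top bit of P8..P15.
constexpr uint32_t encodeGgg(int p)
{
    return (static_cast<uint32_t>(p) & 0x7u) << 10;
}

// P8 passes the general predicate check but not the low-predicate check...
static_assert(isPredicateIndex(8) && !isLowPredicateIndex(8), "P8 is not a low predicate");
// ...and if it were encoded anyway it would alias P0.
static_assert(encodeGgg(8) == encodeGgg(0), "P8 aliases P0 in a 3-bit field");
```

Under these assumptions, asserting only the general predicate check would let P8 to P15 through and produce an instruction that uses the wrong register, which is the case the low-predicate assert is meant to catch.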

snickolls-arm · 2024-02-14T11:21:14Z

@a74nh @kunalspathak @dotnet/arm64-contrib

kunalspathak

LGTM. Thanks!

kunalspathak · 2024-02-14T13:45:26Z

Failures are known and superpmi-x64 is clean.

ryujit-bot · 2024-02-14T13:53:02Z

Diff results for #98352

Throughput diffs

Throughput diffs for linux/arm64 ran on windows/x64

MinOpts (-0.01% to +0.00%)

Collection	PDIFF
realworld.run.linux.arm64.checked.mch	-0.01%

Throughput diffs for osx/arm64 ran on windows/x64

MinOpts (-0.01% to +0.00%)

Collection	PDIFF
realworld.run.osx.arm64.checked.mch	-0.01%

Throughput diffs for windows/arm64 ran on windows/x64

MinOpts (-0.00% to +0.01%)

Collection	PDIFF
libraries.pmi.windows.arm64.checked.mch	+0.01%

Details here

Add ARM64 encodings for group IF_SVE_GQ_2A

081b054

dotnet-issue-labeler bot added the area-CodeGen-coreclr label (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI) Feb 13, 2024

kunalspathak mentioned this pull request Feb 13, 2024

Arm64: Implement SVE encodings #94549

Closed

kunalspathak changed the title ~~Add ARM64 encodings for group IF_SVE_GQ_2A~~ Add ARM64 encodings for group IF_SVE_GQ_3A Feb 13, 2024

kunalspathak requested changes Feb 13, 2024

kunalspathak added the arm-sve label (Work related to arm64 SVE/SVE2 support) Feb 13, 2024

build-analysis bot mentioned this pull request Feb 13, 2024

System.Net.Security.Tests.SslStreamCertificateContextOcspLinuxTests.RefreshOcspResponse_BeforeExpiration test failure #97779

Closed

snickolls-arm marked this pull request as ready for review February 14, 2024 10:50

snickolls-arm added 2 commits February 14, 2024 10:59

Merge branch 'main' into github-IF_SVE_GQ_2A

5e651cb

Address review comments

33431b8

kunalspathak approved these changes Feb 14, 2024

kunalspathak merged commit 9561bed into dotnet:main Feb 14, 2024
125 of 129 checks passed

build-analysis bot mentioned this pull request Feb 14, 2024

System.Net.Security.Tests.SslStreamCertificateContextOcspLinuxTests.FetchOcspResponse_FirstInvalidThenValid test failure #97836

Closed

github-actions bot locked and limited conversation to collaborators Mar 16, 2024
