JIT: Don't use addressing modes for volatile loads for gc types #70794

EgorBo · 2022-06-15T19:27:10Z

No description provided.

ghost · 2022-06-15T19:27:22Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

null

Author:	EgorBo
Assignees:	-
Labels:	`area-CodeGen-coreclr`
Milestone:	-

EgorBo · 2022-06-15T22:40:48Z

/azp run runtime-coreclr outerloop, runtime-coreclr jitstress, runtime-coreclr jitstressregs, runtime-coreclr gcstress0x3-gcstress0xc

azure-pipelines · 2022-06-15T22:41:18Z

Azure Pipelines successfully started running 4 pipeline(s).

…ldar

EgorBo · 2022-06-17T00:19:34Z

/azp run runtime-coreclr outerloop, runtime-coreclr jitstress, runtime-coreclr jitstressregs, runtime-coreclr gcstress0x3-gcstress0xc

azure-pipelines · 2022-06-17T00:20:02Z

Azure Pipelines successfully started running 4 pipeline(s).

…ldar # Conflicts: # src/coreclr/scripts/superpmi_diffs.py

EgorBo · 2023-07-29T00:32:59Z

/azp run runtime-coreclr outerloop, runtime-coreclr jitstress, runtime-coreclr jitstressregs, runtime-coreclr gcstress0x3-gcstress0xc

EgorBo · 2023-07-29T10:06:00Z

Diff is a size regression as expected: https://dev.azure.com/dnceng-public/public/_build/results?buildId=355867&view=ms.vss-build-web.run-extensions-tab

E.g.

-            mov     w2, #0xD1FFAB1E
-            dmb     ish
-            stp     w2, w1, [x0, #0x34]
+            add     x2, x0, #52
+            mov     w3, #0xD1FFAB1E
+            stlr    w3, [x2]
+            str     w1, [x0, #0x38]
             ldp     fp, lr, [sp], #0x10
             ret     lr
-;Total bytes of code 28
+;Total bytes of code 32

We no longer can merge two stores with stp because on of the is volatile (the other is not) -- it actually smells like a potential bug in Main - should we even merge stores/load together for volatile?

Another kind of size regressions where no longer can use addressing modes becuase ldlr/stlr don't support them. This can be mitigated with #64457

But overall it should be a perf win because memory barriers are heavy.

PTAL @AndyAyersMS

AndyAyersMS

Did you get a chance to try it out on this case where it should help perf? #87194 (comment)

If not, I can do it.

EgorBo · 2023-07-29T18:57:44Z

Did you get a chance to try it out on this case where it should help perf? #87194 (comment)

If not, I can do it.

Method	Job	Toolchain	Size	Mean	Error	StdDev	Median	Min	Max	Ratio	Gen0	Gen1	Gen2	Allocated	Alloc Ratio
ConcurrentBag	Job-QLVLMF	/Core_Root/corerun	512	7.115 us	0.1261 us	0.1179 us	7.073 us	6.969 us	7.312 us	0.89	1.9496	0.9607	0.0283	16.16 KB	1.00
ConcurrentBag	Job-CQJZMM	/Core_Root_base/corerun	512	7.978 us	0.0392 us	0.0327 us	7.983 us	7.891 us	8.024 us	1.00	1.9770	0.9885	0.0319	16.16 KB	1.00

so 11% improvement on Apple M2 Max, but it's a completely different CPU from the one in the regression

AndyAyersMS · 2023-07-30T05:08:19Z

We don't have ampere data yet, but you can see this fixed the regression on the surface

DrewScoggins · 2023-08-28T07:15:58Z

Looks like this was not fixed, at least on Ubuntu Ampere hw.

EgorBo · 2023-08-28T07:56:59Z

Looks like this was not fixed, at least on Ubuntu Ampere hw.

Lots of benchmarks regressed because of this PR, but that is expected - it was a BDN issue: BDN uses volatile to store results of benchmarks in a loop so we optimized the "overhead" part of benchmarks, there was an thread discussing it somewhere..

Don't use addressing modes for volatile loads for gc types

90cb0d0

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jun 15, 2022

ghost assigned EgorBo Jun 15, 2022

EgorBo added 4 commits June 16, 2022 16:27

Merge branch 'main' of github.com:dotnet/runtime into enable-gctypes-…

5444ac2

…ldar

Merge branch 'main' of github.com:dotnet/runtime into enable-gctypes-…

c0b688e

…ldar

print perfscore

08a61db

fix assert

1f76c43

EgorBo closed this Jun 17, 2022

ghost locked as resolved and limited conversation to collaborators Jul 18, 2022

EgorBo reopened this Jul 28, 2023

Merge branch 'main' of github.com:dotnet/runtime into enable-gctypes-…

8600b4e

…ldar # Conflicts: # src/coreclr/scripts/superpmi_diffs.py

EgorBo marked this pull request as ready for review July 29, 2023 09:49

AndyAyersMS approved these changes Jul 29, 2023

View reviewed changes

EgorBo merged commit 95f3bcc into dotnet:main Jul 29, 2023

EgorBo deleted the enable-gctypes-ldar branch July 29, 2023 19:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Don't use addressing modes for volatile loads for gc types #70794

JIT: Don't use addressing modes for volatile loads for gc types #70794

EgorBo commented Jun 15, 2022

ghost commented Jun 15, 2022

EgorBo commented Jun 15, 2022

azure-pipelines bot commented Jun 15, 2022

EgorBo commented Jun 17, 2022

azure-pipelines bot commented Jun 17, 2022

EgorBo commented Jul 29, 2023

EgorBo commented Jul 29, 2023

AndyAyersMS left a comment

EgorBo commented Jul 29, 2023

AndyAyersMS commented Jul 30, 2023

DrewScoggins commented Aug 28, 2023

EgorBo commented Aug 28, 2023

JIT: Don't use addressing modes for volatile loads for gc types #70794

JIT: Don't use addressing modes for volatile loads for gc types #70794

Conversation

EgorBo commented Jun 15, 2022

ghost commented Jun 15, 2022

EgorBo commented Jun 15, 2022

azure-pipelines bot commented Jun 15, 2022

EgorBo commented Jun 17, 2022

azure-pipelines bot commented Jun 17, 2022

EgorBo commented Jul 29, 2023

EgorBo commented Jul 29, 2023

AndyAyersMS left a comment

Choose a reason for hiding this comment

EgorBo commented Jul 29, 2023

AndyAyersMS commented Jul 30, 2023

DrewScoggins commented Aug 28, 2023

EgorBo commented Aug 28, 2023