Add an AccessPath abstraction and formalize memory access #34126

atrick · 2020-09-30T05:37:15Z

Things that have come up recently but are somewhat blocked on this:

- Moving AccessMarkerElimination down in the pipeline
- SemanticARCOpts correctness and improvements
- AliasAnalysis improvements
- LICM performance regressions
- RLE/DSE improvements

Begin to formalize the model for valid memory access in SIL. Ignoring
ownership, every access is a def-use chain in three parts:

object root -> formal access base -> memory operation address

AccessPath abstracts over this path and standardizes the identity of a
memory access throughout the optimizer. This abstraction is the basis
for a new AccessPathVerification.

With that verification, we now have all the properties we need for the
type of analysis requires for exclusivity enforcement, but now
generalized for any memory analysis. This is suitable for an extremely
lightweight analysis with no side data structures. We currently have a
massive amount of ad-hoc memory analysis throughout SIL, which is
incredibly unmaintainable, bug-prone, and not performance-robust. We
can begin taking advantage of this verifably complete model to solve
that problem.

The properties this gives us are:

Access analysis must be complete over memory operations: every memory
operation needs a recognizable valid access. An access can be
unidentified only to the extent that it is rooted in some non-address
type and we can prove that it is at least *not* part of an access to a
nominal class or global property. Pointer provenance is also required
for future IRGen-level bitfield optimizations.

Access analysis must be complete over address users: for an identified
object root all memory accesses including subobjects must be
discoverable.

Access analysis must be symmetric: use-def and def-use analysis must
be consistent.

AccessPath is merely a wrapper around the existing accessed-storage
utilities and IndexTrieNode. Existing passes already very succesfully
use this approach, but in an ad-hoc way. With a general utility we
can:

- update passes to use this approach to identify memory access,
  reducing the space and time complexity of those algorithms.

- implement an inexpensive on-the-fly, debug mode address lifetime analysis

- implement a lightweight debug mode alias analysis

- ultimately improve the power, efficiency, and maintainability of
  full alias analysis

- make our type-based alias analysis sensistive to the access path

atrick · 2020-09-30T05:37:47Z

Discussion in #33121

atrick · 2020-09-30T05:38:22Z

@swift-ci test

atrick · 2020-09-30T05:38:37Z

@swift-ci test source compatibility

swift-ci · 2020-09-30T07:00:54Z

Build failed
Swift Test Linux Platform
Git Sha - 0fd6be8

swift-ci · 2020-09-30T09:30:22Z

Build failed
Swift Test OS X Platform
Git Sha - 0fd6be8

eeckstein

The documentation is much better now, thanks!
I still have a few comments.
And I still have to review the implementation.

docs/SILProgrammersManual.md

atrick · 2020-09-30T20:49:00Z

@swift-ci test

swift-ci · 2020-09-30T22:14:46Z

Build failed
Swift Test Linux Platform
Git Sha - 810a62047788db057dcd7770fa00c383595ce480

swift-ci · 2020-09-30T23:33:28Z

Build failed
Swift Test OS X Platform
Git Sha - 810a62047788db057dcd7770fa00c383595ce480

atrick · 2020-10-02T01:08:55Z

@swift-ci test

swift-ci · 2020-10-02T02:29:04Z

Build failed
Swift Test Linux Platform
Git Sha - 810a62047788db057dcd7770fa00c383595ce480

eeckstein · 2020-10-05T15:16:21Z

include/swift/SIL/MemAccessUtils.h

+    case AccessedStorage::Unidentified:
+      return getValue(); // Can be invalid for Unidentified storage.
+    case AccessedStorage::Global:
+      return SILValue();


Why is a global not a valid root?

eeckstein · 2020-10-05T15:18:00Z

include/swift/SIL/MemAccessUtils.h

@@ -427,18 +464,17 @@ class AccessedStorage {

  /// If this is a uniquely identified formal access, then it cannot
  /// alias with any other uniquely identified access to different storage.
-  ///
-  /// This determines whether access markers may conflict, so it cannot assume
-  /// that exclusivity is enforced.
  bool isUniquelyIdentified() const {
    switch (getKind()) {
    case Box:
    case Stack:
    case Global:
      return true;


And here: why is a global uniquely identified, while a class is not?

eeckstein · 2020-10-05T17:30:50Z

include/swift/SIL/MemAccessUtils.h

+
+  // Special-case this indirect enum pattern:
+  //   unchecked_take_enum_data_addr -> load -> project_box
+  // (the individual load and project_box are not access projections)


Are you sure you want to model indirect enum cases as address projections?
It might work, but can you prove that this does not introduce inconsistencies with the model?
It would imply to deal with potentially infinitely large projection paths.

This feels like you are adding an abstraction layer for exactly this pattern. Without this we would have the simple model of that everything which is a reference is a root object.

eeckstein · 2020-10-05T19:55:54Z

include/swift/SIL/Projection.h

@@ -136,11 +136,18 @@ static inline bool isCastProjectionKind(ProjectionKind Kind) {
 /// that immediately contains it.
 ///
 /// This lightweight utility maps a SIL address projection to an index.
+///
+/// project_box does not have a projection index. At the SIL level, the box
+/// storage is considered part of the same object as the. The box projection is


typo: "... as the."

eeckstein · 2020-10-05T20:48:51Z