
Annotate (most) Julia reference accesses as atomic unordered #36507

Merged
vtjnash merged 2 commits into master from jn/relaxed on Jul 8, 2020

Conversation

vtjnash (Member) commented Jul 1, 2020

Per discussion in #35535 (though I made a new PR, as I wanted to preserve the current state of that branch for comparison), this works towards guaranteeing that references stored in memory are valid and cannot corrupt the GC. It's free on essentially all processors; it just asks the compiler to respect the expectations we already rely on.
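
For background, the hazard being closed here is a torn pointer: if the compiler treats a reference slot as plain memory, it may split or duplicate the store, and a concurrent reader (the GC, or another thread) can then observe a half-written pointer. The closest portable analogue to the guarantee being requested is a relaxed atomic access; LLVM's unordered is weaker still, but it likewise rules out tearing. A minimal sketch in C11 atomics, with illustrative names rather than actual runtime code:

#include <stdatomic.h>

typedef struct obj obj;        /* placeholder object type */

obj *plain_field;              /* plain slot: the compiler may lower a racing store as
                                  several smaller writes, so a reader can see a torn pointer */
_Atomic(obj *) atomic_field;   /* atomic slot: every access is one indivisible operation */

void writer(obj *o) {
    plain_field = o;                                                /* data race: may tear */
    atomic_store_explicit(&atomic_field, o, memory_order_relaxed); /* never tears; implies no ordering */
}

obj *reader(void) {
    /* whatever is read here is a pointer that some writer actually stored */
    return atomic_load_explicit(&atomic_field, memory_order_relaxed);
}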

vtjnash requested review from yuyichao, JeffBezanson, and Keno on July 1, 2020 at 22:45
Keno (Member) commented Jul 1, 2020

What's the thing that relaxed guarantees? Basically just that there are no torn writes, so it doesn't matter which value you see, but it guarantees that any value you see was written at some point?

vtjnash (Member, Author) commented Jul 1, 2020

Correct: this mostly just ensures it doesn't turn into a memcpy. There's also a monotonicity promise with relaxed (once you observe a new value, it can't un-happen) that's absent from llvm::Unordered (present in Java, but absent from C). That can also make hoisting difficult (it's a stronger condition on aliasing), but our UndefVar check depends on that being prohibited already.
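
To pin down the terminology: C11/C++11 memory_order_relaxed corresponds to LLVM monotonic, which guarantees a single per-location modification order (the "can't un-happen" property above), while LLVM unordered drops that and keeps only freedom from tearing; it has no direct C-level equivalent. A small illustration of the coherence difference, with illustrative names:

#include <stdatomic.h>

static _Atomic int x;   /* some shared location */

void observer(void) {
    int a = atomic_load_explicit(&x, memory_order_relaxed);
    int b = atomic_load_explicit(&x, memory_order_relaxed);
    /* relaxed (LLVM monotonic): if `a` observed some store to x, `b` cannot observe
       an older store; the newer value cannot un-happen within this thread.
       LLVM unordered makes no such per-location promise; the only guarantee is that
       `a` and `b` are each values that some store actually wrote. */
    (void)a;
    (void)b;
}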

Keno (Member) commented Jul 1, 2020

Alright, makes sense to me. Presumably x86 memcpy could still be rep stosq, but I guess LLVM will do that if we use the right intrinsic (or maybe the loop idiom recognition pass will do the right thing already).

src/datatype.c Outdated
@@ -1057,11 +1057,12 @@ JL_DLLEXPORT jl_value_t *jl_get_nth_field_checked(jl_value_t *v, size_t i)

void set_nth_field(jl_datatype_t *st, void *v, size_t i, jl_value_t *rhs) JL_NOTSAFEPOINT
{
if (rhs == NULL) // TODO: this should be invalid, but it happens frequently in ircode.c
Member

What's the purpose of this behavior change in the context of this change? (I don't really have an independent opinion on the change, but it seems odd here.)

Member Author

We're going to potentially attempt to read the type tag (for a write barrier or to figure out how to copy the data). I suppose we should actually store the NULL, but the use case (ircode) doesn't care. It's mostly here because I had added an assert here in April, and it started failing when I tried to rebase this.

Member

In the ircode usage, is the previous field always NULL? If so, maybe assert that here?

Member Author

Yeah, it's a new object (GC hasn't run yet since allocation), otherwise we'd sometimes segfault on the write barrier a couple lines later.

Member

Alright, I'd say let's add that assert then, since that way there's at least no silent behavior change if anybody else thought they'd be clever by using this to unset a field.

Member Author

Might as well. We should probably also look into avoiding it in that file, but it's not causing problems yet.
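
For reference, the assert being agreed on above would live in set_nth_field in src/datatype.c; a rough sketch of the intent (jl_field_offset and jl_field_isptr are the existing field accessors from julia.h, and the exact form that lands may differ):

void set_nth_field(jl_datatype_t *st, void *v, size_t i, jl_value_t *rhs) JL_NOTSAFEPOINT
{
    if (rhs == NULL) {
        /* only tolerated for freshly allocated objects (the ircode.c caller): the
           pointer slot must still be unset, so this cannot be used to silently
           clear an already-set field */
        assert(!jl_field_isptr(st, i) ||
               *(jl_value_t**)((char*)v + jl_field_offset(st, i)) == NULL);
        return;
    }
    /* ... existing store and write barrier follow ... */
}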

return mark_julia_type(ctx, tbaa_decorate(tbaa_binding, ctx.builder.CreateLoad(bp)), true, (jl_value_t*)jl_any_type);
LoadInst *v = ctx.builder.CreateAlignedLoad(T_prjlvalue, bp, sizeof(void*));
v->setOrdering(AtomicOrdering::Unordered);
tbaa_decorate(tbaa_binding, v);
Member

This is technically fine since it mutates in place, but I think traditionally we've used the return value of this call, in case it ever needs to rewrite the instruction or allocate a new one.

Member Author

I think we just did that for the convenience of chaining calls, since TBAA shouldn't need to replace the instruction.

Member

I agree it shouldn't need to, as currently designed, but it's just the kind of thing that's bound to cause a use after free if we ever do need to change this. Maybe not worth defending against until we decide to rewrite the whole C++ part in Rust ;).

vtjnash (Member, Author) commented Jul 2, 2020

I'm not sure why it'd ever pick rep stosq over mov %eax, [%eax], but this just kindly asks it not to.

Keno (Member) commented Jul 2, 2020

rep stosq is faster and I think still legal here (since stosq operates on 64-bit chunks at a time).

vtjnash (Member, Author) commented Jul 2, 2020

Ah, I'll leave it entirely up to LLVM to decide then. There are usually no chunks though, just single loads (followed by UndefVar checks).

vtjnash added 2 commits July 7, 2020 15:48
This lets us say that any time you observe a pointer in memory, it is certain not to have appeared out of thin air. It may, however, require fences before it is valid to examine the type of the object. This is free on most hardware, though much weaker than what x86 promises.

And add load/store alignment annotations, because LLVM now prefers that
we try to specify those explicitly, even though it's not required.

This does not yet include correct load/store behaviors for objects with
inlined references (the recent #34126 PR).
vtjnash changed the title from "Annotate Julia reference accesses as atomic relaxed" to "Annotate (most) Julia reference accesses as atomic relaxed" on Jul 7, 2020
vtjnash changed the title from "Annotate (most) Julia reference accesses as atomic relaxed" to "Annotate (most) Julia reference accesses as atomic unordered" on Jul 8, 2020
@vtjnash vtjnash merged commit b19a357 into master Jul 8, 2020
@vtjnash vtjnash deleted the jn/relaxed branch July 8, 2020 17:48
vtjnash added a commit that referenced this pull request May 27, 2022
Followup to #36507; see the discussion there.

Also slightly weakens non-atomic pointer modification, since we
generally don't need DRF swap guarantees at all (even monotonic), only
the atomic-release property. This would correspond to atomic-unordered
failure order in many cases, but the LLVM LangRef says that this case is
"uninteresting", and thus declares it is invalid.

n.b. this still does not cover embedded references inside inlined structs
vtjnash added a commit that referenced this pull request Jun 9, 2022