implement "dynamic drop" semantics using flags on the stack rather than zeroing #5016

thestinger · 2013-02-18T23:07:01Z

(Tracking issue for RFC 320.)

Original Description

The current liveness check has a concept of "alive" and "maybe dead", so I'll just describe this in terms of a hypothetical "move optimization pass" (although liveness can probably be extended with "surely dead").

Hypothetical move optimization pass:

In each scope where they exist, all variables are given an associated boolean representing whether they were certainly moved from. Moving from a variable in a scope marks it as such for that scope. This can be bubbled up if and only if the variable is also moved from in every other branch.

If the compiler can bubble this up to the scope where the variable is declared, the drop glue can be omitted and all moves from that variable do not need to zero it.

This can probably be extended to fields too, but I'm not sure on what the semantics would be.

@nikomatsakis: does this look like something that would be reasonable to implement (probably as part of liveness)? It doesn't actually need to bubble it up to be useful - even if it only worked in a local scope, it would allow the destructor of the save variable in the TreeMap split and skew functions to be omitted (since it's moved from in the same scope where it's declared) and that would be a big performance win.

The text was updated successfully, but these errors were encountered:

nikomatsakis · 2013-02-19T18:07:49Z

This seems doable and straightforward. It's basically adding another bit to track per variable ("was moved") which uses intersection on join when propagating.

However, there is another approach that @pcwalton and I have toyed with from time to time, which is saying that when you move things, the destructor for the moved thing runs "early"---basically at the point where control flow on the non-moved path (if any) rejoins the control flow from the moved path. The idea here was to remove the need to zero 100% of the time. However, I am not sure if that idea (early destructing) is such a good idea, because it seems hard to specify and predict, whereas the current semantics (destructors run at the same time unless moved) is easier.

So if we keep current semantics, then I think something like this seems like a good optimization. In principle we could do better and identify nodes in the CFG where, upon entering the node, we know the value will be moved (this is what liveness will be computing, actually). Then we could avoid zeroing as long as we dominate the node. That might be trickier though given the way failure and unrolling works, so maybe it's better to only optimize the case where the variable declaration is such a node.

graydon · 2013-03-04T21:02:49Z

http://llvm.org/docs/LangRef.html#llvm-lifetime-end-intrinsic may also be interesting

catamorphism · 2013-04-29T19:42:02Z

Nominating for milestone 1, well-defined.

graydon · 2013-05-02T17:13:59Z

Accepted for well defined. This needs to be documented.

glaebhoerl · 2013-05-27T14:57:19Z

I don't think the alternative would necessarily be harder to specify and predict. It's basically dynamic vs. static transfer of ownership. With the current rule, a variable's destructor is run at the end of the scope it was declared in, unless ownership of it was dynamically transferred to an inner scope. With the alternative rule, referring to the variable in an inner scope statically transfers ownership of it to that inner scope. The compiler already uses the latter rule to determine when a variable is legal to access, so the programmer has to think about it (if not, the compiler will make her). The alternative would make destructors follow the same rules. Currently you can have points in the code where the compiler won't let you access a variable but the variable's destructor may not have been run, which might be unintuitive itself.

nikomatsakis · 2013-05-28T13:55:17Z

Current plan (based on meeting discussion) is to have the compiler issue an error if there are ever two control flows that meet where one does a move and the other does not, so that this issue becomes a moot point. I am in the process of preparing a pull request taking some steps in this direction.

…terns into borrow checker and generalize what moves are allowed. Fixes a nasty bug or two in the pattern move checking code. Unifies dataflow code used for initialization and other things. First step towards once fns. Everybody wins. Fixes rust-lang#4384. Fixes rust-lang#4715. cc once fns (rust-lang#2202), optimizing local moves (rust-lang#5016).

nikomatsakis · 2013-11-15T10:22:27Z

I'm somewhat working on this, btw. Also, it will affect the semantics of what moves are legal, particularly around vectors, since in those cases the compiler will not be able to determine statically which indices have been moved and which have not. I imagine we'll just disallow moves from like let x = vec[y] and instead offer a method like let x = vec.move(y) where move is a fn(self) method.

nikomatsakis · 2013-11-15T10:25:14Z

Thinking a bit more, there is no need for fn(self) method. Something like pop etc will suffice, particularly combined with swap:

impl<T> ~[T] {
    fn remove(&mut self, index: uint) -> T {
        assert!(index < self.len());
        if (index != self.len() - 1) {
            self.swap(index, self.len() - 1);
        }
        self.pop();
    }
}

huonw · 2013-11-15T10:31:12Z

FWIW, that method currently exists: .swap_remove.

nikomatsakis · 2013-11-15T18:21:47Z

@huonw that's right, I knew I'd seen this somewhere :)

pnkfelix · 2013-11-28T18:19:53Z

cc me

pnkfelix · 2014-02-03T11:32:24Z

This interacts with the meaning/utility of the std::ptr::read_and_zero_ptr function, right?

I've been playing with that function recently in attempts to prototype some hypothetical finalization APIs. So I am curious to know what would replace it.

If a struct currently carries a ~T, I believe I can use read_and_zero_ptr to zero out that reference and keep that destructor for ~T (and thus for T itself) from running.

What is the replacement for that?

(Or should the struct field use Option<~T> instead of ~T and thus I would be able to swap in None to prevent a destructor for ~T from running when the struct is destroyed?)

thestinger · 2014-02-03T18:38:03Z

@pnkfelix: I haven't had any issues with avoiding a reliance on the drop flag for various smart pointers and low-level data structures.

You can store *mut T instead of ~T as the Rc<T> and Weak<T> implementations already do. They'll need to be fully migrated from ~T when they gain allocator support anyway. They can either do the memory allocation directly or use a Uniq<T, A = Heap> type.

Here's an example of a ring buffer leaving an uninitialized gap in the underlying vector:

https://github.com/thestinger/rust-core/blob/master/core/deque.rs

The trick is just preventing the underlying destructor from running. At the moment it leaves the vector length at zero, but if it did reuse the field for the container length then it would just call set_len at the end. It could transmute the vector to another type, but it's easy enough to prevent it from destroying the contained values without that.

thestinger · 2014-02-03T18:41:54Z

I do know one place where the drop flag results in a slight increase in performance. A priority queue needs to shuffle along elements in one direction. It's doing comparisons along the way, and that can unwind. So instead of just doing log2(n) shallow copies, it ends up needing to do 3 * log(n) shallow copies in order to perform swaps. It's a significant performance hit. The drop flag allows performing log2(n) shallow copies and log2(n) zeroes, which is still worse than when unwinding isn't present but certainly better than swaps.

nikomatsakis · 2014-02-03T19:15:20Z

@pnkfelix more or less what @thestinger said. I personally would be inclined to model that situation with an Option<~T> and swap it with None -- this is, after all, what the low-level code was doing, just exposed.

pnkfelix · 2014-02-03T20:50:36Z

@nikomatsakis I guess my concern was that I did not want the use of Option<~T> to leak out into client code ... the use case I am dealing with is when an object has been enqueued for finalization and I want to, for lack of a better term for now, "mess with its innards."

However, I just realized that I can probably safely transmute e.g. a ~(A, ~B)) to a ~(A, Option<~B>), and then I get the ability to do the mucking about internally, and the client who passed me the ~(A, ~B) should be none-the-wiser.

nikomatsakis · 2014-02-03T23:10:36Z

Certainly we could still have the drop glue check for null even if WE no longer zero.

-------- Original message --------
From: Felix S Klock II notifications@github.com
Date: 02/03/2014 15:50 (GMT-05:00)
To: mozilla/rust rust@noreply.github.com
Cc: Niko Matsakis niko@alum.mit.edu
Subject: Re: [rust] remove the necessity of "zeroing out" from codegen (#5016)

@nikomatsakis I guess my concern was that I did not want the use of Option<~T> to leak out into client code ... the use case I am dealing with is when an object has been enqueued for finalization and I want to, for lack of a better term for now, "mess with its innards."

However, I just realized that I can probably safely transmute e.g. a ~(A, ~B)) to a ~(A, Option<~B>), and then I get the ability to do the mucking about internally, and the client who passed me the ~(A, ~B) should be none-the-wiser.

—
Reply to this email directly or view it on GitHub.

flaper87 · 2014-02-05T23:12:19Z

cc @flaper87

bluss · 2015-06-06T15:11:33Z

Linking relevant PR: Replace zeroing-on-drop with filling-on-drop #23535

Add dropflag hints (stack-local booleans) for unfragmented paths in trans. Part of #5016. Added code to maintain these hints at runtime, and to conditionalize drop-filling and calls to destructors. In this early stage of my incremental implementation strategy, we are using hints, so we are always free to leave out a flag for a path -- then we just pass `None` as the dropflag hint in the corresponding schedule cleanup call. But, once a path has a hint, we must at least maintain it: i.e. if the hint exists, we must ensure it is never set to "moved" if the data in question might actually have been initialized. It remains sound to conservatively set the hint to "initialized" as long as the true drop-flag embedded in the value itself is up-to-date. I hope the commit series has been broken up to be readable; most of the commits in the series should build (though I did not always check this). ---- Oh, I think this technically qualifies as a: [breaking-change] because it removes drop-filling in some cases where one could previously observe it. That should only affect `unsafe` code; no safe code should be able to inspect whether the drop-fill was present or not. For an example of code that needed to change to account for this, see commit a81c24ae0216ab47df59acd724f8a33124fb6d97 (a unit test of the `move_val_init` intrinsic). I have not encountered actual code that needed to be updated to account for the change, since this should only be skipping the drop-filling on *moved* values, not on dropped one, and so even types that use `unsafe_no_drop_flag` should be unchanged by this particular PR. (Their time will come later.)

ticki · 2015-11-19T12:37:19Z

What's the state of this?

huonw · 2015-11-19T12:41:21Z

I believe it is waiting for the MIR work in the compiler to shape up some more.

jarcode-foss · 2016-03-18T06:24:15Z

Programmers should be able to mutate structure memory without possibly interfering with drop flags -- for a 'systems programming language', the current implementation does not make sense, nor is it very efficient.

Along with a few other people, I'm eager to use Rust -- but problems like this make the language less appealing.

aldanor · 2016-04-06T15:52:27Z

This issue is over three years old now and is still indeed is a pretty major pain when wrapping C libraries, effectively disallowing zero-copy conversions in a lot of cases and forcing one to jump through multiple ugly hoops.

In the cases where #[unsafe_no_drop_flag] could help today, is it wise to rely on the assumption that there will come a time where droppable objects could be #[repr(C)]?

pnkfelix · 2016-04-06T19:38:12Z

As noted by @huonw above, at this point the compiler team considers this work to be dependent on MIR. The compiler team has made some baby steps forward on dynamic drop flags, but it is not the highest priority item right now. (Examples of things of higher priority: MIR itself; incremental compilation.)

The real point is that I don't want the age of this issue to be interpreted as meaning that the Rust leam isn't ever going to do this. We have every intention of doing it. I feel silly even having to write that down, but I worry that if no one says anything, then people are going to get an incorrect impression from the comment history here.

aldanor · 2016-04-07T00:39:18Z

@pnkfelix Thanks for clarifying! Didn't mean to sound any negative, was mostly just wondering if it's wise for library developers to assume this will land in some (finite) time and make their choices accordingly :)

nikomatsakis · 2016-04-12T19:42:32Z

I expect it to land this year.

On Wed, Apr 06, 2016 at 05:39:55PM -0700, Ivan Smirnov wrote:

@pnkfelix Thanks for clarifying! Didn't mean to sound any negative, mostly just wondering if it's wise for library developers to assume this will land in some (finite) time and make their choices accordingly :)

You are receiving this because you were mentioned.
Reply to this email directly or view it on GitHub:
#5016 (comment)

[MIR] non-zeroing drop This enables non-zeroing drop through stack flags for MIR. Fixes #30380. Fixes #5016.

ticki · 2016-06-05T11:42:20Z

🎉

sorear · 2016-06-18T05:39:52Z

I have some code that spends a measurable amount of time checking for drop flags (there's an inner loop which drops an Option<Box<...>> which is virtually always None), and so I'm kind of wondering if we have a ticket for the other side of this, where we delete POST_DROP_U8 and stop generating code in drop methods to check for it. Do we have a tracking ticket for that?

Incidentally, this ticket is currently the tracking issue for the POST_DROP_U8 constant; maybe that should be repointed now that this ticket is closed?

nikomatsakis mentioned this issue Apr 30, 2013

Meta: Borrow checker rules rewrite #5074

Closed

thestinger mentioned this issue May 26, 2013

use llvm.lifetime.end to mark the end of unique pointer lifetimes #6755

Closed

bblum mentioned this issue Jun 22, 2013

Zero out only the first pointer when canceling cleanups, and don't zero out PODs #4400

Closed

Aatch mentioned this issue Jun 28, 2013

Improve LLVM IR Code generation #6819

Closed

alexcrichton mentioned this issue Aug 13, 2013

Moving out of locals still calls drop glue #8484

Closed

ghost assigned nikomatsakis Nov 15, 2013

alexcrichton mentioned this issue Jan 10, 2014

Avoid placing immutable lvalues in allocas and zeroing them to cancel cleanup. #11445

Closed

nikomatsakis mentioned this issue Feb 5, 2014

Attempting to move a non-copyable static segfaults #10577

Closed

nikomatsakis mentioned this issue Apr 1, 2014

Allow "moves" out of immutable static (turn into memcpy) #13245

Closed

SSheldon mentioned this issue May 16, 2015

Convert a slice of Ids to references at zero-cost SSheldon/rust-objc-id#1

Open

pnkfelix mentioned this issue Jun 10, 2015

Nonzeroing Move Hints (take3 branch) #26173

Merged

alexcrichton added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Aug 11, 2015

Ms2ger mentioned this issue Aug 16, 2015

Tracking: Unstable Rust feature gates used by Servo servo/servo#5286

Closed

99 tasks

eddyb mentioned this issue Jan 22, 2016

Filling drop blocks LLVM from optimizing away allocations. #31133

Closed

alexcrichton added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. B-unstable Blocker: Implemented in the nightly compiler and unstable. and removed C-enhancement Category: An issue proposing an enhancement or a PR with one. labels Mar 8, 2016

durka mentioned this issue May 14, 2016

[MIR] non-zeroing drop #33622

Merged

bors added a commit that referenced this issue May 29, 2016

Auto merge of #33622 - arielb1:elaborate-drops, r=<try>

3e35b32

[MIR] non-zeroing drop This enables non-zeroing drop through stack flags for MIR. Fixes #30380. Fixes #5016.

bors added a commit that referenced this issue May 30, 2016

Auto merge of #33622 - arielb1:elaborate-drops, r=<try>

78c5218

[MIR] non-zeroing drop This enables non-zeroing drop through stack flags for MIR. Fixes #30380. Fixes #5016.

bors added a commit that referenced this issue Jun 5, 2016

Auto merge of #33622 - arielb1:elaborate-drops, r=nikomatsakis

f97c411

[MIR] non-zeroing drop This enables non-zeroing drop through stack flags for MIR. Fixes #30380. Fixes #5016.

bors closed this as completed in #33622 Jun 5, 2016

pnkfelix mentioned this issue Jun 21, 2016

post-MIR task: remove artifacts of embedded drop flags #34398

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

implement "dynamic drop" semantics using flags on the stack rather than zeroing #5016

implement "dynamic drop" semantics using flags on the stack rather than zeroing #5016

thestinger commented Feb 18, 2013

nikomatsakis commented Feb 19, 2013

graydon commented Mar 4, 2013

catamorphism commented Apr 29, 2013

graydon commented May 2, 2013

glaebhoerl commented May 27, 2013

nikomatsakis commented May 28, 2013

nikomatsakis commented Nov 15, 2013

nikomatsakis commented Nov 15, 2013

huonw commented Nov 15, 2013

nikomatsakis commented Nov 15, 2013

pnkfelix commented Nov 28, 2013

pnkfelix commented Feb 3, 2014

thestinger commented Feb 3, 2014

thestinger commented Feb 3, 2014

nikomatsakis commented Feb 3, 2014

pnkfelix commented Feb 3, 2014

nikomatsakis commented Feb 3, 2014

flaper87 commented Feb 5, 2014

bluss commented Jun 6, 2015

ticki commented Nov 19, 2015

huonw commented Nov 19, 2015

jarcode-foss commented Mar 18, 2016

aldanor commented Apr 6, 2016 •

edited

Loading

pnkfelix commented Apr 6, 2016

aldanor commented Apr 7, 2016

nikomatsakis commented Apr 12, 2016

ticki commented Jun 5, 2016

sorear commented Jun 18, 2016

implement "dynamic drop" semantics using flags on the stack rather than zeroing #5016

implement "dynamic drop" semantics using flags on the stack rather than zeroing #5016

Comments

thestinger commented Feb 18, 2013

Original Description

Hypothetical move optimization pass:

nikomatsakis commented Feb 19, 2013

graydon commented Mar 4, 2013

catamorphism commented Apr 29, 2013

graydon commented May 2, 2013

glaebhoerl commented May 27, 2013

nikomatsakis commented May 28, 2013

nikomatsakis commented Nov 15, 2013

nikomatsakis commented Nov 15, 2013

huonw commented Nov 15, 2013

nikomatsakis commented Nov 15, 2013

pnkfelix commented Nov 28, 2013

pnkfelix commented Feb 3, 2014

thestinger commented Feb 3, 2014

thestinger commented Feb 3, 2014

nikomatsakis commented Feb 3, 2014

pnkfelix commented Feb 3, 2014

nikomatsakis commented Feb 3, 2014

flaper87 commented Feb 5, 2014

bluss commented Jun 6, 2015

ticki commented Nov 19, 2015

huonw commented Nov 19, 2015

jarcode-foss commented Mar 18, 2016

aldanor commented Apr 6, 2016 • edited Loading

pnkfelix commented Apr 6, 2016

aldanor commented Apr 7, 2016

nikomatsakis commented Apr 12, 2016

ticki commented Jun 5, 2016

sorear commented Jun 18, 2016

aldanor commented Apr 6, 2016 •

edited

Loading