`sched::clone` doesn't pass ownership of closure to new thread. #919

xurtis · 2018-07-04T05:17:54Z

The current implementation of of sched::clone doesn't pass ownership of the the closure called by callback into the newly created thread, only a reference. This means that when the call to clone ends, the callback is actually dropped, releasing all associated memory. This most likely means that the closure that actually gets run in the child thread is a reference to invalid memory when it is run.

Recommended fix is to unbox the CloneCb into a raw pointer and box it back up again in callback. This would mean that the closure would be dropped at the end of the callback in the child thread, rather than at the end of the call to clone.

The text was updated successfully, but these errors were encountered:

xurtis · 2018-07-04T05:18:11Z

I'll have a patch in a few minutes.

xurtis · 2018-07-04T06:58:41Z

Without rust-lang/rust#28796 being stabilised, sched::clone shouldn't support actual closures at all. It makes no sense for clone to accept a closure with references as the child thread then assumes that everything being referenced lives longer than itself and the parent thread assumes that all borrows that the child thread had were dropped when the call to clone succeeded.

In light of this, I think I'll make a patch to change clone to accept something more like what the syscall does now (i.e. fn(T) -> isize with a T passed into clone.

Fixes nix-rust#919 Annotate the correct lifetime (`'static`) for the closure transferred to the child process of the clone. Properly box the closure in the parent then rebox it in the child to ensure that the closure is dropped at the end of the invocation in the child rather than at the end of the call to `clone` in the parent. For some reason, unwrapping the boxed function itself caused all kinds of mayhem, so it instead ends up wrapped in an additional box from which it is then unwrapped.

talchas · 2021-07-28T06:08:28Z

This is still a problem, and makes clone completely broken (rather than the expected horribly unsafe because it's clone). FnBox has been fixed for a while, though it would be a breaking change to improve CloneCb to be Box<FnOnce()> instead of Box<FnMut()>. That said, fixing the other correctness issues there to make it Box<FnMut() -> isize + Send + 'static> instead would also be breaking, so might as well make it FnOnce() at the same time.

On top of that &mut [u8] is not a great choice for the stack type, as you need the borrow for up to 'static, not just for the duration of clone(), and the most obvious ways to allocate [u8; stack size] will break in practice.

asomers · 2021-08-09T21:06:06Z

@talchas would you like to try rewriting clone? I wouldn't worry about preserving backwards-compatibility; it probably can't be done in a way that also fixes clone's problems.

talchas · 2021-08-09T23:30:15Z

I'd go with

    pub fn clone<F: FnOnce() -> c_int + Send + 'static>(
        cb: F,
        stack: *mut [u8],
        flags: CloneFlags,
        signal: Option<c_int>,
    ) -> Result<Pid> {
        extern "C" fn callback<F: FnOnce() -> c_int + Send + 'static>(data: *mut c_void) -> c_int {
            let cb: Box<F> = unsafe { Box::from_raw(data as *mut F) };
            (*cb)()
        }

        let cb = Box::into_raw(Box::new(cb));
        let res = unsafe {
            let combined = flags.bits() | signal.unwrap_or(0);
            let ptr = (stack as *mut u8).add((*stack).len());
            let ptr_aligned = ptr.sub(ptr as usize % 16);
            libc::clone(
                callback::<F>,
                ptr_aligned as *mut c_void,
                combined,
                cb as *mut c_void,
            )
        };
        let res = Errno::result(res).map(Pid::from_raw);

        // In CLONE_VM we need to free the callback on both sides
        if flags.contains(CloneFlags::CLONE_VM) || res.is_err() {
            unsafe { Box::from_raw(cb); }
        }

        res
    }

but there's a lot of fairly reasonable bikeshed options: taking *mut F (and probably having users just leak it or do it wrong) or Box<FnOnce() -> c_int + Send + 'static>, maybe taking Box<[u8]> for stack. Then there's the option of not doing closures at all since they're still kinda sketchy in CLONE_VM (anything calling into the allocator in the new mostly-process is), though then the benefit vs libc is small.

asomers · 2021-08-11T00:36:38Z

Taking raw pointers isn't much better than having the user call FFI functions directly. Could stack be a &'static mut [u8] instead? Granted, it would probably have to be unsafely created. Or if you take Box<[u8]> for the stack, who would be responsible for deallocating it, the parent, the child, or both?
What's the point of taking a closure argument if it must be 'static? Is that any different than taking a plain function pointer? If there is a good reason to use a closure argument, then your Box::into_raw logic looks correct.
I think your logic for CLONE_VM is backwards. The parent should free the callback, unless that flag is set.
Finally, this function should definitely be unsafe.

talchas · 2021-08-11T00:40:42Z

Yeah, I didn't do &'static because it would need unsafe anyways, and didn't do Box<[]> because of the freeing issues, and statically allocating a buffer is maybe useful sometimes.

You still want a closure, even if it's 'static; you can move in a String/ints/anything else owned still.

Yes, I forgot which way around CLONE_VM worked, and making it unsafe is probably good too.

asomers · 2021-08-11T00:56:59Z

Hm. The child process/thread thing will never free stack, and unless CLONE_VM is set then the parent can do whatever it wants with the stack after return. So maybe it doesn't need to be 'static. Would it be better to make stack a &mut [u8], with a note in the documentation that it must outlive the child process if CLONE_VM is set? Or should we remove CLONE_VM from CloneFlags, and create a separate clone_vm function that requires a static array for stack?

talchas · 2021-08-11T01:37:25Z

Having a separate CLONE_VM certainly could be reasonable, as the mechanics around memory safety and lifetimes are completely different. For non-VM ones you can drop the 'static requirement, use &mut [u8], and use &mut cb rather than boxing it (but that one definitely should be unsafe for similar reasons to CommandExt::pre_exec).

Looking more at clone(2), I guess there's also the issue that all the SETTID/CLEARTID/PIDFD stuff causes it to write to the varargs slots which there is no way to fill in, so setting those flags has no use other than being UB. Similarly, SETTLS can't be specified, and I guess would be reason enough to make clone_vm() be unsafe as well, since tls is assumed to be set up properly for safe code.

asomers · 2021-08-11T02:07:19Z

Yeah, good catch. You should remove those flags too, or create separate functions to enable their use.

xurtis changed the title ~~std::sched::clone doesn't pass ownership of closure to new thread.~~ sched::clone doesn't pass ownership of closure to new thread. Jul 4, 2018

xurtis mentioned this issue Jul 4, 2018

Closure transfer fix for sched::clone #920

Open

This was referenced Jul 29, 2021

Using clone(2) to create the init container process segfault when build with release version of the youki binary youki-dev/youki#163

Closed

Fix how closure is transferred to the clone call. youki-dev/youki#173

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`sched::clone` doesn't pass ownership of closure to new thread. #919

`sched::clone` doesn't pass ownership of closure to new thread. #919

xurtis commented Jul 4, 2018 •

edited

Loading

xurtis commented Jul 4, 2018

xurtis commented Jul 4, 2018 •

edited

Loading

talchas commented Jul 28, 2021

asomers commented Aug 9, 2021

talchas commented Aug 9, 2021

asomers commented Aug 11, 2021

talchas commented Aug 11, 2021

asomers commented Aug 11, 2021

talchas commented Aug 11, 2021

asomers commented Aug 11, 2021

sched::clone doesn't pass ownership of closure to new thread. #919

sched::clone doesn't pass ownership of closure to new thread. #919

Comments

xurtis commented Jul 4, 2018 • edited Loading

xurtis commented Jul 4, 2018

xurtis commented Jul 4, 2018 • edited Loading

talchas commented Jul 28, 2021

asomers commented Aug 9, 2021

talchas commented Aug 9, 2021

asomers commented Aug 11, 2021

talchas commented Aug 11, 2021

asomers commented Aug 11, 2021

talchas commented Aug 11, 2021

asomers commented Aug 11, 2021

`sched::clone` doesn't pass ownership of closure to new thread. #919

`sched::clone` doesn't pass ownership of closure to new thread. #919

xurtis commented Jul 4, 2018 •

edited

Loading

xurtis commented Jul 4, 2018 •

edited

Loading