Need a solution for select / async events #6842

Closed
brson opened this issue May 30, 2013 · 32 comments

Labels
A-runtime Area: std's runtime and "pre-main" init for handling backtraces, unwinds, stack overflows P-medium Medium priority

Comments

@brson
Contributor

brson commented May 30, 2013

Pipes supports 'select', which allows one to receive on multiple ports at the same time. The new scheduler doesn't have a solution for this yet, and whatever it adopts needs to be more general than the old one.

Some types of events we might want to wait on:

  • Port receives: one-shot, stream, and other receive types
  • I/O reads of all kinds
  • Timers
  • Socket accepts

Miscellaneous notes:

The biggest requirement is that we need one abstraction that allows for waiting on both I/O events (single-threaded) and comm events (multithreaded). uv handles and comm types have very different implementations.

select may not be the right abstraction. Maybe there are more expressive ways to do this. We could for instance have an event loop with callbacks. Also look at .NET's async I/O, which I know nothing about. Needs more research.

Asynchronous I/O events should hopefully allow for all the various extension methods and decorator types that synchronous I/O allows.

The solution should additionally be workable for tasks that are not coroutines scheduled by an I/O event loop. We would like to have tasks that are backed simply by threads and don't use the I/O event loop, and that makes the design of various runtime components more difficult. It's potentially OK not to solve this yet.

Some types of events need to be cancellable as well; e.g. if you wait on both a timer and an accept and the timer goes off first, you may want to cancel the accept.
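
For the simplest of these combinations, a port receive raced against a timer, today's `std::sync::mpsc` (which did not exist in this form when this issue was filed) can express the wait as a timed receive. A minimal sketch:

```rust
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

fn main() {
    let (tx, rx) = mpsc::channel();

    // A worker task sends on the "port" some time later.
    thread::spawn(move || {
        thread::sleep(Duration::from_millis(50));
        tx.send("port event").unwrap();
    });

    // Waiting on "either the port fires or the timer goes off" collapses to
    // one call when the timer is expressed as a timeout on the receive.
    match rx.recv_timeout(Duration::from_millis(100)) {
        Ok(msg) => println!("received: {}", msg),
        Err(mpsc::RecvTimeoutError::Timeout) => println!("timer fired first"),
        Err(mpsc::RecvTimeoutError::Disconnected) => println!("sender went away"),
    }
}
```

Folding I/O sources into the same wait is the part with no equivalent here, and that gap is what the rest of this thread is about.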

@brson
Contributor Author

brson commented May 30, 2013

This will be greatly complicated by the fact that I/O types are not bound to their schedulers - so it won't be as simple as registering a uv callback.

@mneumann
Contributor

Wouldn't it be better if tasks always stayed in the scheduler they were created in? Or, if you do migrate them, also somehow migrate the task's libuv context?

@thadguidry
Contributor

This idea comes out of the data-architect world that I live in. It's along the lines of the basic idea you're pitching, which I call "receivership".

In Pentaho Data Integration, there is a process step called "Block this step until steps finish". What it does is block a single step in an execution flow until one or more selected steps elsewhere in the chain or process have completed their execution and returned true for their "execution complete" flag, which a launched listener is waiting for and watching.

It's a generic wrapper, if you will, for additional next steps to perform, but those next steps do not get called, do not get any input, and no variables are passed to them until the blocking step has heard from all the parent steps it is watching (there's a listener launched for each step chosen to watch) that their execution has finished. I think "Block this step until steps finish" just waits for all those listeners it is watching to say it's OK to proceed with its own set of next steps.

Anyway, just wanted to throw this out there... maybe this idea should be focused on another issue, feel free to move it there. Here's their Javadoc: http://javadoc.pentaho.com/kettle/org/pentaho/di/trans/steps/blockuntilstepsfinish/BlockUntilStepsFinish.html

@brson
Contributor Author

brson commented Jun 1, 2013

@mneumann I believe that it is not better for tasks to stay in the scheduler that created them. Here scheduler == thread, and under such a strategy it's easy to fail to distribute load effectively (FWIW, our current scheduler never migrates tasks and it's easy to underutilize cores; this shows up in long-running test cases that block other test cases while leaving other cores unoccupied).

The libuv context is essentially the libuv event loop, which is deeply integrated with the scheduler - it's what drives the scheduler and allows synchronous I/O to yield to other tasks - so the event loop itself cannot change threads. It may be possible in some circumstances to transplant file handles from one libuv event loop to another, but I don't think libuv currently has such functionality, and it isn't possible in all cases.

We could make a limitation that I/O handles are not sendable, but that is fairly restrictive.

@brson
Contributor Author

brson commented Jun 1, 2013

@thadguidry I responded on the list.

Putting the list link here for posterity: https://mail.mozilla.org/pipermail/rust-dev/2013-May/004305.html

@ghost ghost assigned yichoi Jun 4, 2013
@mikedilger
Contributor

I haven't used libuv, but I understand it is for platform independence. My comments will be only about Linux, as that is the only knowledge base I have worth contributing from.

In newer kernels, the different types of events have all been unified to use file descriptors and thus be "select"-able with epoll_wait(). Timers can be detected with timerfd, signals with signalfd, and network and file activity with the file descriptors you are already familiar with. Thus all these events can be handled asynchronously and non-blocking in edge-triggered mode. epoll() is thread-aware in the sense that multiple threads can wait on the same epoll instance, and the event will go to whichever one the kernel is most enamoured with ;-)

I have some very basic code (560 lines) that touches on all this if you want to see an example.
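
A minimal, Linux-only sketch of the unification described above, assuming the `libc` crate for the raw syscalls (nothing to do with the runtime under discussion): a timerfd is just another file descriptor, so a single epoll_wait() covers timers and sockets alike.

```rust
// Linux-only; assumes the `libc` crate. Error handling is elided.
fn main() {
    unsafe {
        // A timer exposed as a file descriptor (timerfd), armed to fire once
        // after one second.
        let tfd = libc::timerfd_create(libc::CLOCK_MONOTONIC, 0);
        let mut spec: libc::itimerspec = std::mem::zeroed();
        spec.it_value.tv_sec = 1;
        libc::timerfd_settime(tfd, 0, &spec, std::ptr::null_mut());

        // One epoll instance watches the timer fd; sockets, signalfds, etc.
        // would be added with the same EPOLL_CTL_ADD call.
        let epfd = libc::epoll_create1(0);
        let mut ev: libc::epoll_event = std::mem::zeroed();
        ev.events = libc::EPOLLIN as u32;
        ev.u64 = tfd as u64;
        libc::epoll_ctl(epfd, libc::EPOLL_CTL_ADD, tfd, &mut ev);

        // Block until any registered fd becomes readable.
        let mut ready: [libc::epoll_event; 8] = std::mem::zeroed();
        let n = libc::epoll_wait(epfd, ready.as_mut_ptr(), ready.len() as i32, -1);
        let token = ready[0].u64; // copied out because the struct may be packed
        println!("{} fd(s) ready, first token = {}", n, token);
    }
}
```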

@mneumann
Contributor

mneumann commented Jun 4, 2013

I think the "problem" is that libuv/libevent can only select on kernel events, while we also want to wait for port receives, which are handled purely at user level. To solve this, you have to use a regular pipe (known to the kernel) which can wake the event loop from its wait. This is how libuv implements async callbacks. Async in this context means that you invoke a callback in the same thread as the event loop, but triggered from another thread.

I don't know if you will use a separate I/O task, or just one thread that performs all I/O and also runs the tasks. In the latter case you'd only have to do an async cross-thread call when:

  1. The port receive is from a task on another scheduler/native thread and
  2. this thread is currently blocked in the event loop waiting for I/O to happen.

You can special-case 1. and schedule the receiving task directly, and handle 2. with an async callback. Actually, you can use an async callback whenever the receiving task lives in another scheduler. But I think you have to take care: if another async callback is triggered while you are running one (and, for example, scheduling another task from there), you might lose it.
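
A rough sketch of the wakeup mechanism described here, i.e. the "self-pipe" trick that libuv's async handles rely on underneath: the event loop blocks on the read end of a pipe, and any other thread can wake it by writing a dummy byte. Unix-only and assumes the `libc` crate; names and structure are illustrative only.

```rust
use std::thread;
use std::time::Duration;

fn main() {
    // Create the wakeup pipe; its read end would be registered with the
    // event loop's poll set alongside the real I/O handles.
    let mut fds = [0i32; 2];
    unsafe { libc::pipe(fds.as_mut_ptr()) };
    let (read_end, write_end) = (fds[0], fds[1]);

    // "Another scheduler/thread" signals the event loop after enqueuing a message.
    thread::spawn(move || {
        thread::sleep(Duration::from_millis(50));
        let byte = 1u8;
        unsafe { libc::write(write_end, &byte as *const u8 as *const _, 1) };
    });

    // The "event loop": block until the wakeup byte arrives, then drain it
    // and inspect a shared structure for the ports that have messages.
    let mut pfd = libc::pollfd { fd: read_end, events: libc::POLLIN, revents: 0 };
    unsafe { libc::poll(&mut pfd, 1, -1) };
    let mut buf = [0u8; 1];
    unsafe { libc::read(read_end, buf.as_mut_ptr() as *mut _, 1) };
    println!("event loop woken from another thread");
}
```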

@mneumann
Contributor

mneumann commented Jun 4, 2013

@brson: I think thread migration is a good general solution as long as you are (optionally) able to pin a task to its originating scheduler to avoid thrashing. And maybe it's good if tasks are randomly distributed at creation time to limit the work stealing.

@mneumann
Contributor

mneumann commented Jun 4, 2013

Timers, socket accepts and I/O of all kinds would be handled by libuv. The callbacks would just set a bit in the corresponding Rust I/O object and determine whether the corresponding task becomes runnable as a result. After all callbacks have been run by libuv, the scheduler would then "simply" run all runnable tasks. Port receives would be triggered (as I described above) via async callbacks. An async callback is "global" (only one per event loop), i.e. you need to keep information about which ports contain messages in another data structure, which the async callback can then inspect in order to perform the same action as the other callbacks (determine whether the corresponding task becomes runnable).
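
A rough sketch of that bookkeeping, with invented type and function names: senders record which ports have pending messages in a shared structure, and the single "global" async callback drains it to decide which tasks become runnable.

```rust
// Types and names here are invented for illustration only.
use std::collections::{HashMap, VecDeque};
use std::sync::{Arc, Mutex};

type PortId = usize;
type TaskId = usize;

#[derive(Default)]
struct AsyncState {
    ports_with_messages: VecDeque<PortId>,
    waiting_task_for_port: HashMap<PortId, TaskId>,
}

// Called by a sender (on any thread) after it enqueues a message, just
// before it triggers the single async wakeup of the event loop.
fn note_message(state: &Arc<Mutex<AsyncState>>, port: PortId) {
    state.lock().unwrap().ports_with_messages.push_back(port);
}

// The one "global" async callback per event loop: inspect the shared
// structure and return every task that has become runnable.
fn drain_async_callback(state: &Arc<Mutex<AsyncState>>) -> Vec<TaskId> {
    let mut s = state.lock().unwrap();
    let mut runnable = Vec::new();
    while let Some(port) = s.ports_with_messages.pop_front() {
        if let Some(task) = s.waiting_task_for_port.remove(&port) {
            runnable.push(task);
        }
    }
    runnable
}

fn main() {
    let state = Arc::new(Mutex::new(AsyncState::default()));
    state.lock().unwrap().waiting_task_for_port.insert(7, 42);
    note_message(&state, 7);
    assert_eq!(drain_async_callback(&state), vec![42]);
}
```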

@brson
Contributor Author

brson commented Jun 26, 2013

After further consideration with @bblum on IRC I think it may not make sense to conflate waiting on pipes with waiting on I/O - especially if it's going to cause complexity or extra overhead for either. (Rust) pipes and sockets are pretty easy to adapt to each other. In particular, for the important use case of waiting on I/O while also waiting on some other signal, I think we can use an I/O-based message abstraction, i.e. something that behaves like pipes but is backed by unix domain sockets.
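
A Unix-only sketch of that idea using today's std (which postdates this comment): a channel-like pair backed by a Unix domain socket, so the receive side is an ordinary file descriptor that any fd-based select mechanism can watch.

```rust
use std::io::{Read, Write};
use std::os::unix::net::UnixStream;
use std::thread;

fn main() -> std::io::Result<()> {
    let (mut tx, mut rx) = UnixStream::pair()?;

    thread::spawn(move || {
        // "Send" a message; on the wire it is just bytes on a socket.
        tx.write_all(b"ping").unwrap();
    });

    // The read end could instead be registered with epoll/kqueue alongside
    // TCP sockets, timers, etc.; here we just block on it directly.
    let mut buf = [0u8; 4];
    rx.read_exact(&mut buf)?;
    println!("got {:?}", std::str::from_utf8(&buf).unwrap());
    Ok(())
}
```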

@pnkfelix
Member

pnkfelix commented Jul 5, 2013

@brson Would signal handling support be considered part of the more general problem of asynchronous events described here? (I am wondering whether bugs that could be resolved via a program-specified signal handler should be considered blocked by this issue.)

@graydon
Contributor

graydon commented Jul 5, 2013

Signals are I/O in uv. So yes.

@pnkfelix
Member

A solution for this seems like it ties into our story for event handling, such as the concrete example in #2873.

Nominating.

@pnkfelix
Member

Accepted for P-backcompat-libs.

@derekchiang
Contributor

I'm not sure if anyone is still working on resolving this, but I just want to point out that currently there doesn't seem to be a way to 1. have a TcpSocket block on read, and 2. close the socket after a timeout. Ideally, you would be able to select over network events and timer-pipe events, but this can't be done atm. You would think you could run the blocking read in a separate task and have it send data across a pipe, but then, while you can indeed select over pipes, you can't terminate the task doing the blocking read, because you can't kill a task from outside. Does anyone have thoughts on how to get around this?
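
For the specific read-with-timeout case, today's std (added well after this comment) sidesteps select entirely with `set_read_timeout`; a sketch, with an illustrative address:

```rust
use std::io::Read;
use std::net::TcpStream;
use std::time::Duration;

fn read_with_timeout(addr: &str) -> std::io::Result<Vec<u8>> {
    let mut stream = TcpStream::connect(addr)?;
    stream.set_read_timeout(Some(Duration::from_secs(5)))?;

    let mut buf = vec![0u8; 1024];
    match stream.read(&mut buf) {
        Ok(n) => {
            buf.truncate(n);
            Ok(buf)
        }
        // On timeout the read fails with WouldBlock or TimedOut (platform
        // dependent); dropping `stream` then closes the socket.
        Err(e) => Err(e),
    }
}

fn main() {
    // The address is illustrative only.
    match read_with_timeout("127.0.0.1:8080") {
        Ok(data) => println!("read {} bytes", data.len()),
        Err(e) => println!("read failed or timed out: {}", e),
    }
}
```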

@rapha
Contributor

rapha commented Mar 16, 2014

Does #12855 provide a workaround here?

@ghost

ghost commented Apr 23, 2014

Apologies if this is a dumb question, but is there any particular reason why the Rust pipe cannot be a bog-standard Unix pipe with dummy writes? It seems to me that the only mechanism that could possibly satisfy all these requirements is the file descriptor.

@thestinger
Contributor

Unix pipes are very slow relative to efficient concurrent queues. Every write or read requires a system call. If you mean for signalling events, there are better ways to do that in combination with a select-like API (like epoll) on a per-platform basis like eventfd, but it's still significantly slower than a condition variable.
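
A Linux-only sketch of the eventfd signalling mentioned above, assuming the `libc` crate: one write syscall bumps a 64-bit counter, and the resulting fd can sit in the same epoll set as sockets and timers.

```rust
use std::thread;

fn main() {
    let efd = unsafe { libc::eventfd(0, 0) };

    // Signalling side: add 1 to the counter (eventfd writes must be 8 bytes).
    thread::spawn(move || {
        let one: u64 = 1;
        unsafe { libc::write(efd, &one as *const u64 as *const _, 8) };
    });

    // Waiting side: a read blocks until the counter is non-zero, then returns
    // its value and resets it. In a real event loop this fd would be
    // registered with epoll rather than read directly.
    let mut val: u64 = 0;
    unsafe { libc::read(efd, &mut val as *mut u64 as *mut _, 8) };
    println!("eventfd counter was {}", val);
}
```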

@brson
Contributor Author

brson commented Jun 9, 2014

Nominating for removal from milestone. Our strategy of keeping I/O and comm select separate, and satisfying use cases as they arise, is, I think, passable for 1.0; we can take another look at async I/O later.

@pnkfelix
Member

Removing from milestone and switching to P-high, to reflect the demotion in importance as outlined by @brson above (i.e. it will not be the end of the world if we have to solve this post-1.0).

@pnkfelix pnkfelix removed this from the 1.0 milestone Jun 12, 2014
@Ericson2314
Contributor

With #17325 could simple epoll/kqueue/select/poll wrappers be added to (the new) libnative without much difficulty? That seems like a harmless quick fix, leaving a portable interface that ideally works with rust channels too as a problem for later.

@dtantsur

I would also vote for reevaluating this. Providing a poll-like feature is essential for network programming, e.g. because tasks 1. presumably do not scale that well, and 2. do not cover some cases like timeouts.

@mhart

mhart commented Oct 28, 2014

Aw, this bummed me out 😿

Definitely feel like async I/O should be a core concern for a modern systems programming language, i.e. important enough to have a standard API in place for 1.0.

Looking at https://github.com/carllerche/mio for now (thanks @steveklabnik for the heads up), but would've loved for this to be in core
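
For reference, a minimal sketch of readiness-based polling with mio, written against a much later (0.8-era) API than the one that existed at the time of this comment, and assuming the `os-poll` and `net` features are enabled:

```rust
use mio::net::TcpListener;
use mio::{Events, Interest, Poll, Token};

const SERVER: Token = Token(0);

fn main() -> std::io::Result<()> {
    let mut poll = Poll::new()?;
    let mut events = Events::with_capacity(128);

    let mut listener = TcpListener::bind("127.0.0.1:0".parse().unwrap())?;
    poll.registry()
        .register(&mut listener, SERVER, Interest::READABLE)?;

    loop {
        // Block until the listener (or any other registered source) is ready.
        poll.poll(&mut events, None)?;
        for event in events.iter() {
            if event.token() == SERVER {
                match listener.accept() {
                    Ok((_conn, addr)) => println!("accepted connection from {}", addr),
                    // Spurious wakeups are possible; just poll again.
                    Err(ref e) if e.kind() == std::io::ErrorKind::WouldBlock => continue,
                    Err(e) => return Err(e),
                }
            }
        }
    }
}
```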

@mikedilger
Contributor

I think 1.0 just means no more code-breaking language-level changes, not necessarily "ready for mainstream usage" or "feature complete." Perhaps they could have chosen some version number less than 1.0 for such a milestone, but what number would you pick? 0.99? 0.100?

I agree on the high importance of async I/O to a systems level language.

@mhart

mhart commented Oct 29, 2014

I guess I just feel that async code tends to go beyond just the APIs; it actually affects programming idioms themselves. You'll avoid a number of inevitable religious wars over async I/O coding paradigms by bedding down a really well-sanctioned and performant standard before "releasing it to the wild", which is essentially how everyone will see 1.0, regardless of how many caveats are added that it's "only" the language and not the standard library or modules or what have you.

Also, the comparison to other languages will almost certainly follow when 1.0 is released, and if Rust can't hold its weight in developing modern network applications, then I fear it will be prematurely dismissed by many potential advocates.

I'm all for a small well-contained core set of modules a la Node.js, but to add async I/O as an afterthought in a new language just seems a bit... backwards...?

@carllerche
Member

I disagree about including something like mio in rust std for a few reasons.

  1. Leaving the library separate allows it to evolve on its own timeline. Rust, the language, is going to change on a timeline different from that of libraries (like mio). By keeping it separate, users can mix and match the various versions that suit their needs (rust 2.0 & mio 1.0, rust 1.0 & mio 2.0, etc..). Using Cargo makes including & managing dependencies very low overhead.

  2. There is no "one size fits all" concurrency paradigm (async is just one concurrency paradigm). Rust supports extremely low level use cases (down to the kernel) and is also ergonomic enough to be used for higher level use cases as well. Paradigms don't match up at these various levels. I would not implement something like haproxy the same way that I would handle processing large data sets.

Anyway, I say let libs compete, and once patterns emerge (post 1.0) then consider standardizing if there is a strong win to do so.

@mhart

mhart commented Oct 29, 2014

You can apply that argument (make it external) much more so to other modules that have made it into the core set of crates though. graphviz!? getopts? Even in std you've got LruCache, PriorityQueue, half of the std::io module could be made external – Readers/Writers/Streams already represent concurrency paradigms that have been chosen to be included.

I don't think epoll/kqueue/select/poll are too "external" – they make up the backbone of the fastest web servers and databases in use today. Java's had non-blocking I/O for 12 years now!

I'm just gonna be disappointed if/when Rust 1.0 comes out and you won't be able to write a server out of the box that can beat the pants off Java or Go. IMO the announcement of 1.0 should make it compelling for people to write systems software in, not just desktop browsers...

@dtantsur

While the argument about not moving too much into the stdlib is completely valid, I would also argue that select/epoll and non-blocking sockets should be part of the stdlib ASAP. It would be fair to say "we don't include a socket library at all", but since the stdlib does contain e.g. UdpSocket, I don't see a compelling reason not to include the other parts of a socket library.

One more thing: async I/O is something that you usually expect to have in a systems-level language. It's a good selling point (or rather, its absence is not a good selling point for a systems-level language).

@SirVer

SirVer commented Jan 20, 2015

+1. I was playing around trying to implement a simple chat server and was confused that I could not select! over socket types or streams. Having channels is only half as useful if you cannot poll on your sockets.

@lilith

lilith commented Mar 7, 2015

One of the earliest comments asked about .NET's async system. It's a callback system, like most, but there is a lot of syntactic sugar, and the 3 different systems that are exposed have different levels of overhead.

The most popular is C# async/await, which generates a closure and state machine to provide a continuation callback. Stack and thread context restoration is expensive, but you can opt out per-task.

F# has a simpler implementation, due to its functional nature and the ease of continuation passing.

The original used begin_[task]/end_[task] pairs, e.g. https://msdn.microsoft.com/en-us/library/system.io.stream.beginread(v=vs.80).aspx and https://msdn.microsoft.com/en-us/library/system.io.stream.endread(v=vs.80).aspx

I am very interested in interop between Rust and other languages, particularly F#, C#, Ruby, and Lua, and making streams interoperable between them.

Are there any traits in the standard library which will describe a (callback+token/state object)-driven stream interface?

@steveklabnik
Member

Moving to the RFCs repo, like other wishlist items: rust-lang/rfcs#1081

Jarcho pushed a commit to Jarcho/rust that referenced this issue Feb 26, 2023

Add `let_underscore_untyped`

Fixes rust-lang#6842

This adds a new pedantic `let_underscore_untyped` lint which checks for `let _ = <expr>` and suggests either providing a type annotation or removing the `let` keyword. That way the author is forced to specify the type they intended to ignore, and is thus forced to revisit the decision should the type of `<expr>` change. Alternatively, they can drop the `let` keyword to truly just ignore the value no matter what.

r? `@llogiq`

changelog: New lint: [let_underscore_untyped]
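
A small illustration of what the lint flags and the suggested fixes, with a hypothetical function name:

```rust
// `read_config` is a hypothetical function used only for illustration.
fn read_config() -> String {
    String::from("...")
}

fn main() {
    let _ = read_config();          // flagged by let_underscore_untyped
    let _: String = read_config();  // suggested fix: annotate the ignored type
    read_config();                  // alternative: drop `let _` entirely
}
```
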
flip1995 pushed a commit to flip1995/rust that referenced this issue Mar 10, 2023
Downgrade let_underscore_untyped to restriction

From reading rust-lang#6842 I am not convinced of the cost/benefit of this lint even as a pedantic lint.

It sounds like the primary motivation was to catch cases of `fn() -> Result` being changed to `async fn() -> Result`. If the original Result was ignored by a `let _`, then the compiler wouldn't guide you to add `.await`. **However, this situation is caught in a more specific way by [let_underscore_future](https://rust-lang.github.io/rust-clippy/master/index.html#let_underscore_future) which was introduced _after_ the original suggestion (rust-lang#9760).**

In rust-lang#10410 it was mentioned twice that a <kbd>restriction</kbd> lint might be more appropriate for let_underscore_untyped.

changelog: Moved [`let_underscore_untyped`] to restriction