-
Notifications
You must be signed in to change notification settings - Fork 229
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create an io.js streams team #97
Comments
Great initiative! I'd be more than happy to help out. |
The WHATWG streams spec is shaping up nicely; it might be good to see if @domenic would be interested in helping / peering over what we're working on here depending on his availability. |
Excited to help (though will not be around much the next week) On Wed, Dec 31, 2014, 8:23 AM Chris Dickinson notifications@github.com
|
I would definitely be happy to be involved 😃 👍 to inviting @domenic, I think that is a good move. |
👍 to @domenic |
Would be happy to help. |
I'll schedule a Hangout On Air in a week or two so we can get started, I know that a lot of people have powered down for holidays just at the moment. We can start with the initial list of people here but that doesn't mean it's fixed in stone, we really just need to get some direction and momentum. |
I am interested if the group is ready to consider some radical departures from current design, more specifically moving from a device-centric approach to a data processing approach. See nodejs/node#188 But otherwise userland is a comfortable place. |
@bjouhier I think there are some interesting ideas and discussions floating around in nodejs/node#89 & nodejs/node#188. The The main issues I see to radical API changes are that a lot of core is built on these constructs, changes to semantics could significantly affect the behaviour of these components and should be approached with caution and not on a whim (how would this implicate movements to merging upstream with joyent/node). Also despite the unstable rating there is also a large developing eco-system of streams building on top of the current implementation. IMO the current Any changes made would ideally need to be gradually, since they could have significant implications (to core and user-land). As discussed nodejs/node#89 core's main use-cases for For this reason I would advocate making However this could be a topic for further discussion. But we should discuss this in another thread. |
You cannot make breaking changes to the streams interface. WHATWG streams or what @bjouhier proposed (which seems close to simple-streams or pull-streams from creationix & dominictarr) are completely untested in production and will have all kinds of bugs. Bugs in the streams interface will ripple through everything, including TCP and HTTP. This would devastate the stability of all existing servers in node. The streams1 to streams2 migration went reasonably smoothly because it had full back compat, but that still introduced a bunch of edge cases. That being said, count me in for a streams team 👍 ! |
@sonewman The principles that I am proposing also apply to text and binary streams. There are a few text parsers and formatters in ez-streams but we also used the approach to implement a binary TCP/IP protocol converter in our ERP product (and believe me, it was a pretty hairy one). Separation of concerns applies here: it is easier to implement parsers and formatters with a simple callback API because you don't have to deal with backpressure explicitly. @Raynos I agree with you about backward compat. Given the complexity of today's streams it seems very risky to reimplement the current interface with a different approach. But streams API could also evolve with a side-by-side approach: keep the current implementation "as is" for backward compat and start fresh on the side. Saying that simple-streams, pull-streams or ez-streams will have all kinds of bugs is an easy way to rule them out. FWIW we are using ez-streams in our ERP product, which has been released commercially last June. I would not claim that it is bugfree but my experience is that bugs have been very easy to fix so far because the API is very simple. |
Any differences between io.js streams and WHATWG streams will have to be reconciled at some point. By that I do not mean that io.js streams will become WHATWG streams or vice versa, but that we need to iron out any irreconcilable differences while WHATWG streams are being standardized so that either io.js streams become a compatibility API between legacy streams and WHATWG stream or transpilers like browserify can solve this in the future. The thing I'm most worried about is a situation similar to what we have with Buffer vs TypedArray. |
@mikeal I agree completely, that seems like a perfect direction for discussion and development. |
This would be a good thing to get consensus on: nodejs/node#288 |
If we are going to be moving towards the whatwg spec, but haven't reached that yet, it sounds unstable. |
@Fishrock123 I linked that issue because people have strong opinions about this, so I thought it best to bring it to wider attention. The way I have interpreted the ideas with regards to WHATWG are that we should ensure we align internal behaviour with that of WHATWG (so there are no surprises) and discuss ideas around compatibility between the two types. There has been very little discussion about this yet, but it's important that there is collaboration and understanding of implications from all angles. |
Part of what to figure out is how close to align with whatwg streams, off the top of my head some major differences
In many ways whatwg streams are more generic than node.js streams and node streams could be built built as subclasses of whatwg streams but there would be compatibility issues with the similar but slightly different read and write methods (mainly the read method, write could return a promise if there was no callback provided). Edit: corrected some errors |
@calvinmetcalf this very useful well documented comparison. It is quite evident there are a lot of differences in API. It would be quite good to decouple the enqueue strategy from node streams. In addition since we are taking a step back from the idea of providing promise based API's for other methods, is this something we would want to consider for io.js streams? In particular it would require adding a dependency for some es6 promise polyfill library, which would be an additional load on Browerify. I also wonder what effects a promised based approach would have on performance, since core infrastructural components of io.js currently rely on streams for internals (net, http, etc). Passing internal methods to the constructor of a stream could be a simple addition. I think the fact that io.js streams having both flowing and not flowing mode in readable-streams is a great advantage, as is the cork and uncork for writables (although lacking real exposure given the lack of documentation, due to there being no v0.12 release). Could these features be implemented with WHATWG streams with a custom enqueue strategy? We could potentially alias Anyway as I said, that is some great information |
Thanks for the detailed reply, @calvinmetcalf! I've reordered your points a bit:
I think this is an improvement on Node streams -- by limiting the number of ways to interact with the API, it becomes a lot easier to make sure users are oriented properly. Right now it's possible to push into a readable stream "out-of-band," or attach the
By separating
This is one of my favorite features of WHATWG streams. The buffering strategy that's baked into Node streams is one of the hardest things to reason about for the least amount of benefit for userland streams. It exists because transitioning the state machine between reading and paused (or writing and waiting) is expensive for the kinds of streams built into node. This isn't usually the case for userland streams. What ends up happening when this concept is layered (via pipe chains) is that there's a big disconnect between what the user expects to happen and what reads and writes actually happen underneath the hood. There's no reason the stream buffering behavior shouldn't be configurable in this fashion.
This can be roughly emulated using a writableBufferingStrategy, if I understand it correctly.
Another good thing, I think. It reduces the number of "true streams" to just Readable and Writable -- Transforms and Duplexes are just objects that have-a readable and writable side. Diagnosing problems with Node's Duplex and Transform is a dizzying experience because they inherit from both Readable and Writable.
That doesn't match my understanding of
In streams2 the modes were way different -- to the point of monkeypatching the stream instance. As of streams3, the modes are unified, though they're a tri-state: paused, flowing, or "uninitialized" -- the "uninitialized" state standing in so that we can implicitly start the stream
I think it's a little unfortunate that streams expose Re: the size parameter in read -- I think that's ultimately more trouble than it's worth (due to having to slice or copy or concat buffers to fulfill the requested size). Repeated reads are probably better than single reads with buffer copies.
I think this is going to be the biggest issue for interop: the underlying communication mechanism is different. Node streams probably can't abandon events without breaking all sorts of things, so we'll have to tread very carefully when approaching this problem. |
This isn't quite accurate. pipeTo does error forwarding as well. pipeThrough(x) is literally just pipeTo(x.writable); return x.readable.
This is still a desired feature that'll definitely make it.
The current thinking for binary data is captured in https://github.com/whatwg/streams/blob/master/BinaryExtension.md |
This is answered by using the exclusive reader to get a lock during piping. Relatively recent innovation.
See the readInto for the BinaryReadableStream |
Thanks @chrisdickinson & @domenic for clearing up some of that (my knowledge of WHATWG is a little hairy). I agree on the most part with your observations. Particularly things like when calling As you say the You are correct though, I think there are challenges ahead, but I have every bit of faith that the outcome will be awesome 😸 |
Updated my previous comment to correct some errors. I don't think events vs promises are going to be as major of an issue as I think main issues that would prevent an ambidextrous stream is handling On Mon, Jan 12, 2015, 4:47 PM Domenic Denicola notifications@github.com
|
I've been watching the exclusive reader api with interest -- great work landing it. I still have the vague feeling that exposing
This looks great; I'm a big fan of the way this keeps these concerns separate from the core streams implementation. |
@domenic I am very intrigued by this, are you refering to: whatwg/streams/...experimental/readable-byte-stream.js?
@chrisdickinson I am very much in agreement with you on this. |
so looking through the code, the use of exclusive stream reader for piping is going to cause compatibility issues as piping to 2 different streams is totally valid in node (I want to say it pipes at the spead of the slowest stream but not positive) but an error in whatwg |
@sonewman not quite, check out:
@calvinmetcalf agreed, although I have heard it is "buggy" in Node. (As in, a few times I have heard of people trying that, and getting unexpected results even for common cases.) But since pipeTo has (very carefully!) been designed to only use public APIs, it can always be re-implemented to not use readers, and to not propagate errors, and anything else.
Although I can appreciate general uneasiness about exposing stream "internals" in this way, at least on the TIMTOWTDI front, pipeTo is just sugar over the more basic operations, so there's really only one way to do it plus a helper function for a common case. |
@domenic agreed it would be pretty easy to write a pipe method that acted more like nodes (didn't care about target, allowed multiple etc). |
I pipe multiple readers into one writer all the time. You have to use the |
Yeah the case we're talking about is a single readable stream into multiple writable streams. |
I do have some code that does that, I pipe data into a database, but I also pipe it into analytics, it might make sense for me to just file an issue about this and discuss it over at whatwg |
@domenic re:
Interesting – I've come at it from the opposite angle. I probably got this from min-streams, where the equivalents of I'm excited to get started! In the interest of forward motion, here are some smaller actions we can take to get the ball rolling:
|
@domenic Thanks for the links 👍 @calvinmetcalf That is interesting because if one writer applies back pressure would that apply the same buffering to all the streams being written too? @chrisdickinson they seem like reasonable preliminary tasks to me. Excitement is what we need:exclamation::boom: |
@sonewman yes it pipes at the rate of the slowest reader |
@calvinmetcalf sorry i think i missed that part in one of the above messages. |
Here's the PR backporting iojs streams. Please review! Thanks all. I think it might be a bit trickier than expected to start flowing changes from readable-stream into iojs core -- we may want to consider adopting a preprocessing step of some sort so we can conditionally select what dependencies we're bringing in. |
Hey all! As an update, the following people have been added to the io.js streams team: @iojs/streams: Thanks for your time! I will have an issue up early this coming week detailing what's coming up and we can start to divvy tasks up / talk problems out / schedule meetings if necessary/desired from there. If @rvagg mentioned you and you did not get an invite, please email me and I'd be happy to add you. To folks interested in joining this effort: the best way to do so is to contribute to this repository, iojs/io.js, or joyent/node with a focus on solving specific issues with streams. Contributions need not necessarily be code: helping debug streams issues on the platform repos is super helpful as well. If you are interested in contributing, please reach out to @rvagg or I and we would be happy to point you in the right direction. |
Updated: added @isaacs to the team. |
@chrisdickinson ⬆️ 👍 |
@chrisdickinson and I have been charged with splitting streams concerns out of io.js core such that the authoritative source of code, docs and issues for streams comes from this repo and flows back into iojs/io.js. @isaacs has been kind enough to gift this repo to the iojs org and the first order of business is to create a streams team that can be responsible for maintaining this repo, dealing with issues, updating docs, deciding the future directions for streams in io.js (and potentially joyent/node, but we'll see where that goes), and actually implementing it.
Ideally, those involved will have some clue how the current implementation (streams3 mainly) works. The biggest problem iojs (and node) faces wrt streams is the limited number of people that understand how they work, certainly most of the TC doesn't really have an in-depth knowledge.
As a starting point, I'd like to include people that have made contributions to readable-stream since they are the obvious candidates for forming a core team:
Please indicate your willingness to be included here and then we can start to move forward with some more practical concerns about how this repo & the flow of code, docs & issues is going to work.
The text was updated successfully, but these errors were encountered: