
workers: initial implementation #2133

Closed
wants to merge 10 commits into from

Conversation

petkaantonov
Contributor

#1159 retargeted to master

@alubbe

alubbe commented Jul 8, 2015

Awesome, thank you for picking this up!

@petkaantonov petkaantonov added the semver-minor PRs that contain new features and should be released in the next minor version. label Jul 8, 2015
'use strict';

if (!process.features.experimental_workers) {
  throw new Error('Experimental workers are disabled');
Contributor

Nit: Strictly speaking, they are not disabled, but not enabled.

@petkaantonov
Contributor Author

Implemented process.threadId, which is always 0 for the main thread and > 0 for workers.

Implemented a data option that lets you pass initial data to the worker (process.env cannot be used since it is process-wide). The passed data is available as process.workerData inside the worker. This is needed for running fs and network tests in parallel when using workers.

Implemented an eval option, a boolean that you can set to true if you want the first argument to be evaluated as code rather than loaded from a file.
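
A rough sketch of how the data and eval options described above could be used (the require('worker') module name and the exact constructor shape are assumptions for illustration, not confirmed API):

// Hypothetical usage based on the description above.
const Worker = require('worker');

// Pass initial data instead of mutating the process-wide process.env;
// inside the worker it is available as process.workerData.
const fileWorker = new Worker('./task.js', {
  data: { testFiles: ['test-a.js', 'test-b.js'] }
});

// With eval: true, the first argument is treated as source code, not a path.
const inlineWorker = new Worker('console.log(process.workerData.answer);', {
  eval: true,
  data: { answer: 42 }
});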

@Fishrock123
Contributor

@petkaantonov "io.js master merged the required libuv fix regarding to closing stdio handles on Windows"

Could you provide a link to the libuv issue up there? :)

@brendanashworth brendanashworth added the c++ Issues and PRs that require attention from people who are familiar with C++. label Jul 8, 2015

// process-relative uptime base, initialized at start-up
static double prog_start_time;
static bool debugger_running;
// Needed for potentially non-thread-safe process-globas
Contributor

s/globas/globals

@piscisaureus
Contributor

io.js master merged the required libuv fix regarding to closing stdio handles on Windows

That happened: libuv/libuv@60e515d...c619f37. I believe a libuv release is also imminent.

@petkaantonov
Contributor Author

@piscisaureus yeah, the task means that the current deps/uv in master doesn't contain the changes

// process.isWorkerInstance
READONLY_PROPERTY(process,
"isWorkerInstance",
Boolean::New(env->isolate(), env->is_worker_instance()));
Contributor

Is there a case when isMainInstance == isWorkerInstance?

I assume no, and probably that having both is just for convenience.


If the main thread is guaranteed to have a process.threadId of 0, there is no need for process.isMainInstance or process.isWorkerInstance. Dropping them would also help reduce the API surface.

On a related point, should it be process.threadId or process.tid as discussed in the last PR?
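
A minimal sketch of that point, assuming process.threadId is guaranteed to be 0 on the main thread (the helper names are only illustrative):

// Both booleans can be derived in user land from threadId alone.
const isMainInstance = process.threadId === 0;
const isWorkerInstance = !isMainInstance;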

Contributor Author

Yeah they are for convenience and readability

Contributor

I'm fine with having both. Matches how cluster works. This is a nit, but possibly consider how to shorten the names, or maybe just swap Instance for Thread.

@petkaantonov petkaantonov force-pushed the workers-implementation branch from 24fe97b to 5574ea0 on July 10, 2015 13:18
@petkaantonov
Contributor Author

@kzc The issue you reported was actually known all along in this comment:

// Deleting WorkerContexts in response to their notification signals
// will cause use-after-free inside libuv. So the final `delete this`
// call must be made somewhere else

"somewhere else" means queuing the delete this call asynchronously on the main thread event loop. And of course this fails when the owner thread == main thread.

A significantly simpler solution (without this problem) now occurs to me: WorkerContexts to be deleted would be pushed to a global cleanup queue that is drained between event loop iterations on the main event loop.

@kzc

kzc commented Jul 10, 2015

@petkaantonov - it's been some time since I looked at your thread worker code, but I vaguely recall it already had a cleanup queue that was intended to be called asynchronously. The problem was a nested event loop handler within a dispose function. Nesting event loops is something that really should be avoided - it creates complexity, and it is difficult to reason about the ordering of events and the correctness of a solution.

@petkaantonov
Contributor Author

@kzc btw did you want to know the worker's threadId from the worker object on its owner thread as well? As in worker.threadId?

And yeah I'll change process.threadId to process.tid for better symmetry with process.pid :)

@kzc

kzc commented Jul 10, 2015

For my needs just having process.threadId is sufficient.

This brings up a good point - in your implementation, can a given worker instance potentially be scheduled on different threads during its lifetime, or are workers always pinned to a specific thread? If not pinned, the worker instance could benefit from having a unique worker id (never reused for the lifetime of the process across all workers), which is different from a threadId.

@petkaantonov
Contributor Author

A worker is exclusively tied to a thread. I am not sure what benefit there would be from being able to schedule it on different threads; it would be very complex to implement, as you would need to facilitate the ownership transfer of a V8 isolate, and so on.

However tying a worker to a specific CPU core will be possible if/when libuv merges libuv/libuv#280.

@petkaantonov
Contributor Author

The use-after-free and nested event loops should be fixed now

@kzc

kzc commented Jul 10, 2015

@petkaantonov - Just curious... instead of posting delete tasks to the main thread with QueueWorkerContextCleanup() and CleanupWorkerContexts(), why don't you delete the WorkerContext at the end of WorkerContext::RunWorkerThread() when the worker thread's event loop is guaranteed to have finished?

void WorkerContext::RunWorkerThread(void* arg) {
  WorkerContext* worker = static_cast<WorkerContext*>(arg);
  worker->Run();
  delete worker;
}

@petkaantonov
Contributor Author

After Run() completes, only the resources belonging to the worker thread have been disposed. The context is still pending disposal by the owner at that point.

@ronkorving
Contributor

Very cool stuff, but is this not going to be solved/replaced by Vats "some day"? I'm probably missing something, but hope this isn't overlooked.

@petkaantonov petkaantonov force-pushed the workers-implementation branch from cf80d53 to 1e0b6b1 on July 11, 2015 13:55
@petkaantonov
Contributor Author

You seem to imply that all strawman proposals will eventually be implemented but that is not the case.

@ronkorving
Contributor

I just assume a lot, out of ignorance :)

@kzc

kzc commented Jul 11, 2015

@petkaantonov - I tested your "Fix use-after-free" patch on Linux with valgrind as per the instructions here. It appears to work correctly.

You may consider getting rid of the async WorkerContext reaper on the main thread and adopting something like this instead, which I think is easier to understand and should put less of a burden on the main thread, since it would no longer have to poll the WorkerContext queue:

void WorkerContext::RunWorkerThread(void* arg) {
  WorkerContext* worker = static_cast<WorkerContext*>(arg);
  worker->Run();
  ...wait on a libuv condition variable signalled by 
     owner thread at end of WorkerContext::Dispose()...
  delete worker;
}

Unfortunately the Mac OS X EBADF/select problem mentioned in the last PR still exists. I think it's a libuv issue. There's also an unrelated Linux issue outlined below.

Using the latest workers implementation as of 1e0b6b1fd5fc93986d056798f47804d0a15a9bec and this patch:

--- a/test/workers/test-crypto.js
+++ b/test/workers/test-crypto.js
@@ -33,3 +33,3 @@ var tests = [

-var parallelism = 4;
+var parallelism = 8;
 var testsPerThread = Math.ceil(tests.length / parallelism);

running this command repeatedly:

./iojs --experimental-workers test/workers/test-crypto.js

on a 4 core Linux VM it experiences this error roughly once per 50 runs:

/opt/iojs-workers-implementation/test/common.js:484
  throw e;
        ^
Error: Running test/parallel/test-crypto-stream.js inside worker failed:
AssertionError: false == true
    at Decipheriv.end (/opt/iojs-workers-implementation/test/parallel/test-crypto-stream.js:52:5)
    at Decipheriv.<anonymous> (/opt/iojs-workers-implementation/test/common.js:371:15)
    at emitOne (events.js:82:20)
    at Decipheriv.emit (events.js:169:7)
    at done (_stream_transform.js:178:19)
    at _stream_transform.js:119:9
    at Decipheriv.Cipher._flush (crypto.js:160:5)
    at Decipheriv.<anonymous> (_stream_transform.js:118:12)
    at Decipheriv.g (events.js:260:16)
    at emitNone (events.js:67:13)
    at Worker.<anonymous> (/opt/iojs-workers-implementation/test/common.js:477:14)
    at emitOne (events.js:77:13)
    at Worker.emit (events.js:169:7)
    at onerror (worker.js:61:18)
    at WorkerBinding.workerContext._onmessage (worker.js:75:16)

on a 4 core Mac it experiences these errors roughly once per 20 runs:

 /opt/iojs-workers-implementation/test/common.js:484
   throw e;
         ^
 Error: Running test/parallel/test-crypto-hmac.js inside worker failed:
 Error: EBADF: bad file descriptor, close
     at Error (native)
     at Object.fs.closeSync (fs.js:518:18)
     at Object.fs.readFileSync (fs.js:445:21)
     at Object.Module._extensions..js (module.js:447:20)
     at Module.load (module.js:355:32)
     at Function.Module._load (module.js:310:12)
     at Function.Module.runMain (module.js:471:10)
     at process._runMain (node.js:68:18)
     at Worker.<anonymous> (/opt/iojs-workers-implementation/test/common.js:477:14)
     at emitOne (events.js:77:13)
     at Worker.emit (events.js:169:7)
     at onerror (worker.js:61:18)
     at WorkerBinding.workerContext._onmessage (worker.js:75:16)
 (node) crypto.createCredentials is deprecated. Use tls.createSecureContext instead.
 <Buffer 0c 1e e9 6b 67 d3 29 f7 94 26 87 51 bb 05 53 3f>
 Assertion failed: (r == 1), function uv__stream_osx_interrupt_select, file ../deps/uv/src/unix/stream.c, line 127.
 Abort trap: 6

Ignore the deprecation lines - they are of no consequence to this issue.

return nullptr;

// Allocate enough space to include the null terminator
size_t len = StringBytes::StorageSize(string_value, UTF8) + 1;
Contributor

Shouldn't StorageSize take an isolate?

@hax
Contributor

hax commented Jun 19, 2016

@isiahmeadows Workers should be in core to support transferring ArrayBuffers or other resources in Node. But I'm not sure whether this PR implements these features.

@siriux

siriux commented Jul 25, 2016

@bnoordhuis Is there any place to follow your work on integrating webworkers for v7?

@HyeonuPark

As the Atomics and SharedArrayBuffer API has landed in stage 2 of the TC39 process and V8 is implementing it, I think Node.js should have a thread API in some form as a core module, to support shared memory correctly.

#bnoordhuis have you checked that sharedmem api? Can it be possible with your implementation?

@HyeonuPark

@bnoordhuis have you checked that sharedmem api? Can it be possible with your implementation?

I just wonder why I used # instead of @ :P
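
For context, a small sketch of the shared-memory model being referred to, using only standard SharedArrayBuffer/Atomics semantics; how such a buffer would actually be handed to a Node worker is exactly what remains undecided here:

// One side allocates shared memory and creates an integer view over it.
const shared = new SharedArrayBuffer(4);
const counter = new Int32Array(shared);

// Any thread holding a reference to `shared` can update it without copying;
// Atomics makes the increment and read safe across threads.
Atomics.add(counter, 0, 1);
console.log(Atomics.load(counter, 0)); // 1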

@rvagg rvagg force-pushed the master branch 2 times, most recently from c133999 to 83c7a88 on October 18, 2016 17:01

// -p, --print


There are a few blocks that are marked as "changes" when actually only some formatting/ordering changed. Probably not a good idea to create possible merge conflicts because of indentation fixes?!

@dead-claudia

dead-claudia commented Dec 11, 2016 via email

@bnoordhuis
Member

I've been working on and off on a rewrite of this pull request but after discussion with other collaborators I've come to the conclusion that multi-threading support adds too many new failure modes for not enough benefit.

The primary motivation for the rewrite was improved IPC performance but I'm fairly confident by now that we can also accomplish that using more traditional means like shared memory and more efficient serialization.

I'll go ahead and close this. Thanks for your hard work, Petka.

@bnoordhuis bnoordhuis closed this Dec 11, 2016
@GnorTech
Member

For those who want to write Node.js code in a multithreaded program: NW.js implemented this by enabling Node.js in Web Workers: https://nwjs.io/blog/v0.18.4/

@pemrouz

pemrouz commented Jan 10, 2017

The primary motivation for the rewrite was improved IPC performance but I'm fairly confident by now that we can also accomplish that using more traditional means like shared memory and more efficient serialization.

Hi @bnoordhuis. Can I ask what the latest plan/thinking is for shared memory in Node (i.e. implement workers, somehow allow transferring SharedArrayBuffers with cluster, or a different API altogether)? The latest version seems to have SharedArrayBuffer (and Atomics), but there is currently no way to use this, iiuc? Also, what would be the best way to help out with this?

@addaleax
Member

I've come to the conclusion that multi-threading support adds too many new failure modes for not enough benefit.

Also… could you mention what exactly it is that you have discarded? Multi-threading with the full Node API available in each thread, or something more lightweight like a WebWorkers-style API?

@rsp
Contributor

rsp commented Jan 12, 2017

@addaleax I tried to summarize the state of this issue as well as the different types of concurrency and their pros and cons in the context of Node, and I also kept posting updates about this pull request (mostly thanks to comments from @matheusmoreira - thanks for that) in this answer on Stack Overflow:

Which would be better for concurrent tasks on node.js? Fibers? Web-workers? or Threads?

If anything is incorrect or outdated please let me know.

@bnoordhuis
Member

Can I ask what the latest plan/thinking is for shared memory in Node (i.e. implement workers, or somehow allow transferring SharedArrayBuffers with cluster, or different API altogether)?

Shared memory is not my primary focus right now, reducing the overhead of serializing/deserializing is. I ran a lot of benchmarks and in most non-contrived cases the overhead of converting to and from JSON is significantly greater (as in 70/30 or 80/20 splits) than sending it to another process.
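
As a rough illustration of the serialization half of that cost (a self-contained timing sketch, not the benchmark referred to above; the payload shape and iteration count are made up, and the transport cost of actually sending to another process would come on top):

// Time only the JSON encode/decode step that process-based messaging pays
// for every message.
const payload = {
  rows: Array.from({ length: 10000 }, (_, i) => ({ id: i, name: 'row-' + i }))
};

console.time('json serdes x100');
for (let i = 0; i < 100; i++) {
  JSON.parse(JSON.stringify(payload));
}
console.timeEnd('json serdes x100');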

Once I get the overhead of serdes down, I'm going to look into shared memory support. It's a minefield of platform-specific quirks and limitations so it's probably going to take a while to get it merged in libuv and iron out the bugs. If you want to help out, this is probably a good place to start.

V8 5.5 or 5.6 will make it a lot easier to do efficient serdes so that's what I'm currently waiting for.

could you mention what exactly it is that you have discarded? Multi-threading with the full Node API available in each thread, or something more lightweight like a WebWorkers-style API?

The former, the node-with-threads approach. WebWorkers-style parallelism is still an option and not terribly hard to implement, but I didn't see a point in pursuing that in core; there are already add-ons that do.

@dead-claudia

@bnoordhuis

The former, the node-with-threads approach. WebWorkers-style parallelism is still an option and not terribly hard to implement but I didn't see a point in pursuing that in core, there are already add-ons that do.

That'd be useful except none of the modules I've seen using true threads (instead of processes) actually support require in any way, which makes it way harder to scale. (More specifically, they currently can't, because there's no way to atomically modify the require-related caches via different threads. It has to be moved into C++ land for that to be possible, thanks to V8's lack of thread safety.)

@ronkorving
Contributor

@bnoordhuis

V8 5.5 or 5.6 will make it a lot easier to do efficient serdes so that's what I'm currently waiting for.

Out of curiosity, could you elaborate as to why this is?

@addaleax
Member

V8 5.5 or 5.6 will make it a lot easier to do efficient serdes so that's what I'm currently waiting for.

Out of curiosity, could you elaborate as to why this is?

I’m pretty sure Ben is referring to the added ValueSerializer and ValueDeserializer classes
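
A sketch of what that looks like from JavaScript, assuming the v8 module's serialize()/deserialize() wrappers over these classes are available (they were not at the time of this comment):

// Structured-clone-style serialization to a Buffer instead of a JSON string;
// it round-trips values JSON cannot, such as Map and Date.
const v8 = require('v8');

const value = { when: new Date(), tags: new Map([['a', 1]]) };
const encoded = v8.serialize(value);
const decoded = v8.deserialize(encoded);
console.log(decoded.tags.get('a')); // 1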

@ronkorving
Contributor

@addaleax Ah nice, a serializer that doesn't use JSON strings?

Sidenote: Something cluster's messaging could make use of too I imagine (would be nice as it's quite slow now imho).

@addaleax
Member

Ah nice, a serializer that doesn't use JSON strings?

I mean, I haven’t used it myself yet, but that’s sure what it sounds like. 😄

Something cluster's messaging could make use of too I imagine (would be nice as it's quite slow now imho).

Yeah, I had that thought, too. But as far as the current slowness is concerned: #10557 seems to fix quite a bit of that. :)

@NawarA

NawarA commented May 12, 2017

Is this still being worked on?

@jasnell
Member

jasnell commented May 12, 2017

@NawarA ... not at this time. If someone wanted to volunteer to pick it up, I think that would be welcome, but it's quite a task and there would be much to do.

@pemrouz

pemrouz commented May 21, 2017

@jasnell it would be useful if someone could set up a meta-tracking issue like #6980 to break down what is required to get this done - then volunteers can start picking pieces up.

Based on the above, it's not even clear what the desired end state is. It would be good to clarify how Node users will eventually be able to use SharedArrayBuffer (e.g. shared memory between workers? shared memory between processes? using the cluster module?).
