Re: [whatwg] WebWorkers vs. Threads

Shannon Wed, 13 Aug 2008 11:50:48 -0700

Kristof Zelechovski wrote:

A background task invoked by setTimeout has to be split to small chunks;
_yielding_ occurs when each chunk ends (having called setTimeout to execute
the next chunk).  It is very hard to code in this way; you have to maintain
an explicit stack and create an exit/entry point at every chunk boundary.
This technique is interesting as an academic exercise only, real-world
developers will be right to stay away from it.

I'm not sure I get your meaning. If this is how current browsers implement setTimeout then how is it "academic"? Also, since nobody is talking about deprecating setTimeout, I don't see how it's relevant. Whatever happens, setTimeout remains an issue that real-world developers can't stay away from.
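For reference, here is a minimal sketch of the chunking technique described in the quoted paragraph. The names `sumInChunks`, `state` and `schedule` are made up for illustration; the `schedule` parameter exists only so the sketch can be driven by something other than setTimeout.

```javascript
// Sum an array in chunks so the event loop can run between chunks.
// `schedule` defaults to a zero-delay setTimeout (hypothetical parameter).
function sumInChunks(values, chunkSize, onDone, schedule) {
  schedule = schedule || function (fn) { setTimeout(fn, 0); };
  // The "explicit stack": everything the task must remember across chunks.
  var state = { index: 0, total: 0 };

  function runChunk() {
    var end = Math.min(state.index + chunkSize, values.length);
    while (state.index < end) {
      state.total += values[state.index];
      state.index += 1;
    }
    if (state.index < values.length) {
      schedule(runChunk);      // yield, then re-enter for the next chunk
    } else {
      onDone(state.total);     // last chunk: report the result
    }
  }
  schedule(runChunk);
}

// Usage: eventually logs "total = 15"
sumInChunks([1, 2, 3, 4, 5], 2, function (total) {
  console.log("total = " + total);
});
```

Even for a plain sum, the loop counter and the running total have to live outside the chunk function, which is the bookkeeping the quoted paragraph complains about.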

Guarding concurrent access to global variables is not enough if those
variables hold references to objects because an object can end up in a
logically inconsistent state if two threads try modifying its properties
concurrently.  The objects would have to be lockable to avoid corrupting
global state.
Even if you limit yourself to scalar variables, there is nothing to prevent
a script to define a compound state as a set of scalar variables, each one
with its own name.  While it is not a good programming practice, old code
does it a lot because it is (or was) more efficient to say 'gTransCount'
than 'gTrans.count'.
Chris

Ok, I'm clear on that; these are good arguments for providing explicit locking. I'm still not clear on how variable race conditions in multiple interleaved setTimeout chunks would be different for true threads, but I'll take your word for it that automated locking is hard or impossible to implement.
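As a minimal sketch of the kind of race under discussion, assume a read-modify-write on a shared global is split across two chunks, with the read in the first and the write in the second. The function names are hypothetical, and the chunks are called directly rather than through setTimeout so the interleaving is fixed.

```javascript
// Shared global state.
var g = { i: 0 };

// A read-modify-write split across two chunks: the read happens now,
// the write happens in the returned function (i.e. after a yield).
function startIncrement() {
  var la = g.i;                   // chunk 1: read
  return function finish() {      // chunk 2: write
    g.i = la + 1;
  };
}

// Two tasks interleave: both read before either writes.
var finishA = startIncrement();
var finishB = startIncrement();
finishA();
finishB();

console.log(g.i); // 1, not 2: one increment was lost
```

With chunks the author at least chooses where the yield points fall; a preemptive thread could be interrupted between the read and the write even when both sit on one line.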

What I really don't understand is how the WebWorkers proposal solves this. As far as I can tell it does some hand-waving with MessagePorts to pretend it goes away, but what happens when you absolutely DO need concurrent access to global variables - say for example the DOM - from multiple threads? How do you perform any sort of synchronisation?


Take the example given:
{ var la = g.i; g.i = la + 1 }


The WebWorkers implementation (scary! hide your children!!):

--- worker.js ---
updateGlobalLa = function (e) {
  var localLa = someLongRunningFunction(e.data); // assumes the message text is in e.data
  workerGlobalScope.port.postMessage("set la = " + localLa);
}
workerGlobalScope.port.addEventListener("message", updateGlobalLa, false);
workerGlobalScope.port.postMessage("get la");

--- main.js ---
// global object or variable
var la = 0;

handleMessage = function (e) {
  if (e.data.indexOf("set la = ") === 0) {       // assumes the message text is in e.data
     la = parseInt(e.data.substr("set la = ".length), 10);
  } else if (e.data === "get la") {
     worker.postMessage(la.toString());
  }
}
var worker = new Worker("worker.js");
worker.addEventListener("message", handleMessage, false);

Unlike the one-line example above, we increment the global value based on some long-running calculation on its original value (rather than just adding 1). This shows a more realistic use case for threading. Unfortunately our potentially dangerous one-liner is now an equally dangerous 18-line monster spread over 2 files, and we STILL haven't solved the issue of another worker or the main context updating 'la' between our original postMessage query and our response.
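The unsolved issue can be sketched without any real workers. Below, the main context and two workers are simulated with plain functions; `makeWorker`, `request`, `respond` and the `reply` callback are stand-ins I made up for the postMessage round trip, not part of any spec.

```javascript
// Simulated main context: owns `la` and answers string messages.
// `reply` stands in for posting a message back to the worker.
var la = 0;
function handleMessage(data, reply) {
  if (data === "get la") {
    reply(String(la));
  } else if (data.indexOf("set la = ") === 0) {
    la = parseInt(data.substr("set la = ".length), 10);
  }
}

// Simulated worker: asks for `la`, then later sends back la + delta.
function makeWorker(delta) {
  var seen = null;
  return {
    request: function () {
      handleMessage("get la", function (value) { seen = parseInt(value, 10); });
    },
    respond: function () {
      handleMessage("set la = " + (seen + delta));
    }
  };
}

var w1 = makeWorker(10);
var w2 = makeWorker(5);
w1.request();   // w1 sees 0
w2.request();   // w2 sees 0
w1.respond();   // la becomes 10
w2.respond();   // la becomes 5; w1's update is overwritten

console.log(la); // 5, not 15
```

Every individual message is handled atomically, yet the get/set pair as a whole is not, so the update from the first worker is silently lost.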

I should also point out that even this simple, naive and probably incorrect example still took me nearly 2 hours to write - largely due to the complexity of the WebWorkers spec and the lack of any decent examples. Honestly, anyone who thinks this interface is supposed to make things easier is kidding themselves.

Regardless of the kind of Getters/Setters/Managers/Whatever paradigm you use in your main thread, you can never escape the possibility that 2 workers might want exclusive access to an essential global object (e.g. a DOM node or global setting). So far I have not found any real-world programming language or hardware that can do this without some kind of side-effect or programming construct (e.g. locks, mutexes, semaphores, etc...). What WebWorkers is really doing is requiring the author to write their own.
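To give an idea of what "write their own" might look like, here is a rough sketch of a lock manager living in the main context. `LockManager`, `acquire` and `release` are hypothetical names; with real workers the `grant` callback would be a message posted back to the waiting worker, which is assumed rather than shown.

```javascript
// Hypothetical lock manager owned by the main context.
function LockManager() {
  this.holder = null;   // id of the worker currently holding the lock
  this.waiting = [];    // FIFO queue of { id, grant } requests
}
LockManager.prototype.acquire = function (id, grant) {
  if (this.holder === null) {
    this.holder = id;
    grant();                                     // lock is free: grant at once
  } else {
    this.waiting.push({ id: id, grant: grant }); // otherwise queue the request
  }
};
LockManager.prototype.release = function (id) {
  if (this.holder !== id) { throw new Error(id + " does not hold the lock"); }
  var next = this.waiting.shift();
  if (next) {
    this.holder = next.id;                       // hand the lock to the next waiter
    next.grant();
  } else {
    this.holder = null;
  }
};

// Usage: two updates to `la` that can no longer overlap.
var la = 0;
var lock = new LockManager();
lock.acquire("w1", function () {});   // w1 gets the lock at once
lock.acquire("w2", function () {      // w2 has to wait for w1
  la = la + 5;
  lock.release("w2");
});
la = la + 10;                         // w1's update, made while holding the lock
lock.release("w1");                   // w2 is granted the lock and applies its update
console.log(la);                      // 15
```

This is a mutex in all but name, rebuilt by hand on top of the message queue.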

In other words, despite all the complexity and limitations of workers, all that's actually achieved is:

a.) Synchronisation problems simply promoted to the message queue level.
b.) Decrease in performance due to horrible string-only messaging interface.
c.) Increase in browser and JavaScript bugs due to API complexity.

d.) Decrease in programmer interest in using threads (I certainly wouldn't use them in their current state).
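To illustrate point b.), a small sketch of what a string-only channel means for anything other than strings: every structured value has to be serialised on one side and parsed on the other. `encodeMessage` and `decodeMessage` are hypothetical helpers, and the use of JSON is my assumption, not something the proposal prescribes.

```javascript
// Hypothetical helpers for sending structured data over a string-only channel.
function encodeMessage(type, payload) {
  return JSON.stringify({ type: type, payload: payload });
}
function decodeMessage(text) {
  return JSON.parse(text);
}

// Sender side: the object must become a string before it can be posted.
var wire = encodeMessage("set", { name: "la", value: 42 });

// Receiver side: the string must be parsed before it can be used.
var msg = decodeMessage(wire);

console.log(typeof wire);        // "string"
console.log(msg.payload.value);  // 42
```

The encode and decode steps are paid on every message, on top of whatever the message handler itself does.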

I don't think I can stress enough how many important properties and functions of a web page are ONLY available as globals. DOM nodes, style properties, event handlers, window.status ... the list goes on. These can't be duplicated because they are properties of the page all workers are sharing. Without direct access to these the only useful thing a worker can do is "computation", or more precisely string parsing and maths. I've never seen a video encoder, physics engine, artificial intelligence or gene modeller written in JavaScript and I don't really think I ever will. Apart from being slow, there is the obvious correlation that anything that complex is:

a.) The realm of academics and science geeks using highly parallel specialist systems and languages, not web developers.
b.) Valuable enough to be commercial software - and therefore requiring protection against illicit copying (something JavaScript can't provide).


Shannon
