Apologies to David Ungar - should have had another cup of coffee before
sending this. I went back to the original post that started this thread
(pointing to a talk by David Ungar), then promptly mixed up David Ungar
and David Barbour in my thinking. Oops. Apologies. The argument remains
the same, but it's with David Barbour....
Miles Fidelman wrote:
David Barbour wrote:
Your approach to parallelism strikes me as simplistic. Like saying
Earth is in center of Solar system. Sun goes around Earth. It sounds
simple. It's "easy to conceptualize". Oh, and it requires epicyclic
orbits to account for every other planet. Doesn't sound so simple
anymore. Like this, simplistic becomes a complexity multiplier in
disguise.
You propose actor per object. It sounds simple to you, and "easy to
conceptualize". But now programmers have challenges to control
latency, support replay, testing, maintenance, verification,
consistency. This is in addition to problems hand-waved through like
line-of-sight and collision detection. It doesn't sound so simple
anymore.
The whole point of architecture is to generate the overall outline of
a system, to address a particular problem space within the constraints
at hand. The KISS principle applies (along with "seek simplicity and
distrust it"). If there isn't a degree of simplicity and elegance in
an architecture, the architect hasn't done a particularly good job.
In the past, limitations of hardware, languages, and run-time
environments have dictated against taking parallel (or more
accurately, concurrent) approaches to problems, even when massive
concurrency is the best mapping onto the problem domain - resulting in
very ugly code.
Yes, there are additional problems introduced by modeling a problem as
massively concurrent - and those, I think, are areas for fruitful
research. In particular, re. the ones you cite:
- control latency, support replay, testing, maintenance, verification:
these are nothing new at the systems level (think about either all the
different things that run on a common server, or about all the things
that go on in a distributed system such as the federated collection of
SMTP servers that we're relying on right now)
- consistency: isn't your message "Avoid the Concurrency Trap by
Embracing Non-Determinism"? -- isn't a key question what it means to
"embrace non-determinism," and how to design systems in an inherently
indeterminate environment? (more below)
- now line-of-sight and collision detection, which are more specific
to the simulation domain, are interesting in two regards:
-- collision detection (and weapons effects) are easy if you allow
actors to determine "I'm hit" for themselves, not so easy if you want
independent verification by a referee or a physical environment model
- the latter pretty much requires some kind of serialization, and the
question becomes how
-- line-of-sight calculations are the bane of simulators - right now,
the practice is for each entity to do its own line-of-sight
calculations (doesn't matter if it's an object that is invoked by a
control thread, or an asynchronous actor) - each entity takes a look
around (scans a database) to determine who it can see (and be seen
by), who it can't, what's blocking its view of other objects, etc. --
very compute intensive, and where coders spend a LOT of time
optimizing (a CGF has to do this 20 times a second or more; GIS
systems tend to take 30 seconds to several minutes to do the same
thing -- when I was in the simulation business, I sat in several
rather amusing meetings, watching coders from a well-known GIS firm
as their jaws dropped when told how fast our stuff did line-of-sight
calculations). I expect that there are some serious efficiencies to
be gained by performing LOS calculations from a global perspective,
and that these can benefit from massive parallelism - I expect
there's work on ray tracing and rendering that applies - but that
gets pretty far afield from my own experience.
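To make the per-entity scan concrete, here's a toy sketch in Python of
the kind of check each entity runs against each other entity - a
Bresenham walk across a 2D occupancy grid. The grid, coordinates, and
function name are illustrative, not from any real CGF; a real one runs
this over every pair of entities, 20+ times a second, which is exactly
why it's such an optimization sink.

```python
# Toy line-of-sight check on a 2D occupancy grid (illustrative only).
# Walks the cells between a and b with Bresenham's line algorithm and
# fails if any intermediate cell is marked as blocking.
def line_of_sight(grid, a, b):
    """True if no blocking cell lies between a and b (exclusive of a)."""
    (x0, y0), (x1, y1) = a, b
    dx, dy = abs(x1 - x0), -abs(y1 - y0)
    sx = 1 if x0 < x1 else -1
    sy = 1 if y0 < y1 else -1
    err = dx + dy
    while (x0, y0) != (x1, y1):
        # Check every cell after the starting one for an obstruction.
        if (x0, y0) != a and grid[y0][x0]:
            return False
        e2 = 2 * err
        if e2 >= dy:
            err += dy
            x0 += sx
        if e2 <= dx:
            err += dx
            y0 += sy
    return True

grid = [[0, 0, 0, 0],
        [0, 0, 1, 0],   # a wall cell at (2, 1)
        [0, 0, 0, 0]]
print(line_of_sight(grid, (0, 1), (3, 1)))  # wall in the way: False
print(line_of_sight(grid, (0, 0), (3, 0)))  # clear row: True
```

With n entities each scanning all the others, that's O(n^2) walks per
tick - the kind of cost a global, parallel pass might amortize.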
The old sequential model, or even the pipeline technique I suggest,
do not contradict the known, working structure for consistency.
But is consistency the issue at hand?
This line of conversation goes back to a comment that the limits to
exploiting parallelism come down to people thinking sequentially, and
inherent complexity of designing parallel algorithms. I argue that
quite a few problems are more easily viewed through the lens of
concurrency - using network protocols and military simulation as
examples that I'm personally familiar with.
You seem to be making the case for sequential techniques that maintain
consistency. But is that really the question? This entire thread
started with a posting about a paper you were giving on Project
Renaissance - that contained two points that stood out to me:
"If we cannot skirt Amdahl’s Law, the last 900 cores will do us no
good whatsoever. What does this mean? We cannot afford even tiny
amounts of serialization."
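The arithmetic behind that quote is easy to check. A back-of-envelope
Amdahl's Law calculation (the core counts and serial fractions below
are just examples):

```python
# Amdahl's Law: speedup on n cores when a fraction s of the work is
# inherently serial is 1 / (s + (1 - s) / n).
def amdahl_speedup(serial_fraction, cores):
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / cores)

# Even a "tiny" 1% serial fraction caps 1000 cores at ~91x:
print(round(amdahl_speedup(0.01, 1000)))   # 91
# At 0.1% serial work, still only ~500x of the ideal 1000x:
print(round(amdahl_speedup(0.001, 1000)))  # 500
```

Hence "we cannot afford even tiny amounts of serialization" - the last
900 cores really do evaporate.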
"Avoid the Concurrency Trap by Embracing Non-Determinism?" (actually
not from the post, but from the Project Renaissance home page)
In this, I think we're in violent agreement - the key to taking
advantage of parallelism is to "embrace non-determinism."
In this context, I've been enjoying Carl Hewitt's recent writings
about indeterminacy in computing. If I might paraphrase a bit, isn't
the point that 'complex computing systems are inherently and always
indeterminate; let's just accept this, not try to force consistency
where it can't be forced, and get on with finding ways to solve
problems that work in an indeterminate environment'?
Which comes back to my original comment that there are broad classes
of problems that are more readily addressed through the lens of
massive concurrency (as a first-order architectural view). And that
new hardware advances (multi-core architectures, graphic processors),
and language/run-time models (actors, Erlang-like massive
concurrency), now allow us to architect systems around massive
concurrency (when the model fits the problem).
And, returning to this context:
"... For huge classes of problems - anything that's remotely
transactional or event driven, simulation, gaming come to mind
immediately - it's far easier to conceptualize as spawning a
process than trying to serialize things. The stumbling block has
always been context switching overhead. That problem goes away as
your hardware becomes massively parallel. "
Are you arguing that:
a) such problems are NOT easier to conceptualize as parallel and
asynchronous, or,
b) parallelism is NOT removing obstacles to taking actor-like
approaches to these classes of problems, or
c) something else?
I would argue all three.
Ahh... then I would counter that:
a) you selectively conceptualize only part of the system - an
idealized happy path. It is much more difficult to conceptualize your
whole system - i.e. all those sad paths you created but ignored. Many
simulators have collision detection, soft real-time latency
constraints, and consistency requirements. It is not easy to
conceptualize how your system achieves these.
In this one, I write primarily from personal experience and
observation. There is a huge class of systems that are inherently
concurrent, and inherently not serializable - pretty much any
distributed system; email and transaction processing come to mind. I
happen to think that simulators fall into this class - and in this
regard there's an existence proof:
- Today's simulators are built both ways:
-- CGFs and SAFs (multiple entities simulated on a single box) -
generally written with an object-oriented paradigm in C++ or Java,
highly optimized for performance, with code that is incredibly hard to
follow, and turns out to be rather brittle
-- networked simulations (e.g. F16 man-in-the-loop simulators linked
by network) are inherently independent processes, linked by networks
that have all kinds of indeterminacies vis-a-vis packet delays,
packet delivery order, and packet loss (you pretty much have to use
multicast UDP and accept packet loss, or you can't keep up with
real-time simulation - and the pilots tend to throw up all over the
simulators if the timing is off - sensitive thing, the human
vestibular system)
---- a much simpler architecture, systems that are much easier to follow
Very different run-time environments, very different system
architectures. Both work.
Personally, I find networked simulators to be a lot easier to
conceptualize than today's CGFs -- in one case, adding a new entity
(say a plane) to a simulation = adding a new box that has a clean
interface. In the other, it involves adding a new object, and having
to understand that there's all kinds of behind-the-scenes magic going
on, as multiple control threads wind their way through all the objects
in the system. One is a clean mapping of the problem space; the
other is just ugly.
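The "clean interface" point can be sketched in a few lines. Here's a
minimal actor-per-entity toy in Python, using a thread per entity as a
stand-in for a networked box; the class and message names are made up
for illustration. The only interface to a plane is its inbox, so
adding an entity is just spawning one more - no shared control thread
winding through every object.

```python
import queue
import threading

# Minimal actor-per-entity sketch (illustrative names). Each simulated
# plane runs independently and communicates only via its message inbox.
class Plane:
    def __init__(self, callsign):
        self.callsign = callsign
        self.inbox = queue.Queue()   # the entity's whole interface
        self.log = []
        self.thread = threading.Thread(target=self._run, daemon=True)
        self.thread.start()

    def _run(self):
        # Process messages one at a time, in arrival order.
        while True:
            msg = self.inbox.get()
            if msg == "shutdown":
                break
            self.log.append(f"{self.callsign} handled {msg}")

    def send(self, msg):
        self.inbox.put(msg)

# "Adding a plane to the simulation" = spawning one more box:
planes = [Plane("F16-1"), Plane("F16-2")]
for p in planes:
    p.send("radar-ping")
    p.send("shutdown")
for p in planes:
    p.thread.join()
    print(p.log)  # each entity processed its messages independently
```

Within one inbox, handling is serialized and ordered; across entities,
everything runs concurrently - which is the networked-simulator shape.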
Yes, as noted above, serious problems remain - but the question is
whether serial or parallel approaches are more tractable at the
architectural level.
b) parallelism is not concurrency; it does not suggest actor-like
approaches. Pipeline and data parallelism are well proven
alternatives used in real practice. There are many others, which I
have mentioned before.
Fair point. If we limit ourselves to a discussion of pipelines and
data parallelism, I'll concede that they do not necessarily lead to
cleaner conceptual mappings between problems and systems
architectures. In fact, for the examples I've been talking about, my
sense is that a pipelined approach to simulation is not particularly
easier to comprehend than current approaches - though it might take
better advantage of large numbers of processing cores. In the case of
email, I can't even begin to think about applying synchronous
parallelism to messages flowing through a federation of mail servers.
On the other hand, if we look at the larger question of "skirting
Amdahl's Law" in an environment with lots of processing cores -
certainly within some definitions of "parallelism" - then actor-like
massive concurrency approaches are certainly in bounds, and the
availability of more cores certainly allows for running more actors
without running into resource conflicts.
c) performance - context-switching overhead - isn't the most
important stumbling block. consistency, correctness, complexity are
each more important.
Ahh... here I'll throw it back to the question of architectures
and design patterns that assume inconsistency as the norm (biological
metaphors, if you will). And maybe add a touch of protocol-layering
techniques (IP packets are inherently unreliable and probabilistic in
behavior; we layer TCP on top to provide reliable connections. In
other cases - like VoIP and video streaming - we can't go back and
retransmit, so we either forget about lost packets, or use
forward-error-correcting codes).
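The layering move is worth making concrete. Here's a toy sketch in the
spirit of TCP-over-IP (not real TCP - the function names, the
stop-and-wait scheme, and the loss rate are all made up for
illustration): a lower layer that silently drops packets, with a
retransmitting layer on top that still delivers everything in order.

```python
import random

random.seed(1)  # deterministic drops for the demo

def lossy_send(packet, received, loss_rate=0.3):
    """Unreliable 'IP-like' layer: delivers the packet or drops it."""
    if random.random() >= loss_rate:
        received.append(packet)
        return True   # stand-in for an ACK making it back
    return False      # silently dropped, no ACK

def reliable_send(messages, received):
    """Reliable 'TCP-ish' layer: number each message and retransmit
    until it is acknowledged (stop-and-wait, the simplest scheme)."""
    for seq, msg in enumerate(messages):
        while not lossy_send((seq, msg), received):
            pass  # in a real stack: timeout, then retransmit

received = []
reliable_send(["a", "b", "c"], received)
print(received)  # all three arrive, in order, despite the drops
```

VoIP and streaming take the other fork: no retransmission loop at all,
just tolerate the holes or pre-pay with forward error correction.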
Like you, I believe we can achieve parallel designs while improving
simplicity. But I think I will eschew turning tanks into actors.
Agreed on the first; obviously, not on the second.
--
In theory, there is no difference between theory and practice.
In practice, there is. .... Yogi Berra
_______________________________________________
fonc mailing list
fonc@vpri.org
http://vpri.org/mailman/listinfo/fonc