On 7/26/2010 7:36 AM, Guido van Rossum wrote:
> According to CSP advocates, this approach will break down when you
> need more than 8-16 cores, since cache coherence breaks down at 16
> cores. Then you would have to figure out a message-passing approach
> (but the messages would have to be very fast).

Catching up on Python-Dev after 3 months of travel (lucky me!), so apologies for a "blast from the past" as I'm 6 weeks late in replying here.

Think of the hardware implementation of cache coherence as a MIL: a memory interleave lock, or a micro interpreter lock (the hardware is, in effect, interpreting what the compiled software is doing).

That is not so different from Python's GIL, just at a lower level.
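
To make the analogy concrete, here is a minimal sketch (plain CPython, standard library only; the numbers are illustrative, not from any of the systems discussed below) showing the GIL serializing two CPU-bound threads much as coherence hardware serializes access to a shared line:

    import threading
    import time

    def spin(n):
        # Pure-Python busy loop; the GIL lets only one thread
        # execute its bytecode at a time.
        while n:
            n -= 1

    N = 10000000

    start = time.time()
    spin(N)
    spin(N)
    print("sequential: %.2fs" % (time.time() - start))

    start = time.time()
    t1 = threading.Thread(target=spin, args=(N,))
    t2 = threading.Thread(target=spin, args=(N,))
    t1.start(); t2.start()
    t1.join(); t2.join()
    print("two threads: %.2fs" % (time.time() - start))

On a typical box the threaded run is no faster than the sequential one (often slower, from lock contention), which is the GIL acting as exactly the kind of serialization point described above.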

I didn't read the CSP advocacy papers, but experience with early parallel systems at CMU, Tandem Computers, and Teradata strongly implies that multiprocessing of some sort will always be able to scale beyond memory-coherent cores, if the application can be made parallel at all.

It is interesting to note that all of the parallel systems mentioned above implemented fast message-passing hardware of various sorts, shaped by the technologies available in their day.

It is also interesting to note the similarities between some of the extreme multi-way cache coherence approaches and the various message-passing hardware; indeed, some of the papers that talk about exceeding 16 cores were going down a message-passing road to get there. Maybe something new has been discovered in the 8 years since I stopped following the research... the only related news I've read in that time is the loss of Jim Gray at sea... but the IEEE paper you posted later seems to confirm my suspicion that there has not yet been a breakthrough.

The point of the scalability remark, though, is that while lots of problems can be solved on a multi-core system, problems also grow bigger, and there will likely always be problems that cannot be solved on a multi-core (single cache-coherent memory) system. Those problems will require message-passing solutions. Experience with the systems above has shown that switching from a multi-core (semaphore-based) design to a message-passing design is usually a rewrite, as the toy sketch below illustrates.
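
Here is that sketch (mine, not drawn from any of the systems above): the same summation job in both shapes. The shared-memory shape mutates one global under a lock; the message-passing shape gives each worker its own data and moves only messages. Little besides the arithmetic carries over from one to the other:

    import threading
    import multiprocessing

    # Shared-memory (semaphore-based) shape: one global, guarded by a lock.
    total = 0
    lock = threading.Lock()

    def add_range(lo, hi):
        global total
        s = sum(range(lo, hi))
        with lock:                  # serialize updates to the shared state
            total += s

    # Message-passing shape: workers own their data; results travel as messages.
    def worker(lo, hi, q):
        q.put(sum(range(lo, hi)))   # no shared mutable state at all

    if __name__ == "__main__":
        threads = [threading.Thread(target=add_range,
                                    args=(i * 500000, (i + 1) * 500000))
                   for i in range(2)]
        for t in threads: t.start()
        for t in threads: t.join()
        print("shared memory:   %d" % total)

        q = multiprocessing.Queue()
        procs = [multiprocessing.Process(target=worker,
                                         args=(i * 500000, (i + 1) * 500000, q))
                 for i in range(2)]
        for p in procs: p.start()
        print("message passing: %d" % (q.get() + q.get()))
        for p in procs: p.join()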

Perhaps the existence of the GIL, by forcing a message-passing solution to be created early, is a blessing in disguise for the design of large-scale applications. For years I've been hearing about problems whose data is too large to share and whose calculation is too complex to parallelize, but once the available hardware is exhausted as the problem grows, the only path to larger scale is message-passing parallelism, which forces a redesign of applications that have outgrown the hardware.

That said, applications that do fit in the available hardware can generally run somewhat faster with some sort of shared-memory approach: message passing does have overhead.
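
That overhead is easy to measure for yourself; a rough sketch (the rate will vary widely by machine and OS) that bounces small messages off a child process through a pair of multiprocessing queues:

    import multiprocessing
    import time

    def echo(q_in, q_out):
        # Bounce each message straight back until a None sentinel arrives,
        # so the timing loop below measures pure round-trip cost.
        for msg in iter(q_in.get, None):
            q_out.put(msg)

    if __name__ == "__main__":
        q_in, q_out = multiprocessing.Queue(), multiprocessing.Queue()
        p = multiprocessing.Process(target=echo, args=(q_in, q_out))
        p.start()

        n = 10000
        start = time.time()
        for i in range(n):
            q_in.put(i)
            q_out.get()
        elapsed = time.time() - start
        print("%d round trips in %.2fs (%.0f/s)" % (n, elapsed, n / elapsed))

        q_in.put(None)              # sentinel: stop the echo worker
        p.join()

Every round trip costs two queue operations plus pickling and a context switch, where a shared-memory read costs nanoseconds; that gap is the price paid for the scalability argued for above.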

--
Glenn
------------------------------------------------------------------------
I have CDO. It's like OCD, but in alphabetical order. The way it should be!
(a Facebook group is named this, except for a misspelling.)