Re: Message processing

Avery Ching Fri, 09 Sep 2011 09:03:11 -0700

Jake,

The model you're describing sounds a bit like a hybrid between BSP andasynchronous, stream based computing. Definitely would be great toexperiment with either of these a bit. I too would prefer to eliminateany complicated locking models (i.e. a distributed lock manager for allvertices). But it might come to that if we decide that asynchronousremote vertex mutation is important. I think an asynchronous modelcould provide performance benefits over BSP in some cases. Butdebugging would be more difficult (less deterministic). So I believeboth models will be useful for Giraph.


Avery

On 9/9/11 8:26 AM, Jake Mannix wrote:

On Fri, Sep 9, 2011 at 8:03 AM, Avery Ching <[email protected]<mailto:[email protected]>> wrote:
    The GraphLab model is more asynchronous than BSP  They allow you
    to update your neighbors rather than the BSP model of messaging
    per superstep.  Rather than one massive barrier in BSP, they
    implement this with vertex locking.  They also all a vertex to
    modify the state of its neighbors.  We could certainly add
    something similar as an alternative computing model, perhaps
    without locking.  Here's one idea:

    1) No explicit supersteps (asynchronous)
This sounds interesting, esp. for streaming algorithms, although I wasthinking something slightly less ambitions to start out: still havesupersteps (effectively) which describe when each vertex has had achance to send all messages it wants for this iteration, and hasprocessed all inbound messages.
    2) All vertices execute compute() (and may or may not send
    messages) initially
    3) Vertices can examine their neighbors or any vertex in the graph
    (issue RPCs to get their state)
"or any vertex in the graph" sounds pretty scary, but yes, powerful.I like it when my seemingly radical ideas get made look not so scaryby comparison! :)
    4) When messages are received by a vertex, compute() is executed
    on it (and state is locally locked to compute only)
    5) Vertices stlll vote to halt when done, indicating the end of
    the application.
    6) Combiners can still be used to reduce the number of messages
    sent (and the number of times compute is executed).

    This could be fun.  And provide an interesting comparison platform
    barrier based vs vertex based synchronization.
Yeah, I think locking is an implementation detail which might be evenavoidable: if Vertices are effectively given a messageQueue which theycan process from, we could interpolate between buffering andprocessing messages synchonously. The per-mapper threading modelcould get... interesting!
  -jake

Re: Message processing

Reply via email to