Re: [OMPI devel] Device failover in dr pml (fwd)

2009-04-16 Thread Jeff Squyres
On Apr 16, 2009, at 9:12 AM, Ralph Castain wrote: Sounds fine, though note that we don't want ob1 itself to do this as it inevitably adds overhead that translates into latency. Instead, we want that functionality to be in a separate component for those people who want to use it. To drive this

Re: [OMPI devel] Device failover in dr pml (fwd)

2009-04-16 Thread Ralph Castain
Sounds fine, though note that we don't want ob1 itself to do this as it inevitably adds overhead that translates into latency. Instead, we want that functionality to be in a separate component for those people who want to use it. We did talk on a telecon earlier this week about the need to

Re: [OMPI devel] Device failover in dr pml (fwd)

2009-04-16 Thread Sylvain Jeaugey
Well, if reviving means making device failover work, then yes, in a way we revived it ;) We are currently making mostly experiments to figure out how to have device failover working. No big fixes for now, and that's why we are posting here before going further. From what I understand, Rolf's

Re: [OMPI devel] Device failover in dr pml

2009-04-15 Thread Rolf Vandevaart
Hi, We are also looking to get device failover working. However, for the reasons cited by Ralph, we are using the OB1 PML as the starting point. Also, similar to you, we do not need the checksumming feature or the timed out retransmission that the dr PML provides. Rolf Ralph Castain wrot

Re: [OMPI devel] Device failover in dr pml

2009-04-15 Thread Ralph Castain
Last anyone knew, the dr pml was dead - way out of date and unmaintained. I gather that you folks have revived it and sync'd it back up to the current ob1 module? I don't think anyone really cares what is done with the dr module itself. There are others working on failover modules, and ther