Re: [OMPI devel] problem in runing MPI job through XGrid

2007-10-10 Thread Jinhui Qin
Hi Brian, I found the problem. It looks like xgrid need to do more work on fault tolerance. It seems that xgrid controller distributed jobs to each available agent only in certain fixed order, if one of the agents has problem in communicating with the controller, all jobs failed, even when the

Re: [OMPI devel] DDT for v1.2 branch

2007-10-10 Thread Terry Dontje
Jeff Squyres wrote: George has proposed to bring the DDT over from the trunk to the v1.2 branch before v1.2.5 in order to fix some pending bugs. What does this entail (ie does this affect the pml interface at all)? Also by saying "before v1.2.5" I am assuming you mean this fix is to be put

[OMPI devel] DDT for v1.2 branch

2007-10-10 Thread Jeff Squyres
George has proposed to bring the DDT over from the trunk to the v1.2 branch before v1.2.5 in order to fix some pending bugs. I do not think that this has been tested yet, but are there any knee- jerk reactions against doing this? -- Jeff Squyres Cisco Systems