Hi Brian,
I found the problem. It looks like xgrid needs to do more work on fault
tolerance. It seems that the xgrid controller distributes jobs to each
available agent only in a certain fixed order; if one of the agents has a
problem communicating with the controller, all jobs fail, even when the
Jeff Squyres wrote:
George has proposed to bring the DDT over from the trunk to the v1.2
branch before v1.2.5 in order to fix some pending bugs.
What does this entail (i.e., does this affect the pml interface at all)?
Also, by saying "before v1.2.5", I am assuming you mean this fix is to
be put
George has proposed to bring the DDT over from the trunk to the v1.2
branch before v1.2.5 in order to fix some pending bugs.
I do not think that this has been tested yet, but are there any knee-
jerk reactions against doing this?
--
Jeff Squyres
Cisco Systems