Hi everyone,

I'm running Moses through its experiment management system (EMS) across a cluster, and I'm finding that jobs sometimes finish successfully but their .STDERR and .STDOUT files appear later than the .DONE file. As a result, the EMS concludes that the step crashed. I can run the system again and it successfully reuses the results of the step (it doesn't have to rerun the step), but this is becoming frustrating because I have to restart the system frequently.

I tried adding a call to sleep() in the check_if_crashed() method in experiment.perl, but this does not help in general. I think the delay is sometimes as long as a couple of minutes.
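To make the idea concrete, what I think I want instead of a fixed sleep() is a poll with a timeout: keep checking for the output files and only give up after some upper limit. Here is a rough sketch in Python of that logic. It is not code from experiment.perl; the function name `wait_for_files`, the default timings, and the `steps/1/TRAINING_run.1` prefix in the usage note are all made up for illustration.

```python
import os
import time


def wait_for_files(paths, timeout=180.0, interval=5.0):
    """Poll until every path in `paths` exists or `timeout` seconds elapse.

    Returns the list of paths that are still missing (empty on success).
    Hypothetical helper for illustration, not part of the Moses EMS.
    """
    deadline = time.monotonic() + timeout
    while True:
        # Re-check the filesystem on every pass rather than sleeping once.
        missing = [p for p in paths if not os.path.exists(p)]
        if not missing or time.monotonic() >= deadline:
            return missing
        time.sleep(interval)
```

The crash check would then only go ahead once something like `wait_for_files([prefix + ".STDERR", prefix + ".STDOUT"])` comes back empty, where `prefix` is a step prefix such as the made-up `steps/1/TRAINING_run.1`. Unlike a fixed sleep, this returns as soon as the files show up and only waits the full timeout in the worst case.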
Has anyone else faced this problem, or have a better idea for how to get around it?

Cheers,
Suzy

--
Suzy Howlett
http://www.showlett.id.au/