Hi everyone,

I'm running Moses through its experiment management system (EMS) across a 
cluster, and I'm finding that jobs sometimes finish successfully but the 
.STDERR and .STDOUT files are slow to appear relative to the .DONE file, 
so the EMS concludes that the step crashed. When I run the system again 
it successfully reuses the results of the step (it doesn't have to rerun 
it), but this is becoming frustrating because I have to restart the 
system frequently. I tried adding a call to sleep() in the 
check_if_crashed() method in experiment.perl, but this doesn't help in 
general: I think the delay is sometimes as much as a couple of minutes.
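
To make the idea concrete, here is a rough sketch of the kind of 
workaround I have in mind: polling for the output files up to a timeout 
rather than sleeping once for a fixed time. It is written in Python for 
brevity and is not the actual experiment.perl code; the function name, 
the way the file names are built from a prefix, and the timeout values 
are all assumptions made for illustration.

```python
import os
import time


def wait_for_outputs(step_prefix, suffixes=(".STDERR", ".STDOUT"),
                     timeout=300.0, interval=5.0):
    """Poll until every expected output file exists or the timeout expires.

    step_prefix is a hypothetical path prefix shared by a step's files;
    the real EMS naming scheme may differ.
    Returns True if all files appeared in time, False otherwise.
    """
    deadline = time.monotonic() + timeout
    while True:
        # Collect the suffixes whose files are still not visible.
        missing = [s for s in suffixes
                   if not os.path.exists(step_prefix + s)]
        if not missing:
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

The step would only be treated as crashed if this returned False, so a 
fast filesystem pays no extra delay while a slow one gets up to the full 
timeout.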

Has anyone else faced this problem, or does anyone have a better idea 
for how to get around it?

Cheers,
Suzy

-- 
Suzy Howlett
http://www.showlett.id.au/
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support