Hi,

On 21.08.2012 at 22:44, Henrichs, Juryk wrote:

> We are running SGE 6.2u5. I am trying to restart jobs via checkpointing. 
> On one of our clusters that works fine: a job is suspended via the 
> suspend command, stopped, rescheduled in the queue, and restarted if 
> resources are available.
> 
> With apparently the same SGE setup on a second cluster, my jobs 
> are rescheduled but do not get started. qstat -sj shows
> "cannot run on host XXX until clean up of an previous run has finished"
> 
> If the job is deleted from the queue and restarted manually, it works perfectly.
> 
> Is there a way to get a more elaborate error message and to find out 
> what exactly goes wrong with the cleanup?

Depending on the checkpointing setup, it might be necessary to remove all 
processes of a job in the script defined as "migr_command". Which checkpointing 
type do you use, and how do you remove the processes therein?
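
To illustrate what "removing all processes" in such a script could look like, 
here is a minimal sketch in Python. It is not taken from any actual setup. It 
assumes that the script is configured as "migr_command" with $job_pid as its 
only argument, and that all processes of the job share the process group of 
that PID, which may not hold on your cluster. The function name 
terminate_process_group and the grace period are made up for this example.

```python
import os
import signal
import sys
import time


def terminate_process_group(pgid, grace_seconds=5.0):
    """Send SIGTERM to a process group, then SIGKILL if it is still there.

    Returns True once the group can no longer be signalled, False otherwise.
    Note: unreaped (zombie) members still count as existing.
    """
    for sig in (signal.SIGTERM, signal.SIGKILL):
        try:
            os.killpg(pgid, sig)
        except ProcessLookupError:
            return True  # the group is already gone
        deadline = time.monotonic() + grace_seconds
        while time.monotonic() < deadline:
            try:
                os.killpg(pgid, 0)  # signal 0 only tests for existence
            except ProcessLookupError:
                return True
            time.sleep(0.1)
    return False


# Hypothetical invocation: migr_command /path/to/this_script.py $job_pid
if __name__ == "__main__" and len(sys.argv) > 1:
    try:
        # Assumption: the job's processes share the group of $job_pid.
        group = os.getpgid(int(sys.argv[1]))
    except ProcessLookupError:
        sys.exit(0)  # nothing left to clean up
    sys.exit(0 if terminate_process_group(group) else 1)
```

If the job's processes do not share one process group on your system, this 
sketch will miss some of them, and the leftover processes would have to be 
found in another way.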

-- Reuti


> Juryk
> 
> 
> 
> _______________________________________________
> users mailing list
> [email protected]
> https://gridengine.org/mailman/listinfo/users
> 

