Hi All.

I am using Son of Grid Engine 8.1.6.

We have an issue that occurs once in a while in which Grid Engine will suspend a job ( subordinate queue ) and while Grid Engine thinks the job is suspended ( qstat shows "S" for job state ), the process on the node keeps running and not really suspended.

If I manually suspend the job ( qmod -sj <job-id> ), then the process suspends just fine on the node and I see the "Ss" in qstat listing.

Is there a way to tell Grid Engine to re-issue a suspend signal to processes on a node that are supposed to be suspended?

I can manually tell GE to suspend a job ( qmod -sj ) but then I have to also manually un-suspend it. So what I am looking for is to have GE re-issue suspend signals for jobs it believes are already suspended.

Thanks,
Joseph



_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to