Hello,

I'm experiencing a very weird issue. I've no idea how to deal with it.

 * I've submited multiple jobs ie: job1, job2, job3.
 * Jobs are running in multiple compute nodes
 * I've modified jobs to user hold and then rescheduled
 * Jobs are now in a hqR state in SGE job pool (they're supposed to
   stay there and free their slots and resources in their respective
   compute nodes)
 * Compute nodes that previously ran this jobs continue to execute the
   job process and consuming resources (I can see them with htop inside
   compute node)

So what's the correct way to pause/restart a job and hold it on SGE pool without holding resources?

Thank you!

Regards,
Guillermo.

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to