Re: [gridengine users] Forgetting the Subordinate Queue

2013-01-01 Thread Reuti
Hi, On 31.12.2012 at 19:14, Joseph Farran wrote: > Hi All. > > I am running GE 8.1.2 and I have a situation where once in a while (2x a week), Grid Engine forgets about one of the Subordinate queues. > > Everything works as expected where my subordinate queue goes to "S" suspend-mode w

Re: [gridengine users] How prevent abnormal nodes load using qsub?

2013-01-01 Thread Reuti
On 31.12.2012 at 09:34, Semi wrote: > The memory is not a problem, the problem is CPU load, > every python process runs 2 other processes and this gets the nodes stuck. > For example: 16 CPU nodes run 48 python processes. The question was: as you suggested to your user to request "-l mem_free=4G" it implies

Re: [gridengine users] A 10,000-node Grid Engine Cluster in Amazon EC2

2013-01-01 Thread Ron Chen
There's also the SDM module that was released with SGE 6.2u5, and with SDM Grid Engine can burst to EC2. Not sure if anyone is still using SDM, but for those interested in learning more about it: https://blogs.oracle.com/templedf/entry/service_domain_manager -Ron

Re: [gridengine users] Forgetting the Subordinate Queue

2013-01-01 Thread Joseph Farran
Hello Reuti. Yes, the job(s) are not suspending (S) as they normally do. So it's not the queue, but the jobs. Normally as soon as 1 or more core jobs enter the node through the queue, the subordinate jobs suspend immediately. Once in a while, the jobs that go in through the subordinate qu