Re: [gridengine users] Large cluster with memory reservation leaving cores idle

2016-03-09 Thread Reuti
Hi Chris, On 08.03.2016 at 18:29, Christopher Black wrote: > Thanks for the reply Reuti! > > Sounds like some of the suggestions are moving limits out of RQS and into > complexes and consumable resources. Yep. > How do we make that happen without > requiring users to add -l bits to their qsu

Re: [gridengine users] Large cluster with memory reservation leaving cores idle

2016-03-08 Thread Christopher Black
Thanks for the reply Reuti! Sounds like some of the suggestions are moving limits out of RQS and into complexes and consumable resources. How do we make that happen without requiring users to add -l bits to their qsubs? On 3/8/16, 7:32 AM, "Reuti" wrote: >I saw cases where RQS blocks further sch

Re: [gridengine users] Large cluster with memory reservation leaving cores idle

2016-03-08 Thread Reuti
Hi, > On 08.03.2016 at 00:20, Christopher Black wrote: > > Greetings! > We are running SoGE (mix of 8.1.6 and 8.1.8, soon 8.1.8 everywhere) on a > ~300 node cluster. > We utilize RQS and memory reservation via a complex to allow most nodes to > be shared among multiple queues and run a mix of s

Re: [gridengine users] Large cluster with memory reservation leaving cores idle

2016-03-08 Thread Fotis Georgatos
Hi William, Christopher, all, qtop is not yet made to visualize memory allocation info (or any other consumables so far), yet there is no reason why it couldn't: https://github.com/qtop/qtop/tree/develop If people come up with some scheme which makes sense to debug job allocation information, it

Re: [gridengine users] Large cluster with memory reservation leaving cores idle

2016-03-08 Thread William Hay
On Mon, Mar 07, 2016 at 11:20:04PM +, Christopher Black wrote: > Greetings! > We are running SoGE (mix of 8.1.6 and 8.1.8, soon 8.1.8 everywhere) on a > ~300 node cluster. > We utilize RQS and memory reservation via a complex to allow most nodes to > be shared among multiple queues and run a mi

[gridengine users] Large cluster with memory reservation leaving cores idle

2016-03-07 Thread Christopher Black
Greetings! We are running SoGE (mix of 8.1.6 and 8.1.8, soon 8.1.8 everywhere) on a ~300 node cluster. We utilize RQS and memory reservation via a complex to allow most nodes to be shared among multiple queues and run a mix of single core and multi core jobs. Recently when we hit 10k+ jobs in qw, w