Itay M wrote:
'ncpus' still exists, but only in 17 'old' jobs, the ones that were submitted before we made the 'unset' change. I guess I should wait until these finish and then re-test the system?

Possibly, but rather unlikely...

diagnose -n says, for example, on node28:
node28 Busy 0:4 2926:3950 1:1 3871:7641 1.00 DEFAUL [NONE] DEF 2.19 002 [heavy_2:4][light_4:4][b_que [DEFAULT] [NONE]
WARNING:  node 'node28' has more processors utilized than dedicated (4 > 2)
----- --- 6:86 72602:98716 26:26 142420:212774

But this node is running 2 jobs, neither of which has an 'ncpus' setting when I run qstat -f on them.

Can you also report the output of checkjob and diagnose -j on these 2 jobs? Do they also have the MEM requirement?

About the MEM requirement: do you mean to unset it too? Other than that, we don't use any MEM requirement in our qsub script.

Well, it must be coming from somewhere, quite possibly from a default in the queue or server configuration. So I'd try unsetting it there. However, looking at the diagnose -n output above makes me think it is processor related - judging from the 0:4, for some unknown reason your jobs consume 2 processors each rather than 1.
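
As a rough sketch of the arithmetic behind that guess, the snippet below reads the processor column from the node28 line and divides the processors in use by the number of running jobs. It assumes the third field of diagnose -n is available:configured processors; the helper name parse_procs is made up for illustration and the job count of 2 is taken from the report above.

```python
def parse_procs(node_line):
    """Return (available, configured) processors from a diagnose -n node line.

    Assumption: the third whitespace-separated field is 'available:configured'.
    """
    fields = node_line.split()
    available, configured = (int(part) for part in fields[2].split(":"))
    return available, configured


# Leading fields of the node28 line quoted above (the rest is not needed here).
line = "node28 Busy 0:4 2926:3950 1:1 3871:7641 1.00 DEFAUL [NONE] DEF 2.19 002"

available, configured = parse_procs(line)
running_jobs = 2                      # reported for node28 in this thread
used = configured - available         # processors currently taken
per_job = used / running_jobs         # average processors per job

print(f"{used} of {configured} processors in use, {per_job:g} per job")
```

If each job had asked for a single processor, one would expect 2 of the 4 processors to remain available, so a result above 1 per job is what points at a processor setting rather than memory.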

Regards,
Jan Ploski
_______________________________________________
mauiusers mailing list
mauiusers@supercluster.org
http://www.supercluster.org/mailman/listinfo/mauiusers
