Hi,

It's this bit that's doing it: "SGE_ND=true". It's there so that the qmaster doesn't daemonise, in order to play nicely with systemd. Unfortunately, as it was originally put in to aid debugging, it also enables some debug messages.

If too much is being generated, I'd suggest either redirecting the output of qmaster to /dev/null, or playing with the variable William mentioned (SGE_DEBUG_LEVEL) to see if you can reduce the amount. There's some documentation of said variable here:

http://sid.ethz.ch/debian/gridscheduler/gridscheduler-0%2B20110210/source/libs/rmon/rmon.html

I wonder if setting SGE_DEBUG_LEVEL="0 0 0 0 0 0 0 0" might improve things for you, but I suspect it won't. If not, there's probably some missing if tests in the debug code and the "correct" fix is probably to edit them to prevent those messages from being generated unless the appropriate SGE_DEBUG_LEVEL is set.

Mark

On Thu, 22 Mar 2018, Sms Backup wrote:

Hello,

[root@qmaster ~]# cat /proc/44697/environ
SGE_CELL=defaultSGE_ND=truePATH=/usr/local/sbin:/usr/
local/bin:/usr/sbin:/usr/binPWD=/SGE_EXECD_PORT=6814LANG=en_US.UTF-8SGE_
QMASTER_PORT=6813SGE_ROOT=/opt/sgeSHLVL=1SGE_CLUSTER_NAME=`cat
$SGE_ROOT/$SGE_CELL/common/cluster_name 2>/dev/null`_=/opt/sge/bin/lx-
amd64/sge_qmaster

As you see there is no debug enabled...I do not know where these messages
come from...I even tried disabling accounting, but it did not decrease
messages number.


2018-03-21 15:54 GMT+01:00 William Hay <[email protected]>:

On Wed, Mar 21, 2018 at 07:59:41AM +0100, Sms Backup wrote:
   William,
   Thanks for reply. Unfortunately I have few non-interactive queues, so
I
   cannot limit slots this way.
   99% of messages printed to system log look like this below, so I
believe
   that are the messages which are suppressed:
   Mar 20 21:55:18 qmaster sh: -------------------------------
   Mar 20 21:55:18 qmaster sh: RUE_name             (String)    =
   thomas///medium.q//
   Mar 20 21:55:18 qmaster sh: RUE_utilized_now     (Double)    =
2.000000
   Mar 20 21:55:18 qmaster sh: RUE_utilized         (List)      = empty
   Mar 20 21:55:18 qmaster sh: RUE_utilized_now_non (Double)    =
0.000000
   Mar 20 21:55:18 qmaster sh: RUE_utilized_nonexcl (List)      = empty

I'm wondering if your qmaster is running with debuging enabled.
If you dump the environment of the qmaster (/proc/<pid>/environ)
is there a mention of SGE_DEBUG_LEVEL?

If so you should try to figure out where it gets and tweak to a
more appropriate level before restarting the qmaster.

William




_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to