Hallo to distinguished forum members,

I hope you can assist me.
We are in process of transition to SoGE 8.1.8 from SGE GE 6.1u6.

We experience strange issues with reporting variables.
For example we have a reporting variable called mt_os_type which returns OS 
type.

Output example 1: centos6
Output example 1: centos5

#name                  shortcut               type        relop requestable 
consumable default  urgency
#-------------------------------------------------------------------------------------------------------
mt_os_type             mt_os_type             STRING      ==    YES         NO  
       NONE     0

qconf -se global | grep -i mt_os_typeL
                      mt_syslog_bytes,mt_yp_query_ms,mt_os_type

It works OK on SGE GE 6.1.u6, while on SoGE 8.1.8 it behaves inconsistently: it 
appears and after some time the variable and its value disappears from 
attributes on execution hosts.
qconf -se <some_host> | /bin/grep -i <some_host>:
mt_os_type=centos5

And after a minute:
qconf -se <some_host> | /bin/grep -i <some_host>:

No variable and no value...

Some additional information:

Report time is (qconf -sconf):
load_report_time             00:00:40


Within load_sensors.sh script we use several types of sensors: the ones which 
are invoked every 40 sec and other that invoked every 2 and 5 minutes, see 
below:

load_sensors.sh (part of it):
MINUTES=`date +%M | cut -c2`

###########################################################################
# Execute High Rate sensors
###########################################################################
for LOAD_SENSOR in \
mt_dns_query \
mt_iowait
do
echo "${MYHOST}:${LOAD_SENSOR}:"`source ${LOADBIN}/${LOAD_SENSOR}.sh`
done
###########################################################################
# Execute Medium Rate sensors
###########################################################################
if [[ ${MINUTES} = "0" || ${MINUTES} = "2" || ${MINUTES} = "4" || ${MINUTES} = 
"6" || ${MINUTES} = "8" ]]; then

for LOAD_SENSOR in \
mt_my_sensor3 \
mt_my_sensor4

do
  echo "${MYHOST}:${LOAD_SENSOR}:"`source ${LOADBIN}/${LOAD_SENSOR}.sh`
done

Fi

###########################################################################
# Execute Low Rate sensors
###########################################################################
if [[ ${MINUTES} = "0" || ${MINUTES} = "5" ]]; then

for LOAD_SENSOR in \
  mt_os_type
do
  echo "${MYHOST}:${LOAD_SENSOR}:"`source ${LOADBIN}/${LOAD_SENSOR}.sh`
done

fi

Sensors which are located in High Rate sensors block (40sec) work OK:
qconf -se mtlx346 | egrep -i iowait:
mt_dns_query_ms=0,mt_iowait=0.874

Sensors which are located within Medium/Low Rate (2 & 5 mins) block work 
inconsistently - they appear and disappear.

NOTE: In old SGE 6.1u6 this worked OK.

Any ideas why does this strange behavior occur?
Thanks.


Yuri Burmachenko | Sr. Engineer | IT | Mellanox Technologies Ltd.
Work: +972 74 7236386 | Cell +972 54 7542188 |Fax: +972 4 959 3245
Follow us on Twitter<http://twitter.com/mellanoxtech> and 
Facebook<http://www.facebook.com/pages/Mellanox-Technologies/223164879116>

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to