Hallo to distinguished forum members,
I hope you can assist me.
We are in process of transition to SoGE 8.1.8 from SGE GE 6.1u6.
We experience strange issues with reporting variables.
For example we have a reporting variable called mt_os_type which returns OS
type.
Output example 1: centos6
Output example 1: centos5
#name shortcut type relop requestable
consumable default urgency
#-------------------------------------------------------------------------------------------------------
mt_os_type mt_os_type STRING == YES NO
NONE 0
qconf -se global | grep -i mt_os_typeL
mt_syslog_bytes,mt_yp_query_ms,mt_os_type
It works OK on SGE GE 6.1.u6, while on SoGE 8.1.8 it behaves inconsistently: it
appears and after some time the variable and its value disappears from
attributes on execution hosts.
qconf -se <some_host> | /bin/grep -i <some_host>:
mt_os_type=centos5
And after a minute:
qconf -se <some_host> | /bin/grep -i <some_host>:
No variable and no value...
Some additional information:
Report time is (qconf -sconf):
load_report_time 00:00:40
Within load_sensors.sh script we use several types of sensors: the ones which
are invoked every 40 sec and other that invoked every 2 and 5 minutes, see
below:
load_sensors.sh (part of it):
MINUTES=`date +%M | cut -c2`
###########################################################################
# Execute High Rate sensors
###########################################################################
for LOAD_SENSOR in \
mt_dns_query \
mt_iowait
do
echo "${MYHOST}:${LOAD_SENSOR}:"`source ${LOADBIN}/${LOAD_SENSOR}.sh`
done
###########################################################################
# Execute Medium Rate sensors
###########################################################################
if [[ ${MINUTES} = "0" || ${MINUTES} = "2" || ${MINUTES} = "4" || ${MINUTES} =
"6" || ${MINUTES} = "8" ]]; then
for LOAD_SENSOR in \
mt_my_sensor3 \
mt_my_sensor4
do
echo "${MYHOST}:${LOAD_SENSOR}:"`source ${LOADBIN}/${LOAD_SENSOR}.sh`
done
Fi
###########################################################################
# Execute Low Rate sensors
###########################################################################
if [[ ${MINUTES} = "0" || ${MINUTES} = "5" ]]; then
for LOAD_SENSOR in \
mt_os_type
do
echo "${MYHOST}:${LOAD_SENSOR}:"`source ${LOADBIN}/${LOAD_SENSOR}.sh`
done
fi
Sensors which are located in High Rate sensors block (40sec) work OK:
qconf -se mtlx346 | egrep -i iowait:
mt_dns_query_ms=0,mt_iowait=0.874
Sensors which are located within Medium/Low Rate (2 & 5 mins) block work
inconsistently - they appear and disappear.
NOTE: In old SGE 6.1u6 this worked OK.
Any ideas why does this strange behavior occur?
Thanks.
Yuri Burmachenko | Sr. Engineer | IT | Mellanox Technologies Ltd.
Work: +972 74 7236386 | Cell +972 54 7542188 |Fax: +972 4 959 3245
Follow us on Twitter<http://twitter.com/mellanoxtech> and
Facebook<http://www.facebook.com/pages/Mellanox-Technologies/223164879116>
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users