Hello Adrian,

On 13 Jun 2013, at 15:32, Adrian Sevcenco wrote:

> On 06/09/2013 11:26 AM, Sergio Ballestrero wrote:
>> We run a single gmetad with 11 collector gmond (listen-only) and 1
>> "client" gmond (send-only) to monitor the host itself, all using unicast.
> Hi! Are this gmonds on the same (gmetad) host? i imagine that yes, so i
> ask you : how did you specify the each configuration?
> was enough to replace in /etc/init.d/gmond
> GMOND=/usr/sbin/gmond
> with
> GMOND=/usr/sbin/gmond -c /etc/ganglia/my_conf_1
> ?
It was not really enough, because then the service stop kills all gmond 
indiscriminately.
I've copied the "server" script at bottom, feel free to use it. Then the 
"client" gmond is the same except for replacing where needed
 for CFG in /etc/ganglia/gmond.conf; do
I keep them separate for ease of management (e.g. restart all client gmond)

> Each gmond collector defines a separate cluster?
Yes, grouped by function. The majority of them is actually all in one cluster, 
the HLT worker nodes.

> For this number of nodes how did you chose the size of rrd storage? (the
> time granularity) how big do you have the rrd archive?
For the moment we are using the defaults, so about 500KB per node, 1.3GB total 
(probably some duplication from moving hosts across groups)
We'll probably want to tune it later, try to keep high res for a bit longer to 
help debugging performance issues older than one hour.

Cheers,
  Sergio

-------------- sysV Init script -------------
[sash@pc-tdq-sys-02 ganglia]$ cat /etc/init.d/gmondsrv 
#!/bin/sh 
#
# chkconfig: - 70 40
# description: gmond server startup script
#
GMOND=/usr/sbin/gmond

. /etc/rc.d/init.d/functions

RETVAL=0

case "$1" in
    start)
        echo -n "Starting GANGLIA gmond: "
        [ -f $GMOND ] || exit 1
        RETVAL=0
        for CFG in /etc/ganglia/gmond.*.srvconf; do
            PID=/var/run/$(basename $CFG).pid
            daemon --pidfile $PID $GMOND -c $CFG -p $PID
            RETVAL=$[RETVAL||$?]
        done
        [ "$RETVAL" -eq 0 ] && success $"$base startup" || failure $"$base 
startup"
        echo
        [ $RETVAL -eq 0 ] && touch /var/lock/subsys/gmond
        ;;

  stop)
      echo -n "Shutting down GANGLIA gmond: "   
      RETVAL=0
      for CFG in /etc/ganglia/gmond.*.srvconf; do
          PID=/var/run/$(basename $CFG).pid
          killproc -p $PID gmond
          R=$?
          RETVAL=$[RETVAL||R]
      done
      [ "$RETVAL" -eq 0 ] && success $"$base startup" || failure $"$base 
startup"
      echo
      [ $RETVAL -eq 0 ] && rm -f /var/lock/subsys/gmond
        ;;

  restart|reload)
      $0 stop
      $0 start
      RETVAL=$?
      ;;
  status)
      RETVAL=0
      for CFG in /etc/ganglia/gmond.*.srvconf; do
        PID=/var/run/$(basename $CFG).pid
        status -p $PID gmond
        RETVAL=$[RETVAL||$?]
      done
      ;;
  *)
      echo "Usage: $0 {start|stop|restart|status}"
      exit 1
      ;;
esac

exit $RETVAL



------------------------------------------------------------------------------
This SF.net email is sponsored by Windows:

Build for Windows Store.

http://p.sf.net/sfu/windows-dev2dev
_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to