master2:/usr/sbin # ./gmond --version gmond 3.0.3 master2:/usr/sbin # ./gmetad --version ganglia 3.0.3
The web interface to Ganglia fails to work after I boot because gmetad isn't starting. master2:~ # ps aux |grep gmond nobody 3747 0.1 0.0 23716 2404 ? Ss 12:53 0:00 /usr/sbin/gmond root 4325 0.0 0.0 4188 748 pts/0 S+ 12:55 0:00 grep gmond master2:~ # netstat -an |grep 8649 tcp 0 0 0.0.0.0:8649 0.0.0.0:* LISTEN udp 0 0 10.2.7.1:32772 239.2.11.71:8649 ESTABLISHED udp 0 0 239.2.11.71:8649 0.0.0.0:* master2:~ # ps aux |grep gmetad root 4389 0.0 0.0 4180 740 pts/0 S+ 12:56 0:00 grep gmetad master2:~ # netstat -an |grep 8651 master2:~ # /etc/init.d/gmetad restart Shutting down gmetad done Starting gmetad done master2:~ # netstat -an |grep 8651 tcp 0 0 0.0.0.0:8651 0.0.0.0:* LISTEN This is odd because my /var/log/boot.msg has: <notice>gmetad start <notice>'gmetad start' exits with status 0 <notice>gmond start Adding route for multicast trafficdone Starting gmond<notice>startproc: execve (/usr/sbin/gmond) [ /usr/sbin/gmond ], [ CONSOLE=/dev/console ROOTFS_FSTYPE=xfs SHELL=/bin/sh TERM=linux ROOTFS_FSCK=0 LC_ALL=POSIX INIT_VERSION=sysvinit-2.86 REDIRECT=/dev/tty1 COLUMNS=80 PATH=/bin:/sbin:/usr/bin:/usr/sbin vga=normal RUNLEVEL=5 PWD=/ SPLASHCFG= PREVLEVEL=N LINES=25 HOME=/ SHLVL=2 SPLASH=no ROOTFS_BLKDEV=/dev/sda2 _=/sbin/startproc DAEMON=/usr/sbin/gmond ] done <notice>'gmond start' exits with status 0 It appears that gmetad is starting before gmond. Which should start first? Does it matter? What happens if gmetad starts before gmond? The documentation shows that gmond is started first but it doesn't list any specific start order dependencies. master2:~ # cd /etc/rc.d master2:/etc/rc.d # find . -name \*gmetad\* ./rc3.d/S10gmetad ./rc3.d/K12gmetad ./rc5.d/S10gmetad ./rc5.d/K12gmetad ./gmetad master2:/etc/rc.d # find . -name \*gmond\* ./rc3.d/S10gmond ./rc3.d/K12gmond ./rc5.d/S10gmond ./rc5.d/K12gmond ./gmond So, based on this and how init.d works, with my configuration, it will try to start gmetad before gmond. I get this error: There was an error collecting ganglia data (127.0.0.1:8652): fsockopen error: Connection refused ... until gmetad is started manually. This makes sense, since the web interface gathers data from gmetad, but I don't know why my configuration is saying the service is starting when it really isn't. Does gmetad terminate if gmond hasn't started yet? Any clues? Thanks in advance. - Ken ------------------------------------------------------------------------- Sponsored by: SourceForge.net Community Choice Awards: VOTE NOW! Studies have shown that voting for your favorite open source project, along with a healthy diet, reduces your potential for chronic lameness and boredom. Vote Now at http://www.sourceforge.net/community/cca08 _______________________________________________ Ganglia-general mailing list Ganglia-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-general