Re: [Ganglia-general] Incorrect boottime and uptime

2009-07-16 Thread Ken Teague
On 7/8/2009 3:43 PM, Ken Teague wrote: > On 7/8/2009 3:30 PM, Bernard Li wrote: >> Hi Ken: >> >> Okay, try this: >> >> Figure out the user gmond is running as (common examples are: ganglia, >> nobody, etc.). See if you can cat /proc/stat as that user.

Re: [Ganglia-general] Incorrect boottime and uptime

2009-07-08 Thread Ken Teague
On 7/8/2009 3:30 PM, Bernard Li wrote: > Hi Ken: > > Okay, try this: > > Figure out the user gmond is running as (common examples are: ganglia, > nobody, etc.). See if you can cat /proc/stat as that user. master3:~ # ps aux |grep gmond nobody 31801 0.0 0.0 23128 2884 ?Ss 15:30

Re: [Ganglia-general] Incorrect boottime and uptime

2009-07-08 Thread Ken Teague
On 7/8/2009 2:21 PM, Bernard Li wrote: > You should be looking at /proc/stat on your *nodes*, not on your > masters. I am guessing that perhaps your nodes don't have the /proc > filesystem mounted or something like that. btime in /proc/stat is fine on the nodes as well. I should also note that

Re: [Ganglia-general] Incorrect boottime and uptime

2009-07-08 Thread Ken Teague
On 7/8/2009 1:24 PM, Bernard Li wrote: > I just looked at the code, Ganglia determines boottime based on btime > of /proc/stat. If it fails to get the value of btime, it sets > boottime to 0 (which is what you are observing). I also want to point out that what you're stating here is correct, as

Re: [Ganglia-general] Incorrect boottime and uptime

2009-07-08 Thread Ken Teague
On 7/8/2009 1:24 PM, Bernard Li wrote: > I just looked at the code, Ganglia determines boottime based on btime > of /proc/stat. If it fails to get the value of btime, it sets > boottime to 0 (which is what you are observing). > > uptime is derived from boottime. > > So, the question is, what is

Re: [Ganglia-general] Incorrect boottime and uptime

2009-07-08 Thread Ken Teague
On 7/8/2009 11:16 AM, Bernard Li wrote: > Hi Ken: Hi Bernard > What OS/arch are the nodes in cluster2/cluster3 running on? Is it > different from cluster1? They're all running SUSE. cluster1 is on SUSE 10.1 and cluster2 and cluster3 are running openSUSE 10.3. master:~ # cat /etc/*release SU

[Ganglia-general] Incorrect boottime and uptime

2009-07-08 Thread Ken Teague
I have 3 separate clusters; cluster1, cluster2, and cluster3. On cluster2 and cluster3, if I go into the Ganglia web interface and click on, say, node2 of that cluster, it's reporting an incorrect boottime and uptime. boottimeWed, 31 Dec 1969 19:00:00 -0500 uptime 14433 days,

[Ganglia-general] gmetad not starting on boot

2008-07-01 Thread Ken Teague
master2:/usr/sbin # ./gmond --version gmond 3.0.3 master2:/usr/sbin # ./gmetad --version ganglia 3.0.3 The web interface to Ganglia fails to work after I boot because gmetad isn't starting. master2:~ # ps aux |grep gmond nobody3747 0.1 0.0 23716 2404 ?Ss 12:53 0:00 /usr/sbi