On Monday, May 5, 2003, at 05:07 AM, Martin Knoblauch wrote:

Hi,

 today I upgraded one of our clusters from 2.5.1 to 2.5.3 (gmond,
gmetad and the web-frontend). Since then the log-files on the gmetad
node get filled with stuff like:

headnode /usr/sbin/gmetad[22664]: RRD_update: illegal attempt to update
using time 1052131942 when last update time is 1052132440 (mini
mum one second step)

I think I know what this is. We recently changed the RRD update logic to use CLUSTER LOCALTIME
as the rrd timestamp. This was done in 2.5.3 I believe.

Now if your gmetad has a data source which is another gmetad (port 8651), it will try to update its rrds multiple times with the same CLUSTER LOCALTIME. Why? Because gmetad only updates its XML every 20-30s.

So it is possible for your gmetad to attempt to update its rrds twice with the same LOCALTIME timestamp, causing the errors you see in your logs.

This is one of those hard-to-anticipate bugs which occur from unintended side effects to the system. To fix it, I believe we need to use the true localtime when updating rrds for which we are not the "authority" on. (The authority mode is off whenever we get our data from another gmetad).

I will think about this some more, and make the changes to the new gmetad in CVS.

-FDS


 Also, the graphs for anything are flat. The actuals from gmond look OK
though.

Cheers
Martin

=====
------------------------------------------------------
Martin Knoblauch
email: [EMAIL PROTECTED] or [EMAIL PROTECTED]
www:   http://www.knobisoft.de





-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Ganglia-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-developers

Federico

Rocks Cluster Group, SDSC, San Diego, CA


Reply via email to