Sorry for the delay, a lot of tests going on in the meantime...

Am 12.01.2011 18:54, schrieb unruh:
> On 2011-01-12, Heiko Gerstung <heiko.removethistext.gerst...@meinberg.de> 
> wrote:
>> Just an addition:
>>
>> The peerstats file shows the nearly the same time jumps at exactly the same 
>> time. I am still wondering if this is NTP or the
>> Kernel ...
> 
> Probably the hardware. But what kind of kernel are you running?
> Tickless? Or is this virtual OS? 
No, it's a standard Linux 2.6.33 (also tried 2.6.37, same problem) on real 
hardware (Embedded i386 compatible CPU). I tried
HZ=250, HZ=100 and HZ=1000 (tickless does not seem to be something I want to 
try out....). The problem exists with all three
different HZ settings, but the interval changes. The anomalies happen more 
often with the HZ=1000 settings and less often
with the HZ=100 setting.

> Remember, all the kernel has to figure
> out the time is its own interenal counters/oscillators. If they go out,
> or skip, then suddenly your time is out. The time does not seem to be
> jumping (ie is not going from 2us to 2ms between one reading and the
> next) but rather seems suddenly to be drifting fast-- as if the CPU
> suddenly got hot. 
Mmmh, the polling cycle I use is 8s (patched ntpd), and if you look at this 
sample sequence ...:
>>> 55571 1286.037 0.000000000 -85.118 0.000001922 0.000329 3
>>> 55571 1294.037 -0.000228000 -85.230 0.000080528 0.039362 3
>>> 55571 1302.037 0.000035000 -85.213 0.000119531 0.037312 3
... you see that it goes from 0us to -228us and back to +35us.


> Ie, is this correlated with a sudden burst of activity
> of the computer?
I did some testing and fired up dd to max out CPU utilization (dd if=/dev/zero 
of=/dev/null) as well as used some bash
scripting to keep it busy ( while true; do echo $((13**9)) > /dev/null ; done), 
both results in 100% CPU utilization, but
that did not do anything with the offset reported by ntpd. I kept that running 
for a couple of minutes, but no spike or
anything.

I also did not find anything unique/unusual in the system log with a timestamp 
near the "spikes". Will have to continue to
look into the kernel, I guess. Next test: Running NTPD with "disable ntp" to 
see if the anomaly still shows up, independent
of ntpd's actions.

My suspicion is that there is something wrong with the kernel timing, which 
does not affect all platforms and hardware.

Best Regards,
 Heiko

_______________________________________________
questions mailing list
questions@lists.ntp.org
http://lists.ntp.org/listinfo/questions

Reply via email to