Re: another bug: system time change
On Fri, Apr 09, 2010 at 10:22:05AM +0200, Wolfgang Hennerbichler wrote: Hi Ondrejs, I think I found another bug. For some mysterious reason our system time jumped forward (more than 30.000 seconds) on one of our route-servers. I don't know why this happened, but I suspect a broken ntp server could have caused this. Nevetheless, this was reason enough for BIRD to drop the BGP peerings: You see the log-entries from the wtachdog, which is run every minute, all of a sudden the time jumps to 10:08 UTC, and BIRD brings down BGP sessions. Are the session hold timers dependent on the system time? That is really strange. On Linux 2.6, we use monotonic timers, which shouldn't be affected by system time change. If monotonic timers are not available (on Linux 2.4), we use system time but we detect time jumps and ignore them. -- Elen sila lumenn' omentielvo Ondrej 'SanTiago' Zajicek (email: santi...@crfreenet.org) OpenPGP encrypted e-mails preferred (KeyID 0x11DEADC3, wwwkeys.pgp.net) To err is human -- to blame it on a computer is even more so. signature.asc Description: Digital signature
Re: another bug: system time change
On Apr 9, 2010, at 12:44 , Arnold Nipper wrote: Is this really a BIRD bug? I guess it is expected behaviour to drop a session if you _think_ you missed keepalives Time Counters within a Daemon should NEVER be dependent on the system time, so yes, I would consider this a bug of BIRD, but it seems that BIRD has implemented this correctly, as Ondrej said. The bug is w/ the underlying OS. Virtualisation and keeping time really seems to be a problem. For this and other reasons we refrain from using a VM for RS. We'll see how this works out. Arnold -- Arnold Nipper / nIPper consulting, Sandhausen, Germany email: arn...@nipper.de phone: +49 6224 9259 299 mobile: +49 172 2650958 fax: +49 6224 9259 333 -- www.vix.at | www.aco.net w...@univie.ac.at | WH844-RIPE Vienna University Computer Center Tel: +43 1 4277-14031 | Fax: -9140
Re: another bug: system time change
On Apr 9, 2010, at 14:08 , Ondrej Zajicek wrote: On Fri, Apr 09, 2010 at 10:49:15AM +0200, Wolfgang Hennerbichler wrote: Hm. Now this is strange indeed. I run linux 2.6.33.1 (amd64) - but it is a virtualized host (with xen). Maybe it was xen's fault, but the logs don't reveal much as you see. Hm. this sucks. If there is a virtualization involved, then a bug in the virtualized monotonic timers is the simplest explanation. yes. might be. damn :) -- Elen sila lumenn' omentielvo Ondrej 'SanTiago' Zajicek (email: santi...@crfreenet.org) OpenPGP encrypted e-mails preferred (KeyID 0x11DEADC3, wwwkeys.pgp.net) To err is human -- to blame it on a computer is even more so. -- www.vix.at | www.aco.net w...@univie.ac.at | WH844-RIPE Vienna University Computer Center Tel: +43 1 4277-14031 | Fax: -9140
RE: another bug: system time change
Hello, This is my first time so sorry if I have not done this right. I have seen this problem before. I have seen this in Linux when the BIOS synchronises its time with the OS. The Linux OS could have been fine and have had a valid NTP source, but periodically there is some form of synchronisation that occurs between the BIOS and the OS and weird things can happen. This is probably nothing to do with bird. PS; A Virtual BIOS does exactly the same thing. Ie. the real BIOS time gets synchronised with all the Virtual BIOS clocks (one for each VM), which each in turn synchronise with their respective guest. Check all your BIOS times (physical BIOS and virtual Guest BIOS'). Hope this helps. Andy. -Original Message- From: owner-bird-us...@atrey.karlin.mff.cuni.cz [mailto:owner-bird-us...@atrey.karlin.mff.cuni.cz] On Behalf Of Wolfgang Hennerbichler Sent: 09 April 2010 09:49 To: Ondrej Zajicek Cc: Bird Users Subject: Re: another bug: system time change On Apr 9, 2010, at 10:49 , Ondrej Zajicek wrote: On Fri, Apr 09, 2010 at 10:22:05AM +0200, Wolfgang Hennerbichler wrote: Hi Ondrejs, I think I found another bug. For some mysterious reason our system time jumped forward (more than 30.000 seconds) on one of our route-servers. I don't know why this happened, but I suspect a broken ntp server could have caused this. Nevetheless, this was reason enough for BIRD to drop the BGP peerings: You see the log-entries from the wtachdog, which is run every minute, all of a sudden the time jumps to 10:08 UTC, and BIRD brings down BGP sessions. Are the session hold timers dependent on the system time? That is really strange. On Linux 2.6, we use monotonic timers, which shouldn't be affected by system time change. If monotonic timers are not available (on Linux 2.4), we use system time but we detect time jumps and ignore them. Hm. Now this is strange indeed. I run linux 2.6.33.1 (amd64) - but it is a virtualized host (with xen). Maybe it was xen's fault, but the logs don't reveal much as you see. Hm. this sucks. -- Elen sila lumenn' omentielvo Ondrej 'SanTiago' Zajicek (email: santi...@crfreenet.org) OpenPGP encrypted e-mails preferred (KeyID 0x11DEADC3, wwwkeys.pgp.net) To err is human -- to blame it on a computer is even more so. -- www.vix.at | www.aco.net w...@univie.ac.at | WH844-RIPE Vienna University Computer Center Tel: +43 1 4277-14031 | Fax: -9140 Monitor Computer Systems Limited Company Registration Number: NI 17805 Registered Office: 3 Pine Crest, Holywood, North Down, Northern Ireland BT18 9ED