
Re: [ntp:questions] Number of Stratum 1 Stratum 2 Peers

2015-02-06 Thread Mike Cook

 snip
 Three are fine, as long as only one dies or goes nuts.
 
 Again, define goes nuts. You don't seem to like the term 
 falseticker, so how do you define goes nuts? If one goes nuts or 
 even goes offline, if the remaining two do not agree then it is like 
 having no server at all.
 
 No, it is like having two, with one being out. 
 falseticker is a term with a very specific internal definition. Thus a
 server whose time is right on UTC could be a falseticker, because the
 other two servers were both exactly 3 days out, with tiny jitter estimates. 
 I would say then that you had two servers going nuts, and one good, even
 though ntpd would say there were two good and one false ticker. 

In fact this does not happen. I just tested the hypothesis.
What happens depends on how the two wayward servers get their exaggerated offset:
a) Someone or something resets the date.
Result: ntpd on both those servers stops due to the panic_stop limit.

  So in this case the client has only one reference and continues using that. 
It is not flagged as a falseticker.
  That is normal.
   
b) Someone restarts ntpd on the servers with the wrong date. Here the servers' 
ntpd has no way of knowing that it has bad time and so continues serving 
normally. 
On the client, the running ntpd immediately sees a huge offset and huge 
jitter.

Tue Dec  9 13:15:04 CET 2014
     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
*192.168.1.15    .GPS1.           1 u  320   64   360   0.549    0.040   0.037
+192.168.1.16    .GPS2.           1 u   37   64   377   0.606    0.006   0.028
+192.168.1.17    .GPS1.           1 u  309   64   360   0.576    0.027   0.025
Tue Dec  9 13:16:08 CET 2014
     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
 192.168.1.15    .GPS1.           1 u   55   64   341   0.565    0.042 9660780
*192.168.1.16    .GPS2.           1 u   37   64   377   0.606    0.006   0.024
 192.168.1.17    .GPS1.           1 u   42   64   341   0.579    0.041 9660773

After 5 minutes the client is unable to resolve this, declares all clocks 
falsetickers, and then panics. I did not have ntpd in debug mode here, but it is 
reasonable to assume that it panics because the selected clock is too far out 
and hits the panic limit.

Tue Dec  9 13:23:37 CET 2014
     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
 192.168.1.15    .GPS1.           1 u   45   64   377   0.596  -255600 155.539
*192.168.1.16    .GPS2.           1 u   25   64   377   0.614    0.024   0.008
 192.168.1.17    .GPS1.           1 u   30   64   377   0.583  -255600  52.806
Tue Dec  9 13:24:41 CET 2014
     remote           refid      st t when poll reach   delay   offset  jitter
==============================================================================
x192.168.1.15    .GPS1.           1 u   43   64   377   0.596  -255600 179.609
x192.168.1.16    .GPS2.           1 u   23   64   377   0.614    0.024   0.008
x192.168.1.17    .GPS1.           1 u   27   64   377   0.618  -255599   6.009
/usr/local/bin/ntpq: read: Connection refused
Tue Dec  9 13:25:45 CET 2014
/usr/local/bin/ntpq: read: Connection refused

This is exactly what happens if the client is restarted.

clock_filter: n 1 off -255599.997967 del 0.000662 dsp 7.937502 jit 0.02
select: endpoint -1 -255600.000806
select: endpoint  1 -255599.995128
select: survivor 192.168.1.17 0.002839
select: combine offset -255599.997967134 jitter 0.0
event at 1 192.168.1.17 903a 8a sys_peer
clock_update: at 1 sample 1 associd 18641
event at 1 0.0.0.0 c617 07 panic_stop -255600 s; set clock manually within 1000 s.
event at 1 0.0.0.0 c61d 0d kern kernel time sync disabled

So ntpd does NOT continue in your test case. Your case may fare better if the time 
difference is less than the panic limit, say if the two servers do not insert a 
leap second but the « correct » one does. I’ll try that for my own 
satisfaction if I can figure out how to do it.
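
A minimal Python sketch of what the restart trace above shows, under loud assumptions: the selection step is reduced to a plain interval-overlap search (no clustering, no weighting), every server gets an assumed 10 ms error bound, and the combined offset is a simple mean. The names select_truechimers and would_panic are hypothetical; only the 1000 s limit comes from the panic_stop message.

```python
PANIC_THRESHOLD_S = 1000.0  # from the log line "set clock manually within 1000 s"


def select_truechimers(sources):
    """Simplified intersection search, not ntpd's actual code.

    sources: list of (name, offset_s, distance_s). Each source claims the
    interval [offset - distance, offset + distance]. Returns the names whose
    interval overlaps the region agreed on by a majority, or [] if none.
    """
    n = len(sources)
    edges = sorted(
        [(off - d, -1) for _, off, d in sources]
        + [(off + d, +1) for _, off, d in sources]
    )
    # Allow f falsetickers, starting at 0, as long as f < n/2.
    for f in range((n + 1) // 2):
        need = n - f
        low = high = None
        count = 0
        for value, kind in edges:            # scan upward for the low edge
            count -= kind                    # lower edge (-1) raises the count
            if count >= need:
                low = value
                break
        count = 0
        for value, kind in reversed(edges):  # scan downward for the high edge
            count += kind                    # upper edge (+1) raises the count
            if count >= need:
                high = value
                break
        if low is not None and high is not None and low < high:
            return [name for name, off, d in sources
                    if off - d <= high and off + d >= low]
    return []


def would_panic(offset_s, threshold_s=PANIC_THRESHOLD_S):
    """True if the offset exceeds the panic threshold."""
    return abs(offset_s) > threshold_s


# Offsets in seconds, rounded from the tables above; distances are assumed.
sources = [
    ("192.168.1.15", -255600.0, 0.01),
    ("192.168.1.16", 0.024, 0.01),
    ("192.168.1.17", -255600.0, 0.01),
]
survivors = select_truechimers(sources)
combined = sum(off for name, off, _ in sources if name in survivors) / len(survivors)
print(survivors, combined, would_panic(combined))
```

With these inputs the two servers that agree on the wrong time form the majority, and the resulting offset is far beyond the panic limit, which is consistent with the client stopping rather than following them.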

Like

 
 
 
 Brian Utterback
 
 ___
 questions mailing list
 questions@lists.ntp.org
 http://lists.ntp.org/listinfo/questions


Re: [ntp:questions] Number of Stratum 1 Stratum 2 Peers

2014-12-22 Thread Martin Burnicki


Phil W Lee wrote:

I believe it is important to allow negative leap seconds again, in
order to allow a dignified recovery from erroneous positive leap
seconds.


I don't think fake negative leap seconds can (and should) be used to 
undo the effect of an erroneously applied positive leap second.


Martin


Re: [ntp:questions] Number of Stratum 1 Stratum 2 Peers

2014-12-18 Thread Brian Inglis


On 2014-12-18 14:27, Phil W Lee wrote:

Martin Burnicki martin.burni...@meinberg.de considered Wed, 17 Dec
2014 10:35:39 +0100 the perfect time to write:

Phil W Lee wrote:

Martin Burnicki martin.burni...@meinberg.de considered Tue, 16 Dec
2014 14:23:15 +0100 the perfect time to write:

Harlan Stenn wrote:

An alternative is that we get enough support to advance NTF's General
Timestamp API, and then we can run systems on either TAI or UTC and
these conversions will happen automatically.

Since timescale files in the GTSAPI are versioned, one could still use
an obsolete leapsecond file, and while those UTC timestamps would be
wrong if a new leapsecond was added, these timestamps would be
correctable when a new version of the UTC timescale file was available.


Hm, that may not really help if the API returns a wrong UTC time stamp
which is then used to set the system time wrong.

The tzdist protocol could also be helpful here to provide the
information required to do the conversion correctly. An expiration date
could be used for versioning.


You don't need an expiry date if you have a version number and/or an
authoritative source for any new version that may be available - you
just compare the two, and use the newest available.


Yes you do. With only a version number consumers like ntpd would not
be able to know if the information is outdated, or not.

Of course, if leap seconds should be abolished it would be useful to
support a pseudo expiration date meaning until further notice.


As long as the IERS stays at the same URL, you could just use their
file at http://hpiers.obspm.fr/eoppc/bul/bulc/Leap_Second.dat
(although it would be useful if that file was more complete, with a
version number and checksum).


This is once more a different file format than the format used by
tzdata, or NIST/NTP. :-(

A service like a tzdist client, or a simple script which might look for
and download updated files, could report an error if the URL is not
reachable, and thus it can't even *check* if a new version of the file
is available.

However, similarly as not every tiny NTP client node should query the
time directly from NIST and similar servers but should use pool servers
instead, not every tiny embedded system should try to download a leap
second file directly from the primary server.


So make the download dependent on the stratum of the ntp server -
mandatory for stratum 1, optional for 2, disabled below that (or some
such system - that's only a suggestion, although obviously I think it
has merit).


If they use secondary servers an older version of the file may be
available, but outdated. No way to check this without an expiration date.

There are companies with a whole (sub-)network without access to the
internet, so it may be required to update DST rules and leap second
information manually. An easy way to do this could be to set up a tzdist
or FTP (or whatever) server which can provide the internal clients with
the update.

If no one cares about those updates then applications like ntpd can
output a warning if the expiration date has been passed. With only a
version information this isn't possible.
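
A short Python sketch of the expiration check described above, assuming a NIST-style leap-seconds.list in which the line starting with "#@" carries the expiration time as seconds since 1900. The function names and the sample text are illustrative only (the sample is not a real published file), and treating a file without an expiration line as expired is a design choice made here, not something the thread specifies.

```python
NTP_TO_UNIX_OFFSET = 2208988800  # seconds from 1900-01-01 to 1970-01-01


def leapfile_expiry_unix(text):
    """Return the expiration time as a Unix timestamp, or None if absent."""
    for line in text.splitlines():
        if line.startswith("#@"):
            return int(line[2:].split()[0]) - NTP_TO_UNIX_OFFSET
    return None


def is_expired(text, now_unix):
    """Treat a file with no expiration line as expired (assumed policy)."""
    expiry = leapfile_expiry_unix(text)
    return expiry is None or now_unix >= expiry


# Illustrative sample in the assumed format.
SAMPLE = (
    "#@\t3644438400\n"
    "3550089600\t35\t# 1 Jul 2012\n"
)

if __name__ == "__main__":
    import time
    if is_expired(SAMPLE, time.time()):
        print("warning: leap second information has expired")
```

A bare version number could tell a client which of two files is newer, but not whether the newest file it holds is still valid, which is the gap the expiration line closes.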


It would also be useful if they used SSL, and changed the url to
https://etc.


Agreed.


Perhaps NTP V5 could support all current leap second file source types,
specification of URIs and file paths for all types; with an optional leap
second packet extension, added only to early association establishment
packets once a server version is known, or whenever a source update is
detected.

The leap second packet extension would be optional only for V  5 or stratum  1,
and give the last/next leap second time announced, the expiration time, now
available in all sources, and source file type, to allow for different
expiration times, unless NIST and IERS agree on expiration times.

If the leap second and expiration times announced agree, the packet extension
returns the same values, otherwise the later leap second time and/or earlier
expiration time, if the leap second time agrees, is adopted by the lower stratum
system and returned in the reply, indicating adoption, then the extension can be
dropped from subsequent packets.
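
One possible reading of the adoption rule in the paragraph above, as a Python sketch. Nothing here is an existing NTP feature: the LeapInfo type, the reconcile function, and the use of plain integer timestamps are all hypothetical, and the rule for which expiration time accompanies a later leap second is an assumption.

```python
from typing import NamedTuple


class LeapInfo(NamedTuple):
    leap_time: int  # time of the last/next announced leap second
    expires: int    # expiration time of the leap second information


def reconcile(local, remote):
    """Return (adopted, agreed) for the lower-stratum system.

    - Same values on both sides: nothing to adopt, agreed is True.
    - Leap times differ: adopt the side announcing the later leap second.
    - Leap times equal, expirations differ: adopt the earlier expiration.
    """
    if local == remote:
        return local, True
    if local.leap_time != remote.leap_time:
        adopted = local if local.leap_time > remote.leap_time else remote
    else:
        adopted = LeapInfo(local.leap_time, min(local.expires, remote.expires))
    return adopted, False  # a disagreement the caller would log


if __name__ == "__main__":
    mine = LeapInfo(leap_time=100, expires=500)
    theirs = LeapInfo(leap_time=200, expires=400)
    print(reconcile(mine, theirs))
```

The boolean in the result is where the logging requirement from the proposal would hook in: any call returning False is a lack of agreement.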

Lack of agreement and/or adoption should always be logged by lower stratum
systems, could optionally be logged by equal stratum systems acting as clients
or peers, and could optionally be logged only once by higher stratum systems.

Other requirements will undoubtedly need to be added to cover all possible
scenarios, including false-leapers, false-expirers, and dropped packets.

--
Take care. Thanks, Brian Inglis

Re: [ntp:questions] Number of Stratum 1 Stratum 2 Peers

2014-12-17 Thread Martin Burnicki

Phil W Lee wrote:

Martin Burnicki martin.burni...@meinberg.de considered Tue, 16 Dec
2014 12:56:15 +0100 the perfect time to write:

William Unruh wrote:

The importance of trades is usually a before/after. And UTC, TAI, GPS all
have exactly the same definition of before and after. Of course if one
time was in UTC and the other in TAI, that could well be successfully
argued.

From a technical point I absolutely agree with you.

However, there have been discussions (IIRC on the TZ or leapsecs mailing
lists) where folks tried to explain that even though the difference
between TAI and UTC is just an integral number of seconds, TAI could not
even be used as a replacement for UTC without leap seconds, just due to
specific wordings in certain documents.

Leap seconds seem to be a real mess in the IT world.
It would be useful if the way of inserting a leap-second was set in a
standard, in such a way that time continued at a set rate (maybe by
slewing at a set percentage or PPM). If that could be achieved, it
would remove many of the objections to leap-seconds.
It might be difficult to thrash out in practice though.

I know that officially at present there is an additional second
between 23:59:60 and 00:00:00, but no time recording system that I
know of has the ability to record times between 23:59:60 and 00:00:00
correctly (despite such times existing since 1st Jan 1961, which must
surely pre-date any software currently in use), which is a necessary
requirement if the second is to be inserted exactly as currently
specified and time continued forward (so that events are recorded in
the correct order).
Does anyone know of any software which will record times during that
additional second accurately, e.g. as 23:59:60.789?

Yes, different software from Meinberg. ;-)

The main problem is that the underlying system time (often POSIX, which
just counts seconds since an epoch) has the *same* time stamp at the
beginning and end of the leap second.

In order to do the conversion correctly you need to know if the current
second is the leap second, or not, i.e. you need some status flag in
addition to the raw (e.g. POSIX) timestamp.

This is basically similar to what you have at the end of DST, where (in
local time) a whole hour is passed twice. You need to know the DST
status to determine unambiguously if it's the first or the second turn.
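
To make the ambiguity concrete, here is a small Python sketch using the leap second at the end of 30 June 2012. It relies on the plain POSIX arithmetic (86400 seconds per day), under which 23:59:60 and the following 00:00:00 map to the same number; which neighbouring second a real system actually repeats varies between implementations. The helper names and the in_leap_second flag are hypothetical and stand in for the status flag mentioned above.

```python
import calendar
import time


def posix_from_utc(year, month, day, hour, minute, second):
    """POSIX-style conversion: every day counts as exactly 86400 seconds."""
    return calendar.timegm((year, month, day, hour, minute, second, 0, 0, 0))


def format_utc(posix_ts, in_leap_second=False):
    """Render a timestamp, using the extra flag to label the leap second."""
    if in_leap_second:
        # The inserted second belongs to the end of the previous minute.
        prev = time.gmtime(posix_ts - 1)
        return time.strftime("%Y-%m-%d %H:%M:", prev) + "60"
    return time.strftime("%Y-%m-%d %H:%M:%S", time.gmtime(posix_ts))


leap = posix_from_utc(2012, 6, 30, 23, 59, 60)
midnight = posix_from_utc(2012, 7, 1, 0, 0, 0)

print(leap == midnight)                        # the raw numbers are identical
print(format_utc(leap, in_leap_second=True))   # 2012-06-30 23:59:60
print(format_utc(midnight))                    # 2012-07-01 00:00:00
```

Without the flag, both calls would produce the second line of output, which is the same situation as the repeated hour at the end of DST.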

Is there any realistic prospect of forcing software to comply with a
time standard which includes times between 23:59:60 and 00:00:00?

Now, you can either stipulate that all software including operating
systems recognise times during that additional second - which would
require re-writing the time functions of most of the world's software
to recognise and record times between 23:59:60 and 00:00:00 (but only
if a leap second has been legally notified), or you can agree to insert
the additional second more gradually by clock slewing (but at what rate
would have to be agreed).
Clock slewing would require much more change to international
agreements, but would require far less work on re-writing software,
and would actually relate better to the real world, which is
annoyingly both analogue AND irregular:-)
Or the second itself could be redefined to take account of the actual
speed the earth rotates - but that might be problematic for the
scientists, as we'd likely have to keep doing it as the earth slows.
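
As a rough illustration of what "at what rate" means for the slewing option, the following Python sketch computes how long a clock must be slewed to absorb one second at a given rate. The 500 PPM default is only an example value and the function names are made up for this sketch; no agreed standard rate is implied.

```python
def slew_duration_s(step_s=1.0, rate_ppm=500.0):
    """Seconds of slewing needed to absorb step_s at rate_ppm."""
    return step_s * 1e6 / rate_ppm


def slewed_so_far(elapsed_s, step_s=1.0, rate_ppm=500.0):
    """Portion of the step already absorbed after elapsed_s of slewing."""
    return min(step_s, max(0.0, elapsed_s) * rate_ppm / 1e6)


if __name__ == "__main__":
    for ppm in (100.0, 500.0, 1000.0):
        print(ppm, "PPM ->", slew_duration_s(rate_ppm=ppm), "s")
```

At 500 PPM a full second takes 2000 s to absorb, so during that window a slewing clock and a clock that stepped at midnight disagree by up to a second, which is why the rate would have to be agreed.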

I certainly think we need to deal with the problem better than we do
at the moment.

A main reason for the problems with leap seconds is that POSIX has just
ignored them when they defined their standards on timekeeping.

There has been a proposal from David Mills many years ago where the time
during an inserted leap second increases only *very* slowly during the
leap second, so monotonicity of time stamps is kept.

However, most algorithms, or API calls returning a time stamp plus
consistent status information require longer execution time than just
returning a time stamp, e.g. just a 64 bit number. So the easy way is
often preferred over the accurate way.

I've collected some information on leap seconds in a paper which you can
find here:

http://www.meinbergglobal.com/english/info/#whitepaper

Especially:
Technical Aspects of Leap Second Propagation and Evaluation
http://www.meinbergglobal.com/download/burnicki/Technical%20Aspects%20of%20Leap%20Second%20Propagation%20and%20Evaluation.pdf
