On Wed, Jan 18, 2017 at 10:17:38AM -0200, Marcelo Tosatti wrote:
> On Tue, Jan 17, 2017 at 04:36:21PM +0100, Radim Krcmar wrote:
> > 2017-01-17 09:30-0200, Marcelo Tosatti:
> > > On Tue, Jan 17, 2017 at 09:03:27AM +0100, Miroslav Lichvar wrote:
> > >> On Mon, Jan 16, 2017 at 06:01:14PM -0200, Marcelo Tosatti wrote:
> > >> > On Mon, Jan 16, 2017 at 05:47:15PM -0200, Marcelo Tosatti wrote:
> > >> > > On Mon, Jan 16, 2017 at 05:36:55PM -0200, Marcelo Tosatti wrote:
> > >> > > > Sorry, unless i am misunderstanding how this works, it'll get the 
> > >> > > > guest clock
> > >> > > > 2us behind, which is something not wanted.
> > >> > > > 
> > >> > > > Miroslav, if ->gettime64 returns the host realtime at 2us in the 
> > >> > > > past, 
> > >> > > > this means Chrony will sync the guest clock to
> > >> > > > 
> > >> > > > host realtime - 2us
> > >> > > > 
> > >> > > > Is that correct?
> > >> 
> > >> Probably. It depends on the error of both host and guest timestamps.
> > >> If the error is the same on both sides, it will cancel out. An
> > >> occasional spike in the delay shouldn't be a problem as the reading
> > >> will be filtered out, but for best accuracy it's necessary that the
> > >> host's timestamp is taken in the middle between the guest's
> > >> timestamps.
> > > 
> > > The problem is that spikes can be far from occasional: it depends on 
> > > activity of
> > > the host CPU and interrupts. Whose delay can be "intermittent": as long
> > > as interrupts are being sent to the host CPU, for example, the delay
> > > will be high (which can last minutes).
> > > 
> > > The TSC reading in the guest KVM PTP driver corrects for that delay.
> > > 
> > >> Users of the PTP_SYS_OFFSET ioctl assume that (ts[0]+ts[2])/2
> > >> corresponds to ts[1], (ts[2]+ts[4])/2 corresponds to ts[3], and so on.
> > >> 
> > >>                     ts[1]     ts[3]
> > >> Host time    ---------+---------+........
> > >>                       |         |
> > >>                       |         |
> > >> Guest time   ----+---------+---------+......
> > >>                 ts[0]    ts[2]     ts[4]
> > 
> > KVM PTP delay moves host ts[i] to be close to guest ts[i+1] and makes
> > the offset very consistent, so the graph would look like:
> > 
> >                         ts[1]     ts[3]
> > Host time    -------------+---------+........
> >                           |         |
> >                           |         |
> > Guest time   ----+---------+---------+......
> >                 ts[0]    ts[2]     ts[4]
> > 
> > which doesn't sound good if users assume that the host reading is in the
> > middle -- the guest time would be ahead of the host time.
> > 
> > I'm wondering why is the PTP precision around 10ns, when the hypercall
> > takes around 2-3k cycles.  Have you measured the guest<->host offset by
> > getting the output of the hypercall, i.e.
> >   {host_sec @ tsc, host_nsec @ tsc, tsc}
> > and comparing it with guest time computed from the same tsc, i.e.
> >   {guest_sec @ tsc, guest_nsec @ tsc}
> > ?
> > 
> > Thanks.
> 
> Testcase: run a guest and a loop sending SIGUSR1 to vcpu0 (emulating
> intense interrupts). Follows results:
> 
> Without TSC delta calculation:
> =============================
> 
> #* PHC0                          0   3   377     2    -99ns[ +206ns] +/-  
> 116ns
> #* PHC0                          0   3   377     8   +202ns[ +249ns] +/-  
> 111ns
> #* PHC0                          0   3   377     8   -213ns[ +683ns] +/-   
> 88ns
> #* PHC0                          0   3   377     6    +77ns[ +319ns] +/-   
> 56ns
> #* PHC0                          0   3   377     4   -771ns[-1029ns] +/-   
> 93ns
> #* PHC0                          0   3   377    10    -49ns[  -58ns] +/-  
> 121ns
> #* PHC0                          0   3   377     9   +562ns[ +703ns] +/-  
> 107ns
> #* PHC0                          0   3   377     6     -2ns[   -3ns] +/-   
> 94ns
> #* PHC0                          0   3   377     4   +451ns[ +494ns] +/-  
> 138ns
> #* PHC0                          0   3   377    11    -67ns[  -74ns] +/-  
> 113ns
> #* PHC0                          0   3   377     8   +244ns[ +264ns] +/-  
> 119ns
> #* PHC0                          0   3   377     7   -696ns[ -890ns] +/-   
> 89ns
> #* PHC0                          0   3   377     4   +468ns[ +560ns] +/-  
> 110ns
> #* PHC0                          0   3   377    11   -310ns[ -430ns] +/-   
> 72ns
> #* PHC0                          0   3   377     9   +189ns[ +298ns] +/-   
> 54ns
> #* PHC0                          0   3   377     7   +594ns[ +473ns] +/-   
> 96ns
> #* PHC0                          0   3   377     5   +151ns[ +280ns] +/-   
> 71ns
> #* PHC0                          0   3   377    10   -590ns[ -696ns] +/-   
> 94ns
> #* PHC0                          0   3   377     8   +415ns[ +526ns] +/-   
> 74ns
> #* PHC0                          0   3   377     6  +1381ns[+1469ns] +/-  
> 101ns
> #* PHC0                          0   3   377     4   +571ns[+1304ns] +/-   
> 54ns
> #* PHC0                          0   3   377     8     -5ns[  +71ns] +/-  
> 139ns
> #* PHC0                          0   3   377     7   -247ns[ -502ns] +/-   
> 69ns
> #* PHC0                          0   3   377     5   -283ns[ +879ns] +/-   
> 73ns
> #* PHC0                          0   3   377     3   +148ns[ -109ns] +/-   
> 61ns
> 
> With TSC delta calculation:
> ============================
> 
> #* PHC0                          0   3   377     7   +379ns[ +432ns] +/-   
> 53ns
> #* PHC0                          0   3   377     9   +106ns[ +420ns] +/-   
> 42ns
> #* PHC0                          0   3   377     7    -58ns[ -136ns] +/-   
> 62ns
> #* PHC0                          0   3   377    12    +93ns[  -38ns] +/-   
> 64ns
> #* PHC0                          0   3   377     8    +84ns[ +107ns] +/-   
> 69ns
> #* PHC0                          0   3   377     3    -76ns[ -103ns] +/-   
> 52ns
> #* PHC0                          0   3   377     7    +52ns[  +63ns] +/-   
> 50ns
> #* PHC0                          0   3   377    11    +29ns[  +31ns] +/-   
> 70ns
> #* PHC0                          0   3   377     7    -47ns[  -56ns] +/-   
> 42ns
> #* PHC0                          0   3   377    10    -35ns[  -42ns] +/-   
> 33ns
> #* PHC0                          0   3   377     7    -32ns[  -34ns] +/-   
> 42ns
> #* PHC0                          0   3   377    11   -172ns[ -173ns] +/-  
> 118ns
> #* PHC0                          0   3   377     6    +65ns[  +76ns] +/-   
> 23ns
> #* PHC0                          0   3   377     9    +18ns[  +23ns] +/-   
> 37ns
> #* PHC0                          0   3   377     6    +41ns[  -60ns] +/-   
> 30ns
> #* PHC0                          0   3   377    10    +39ns[ +183ns] +/-   
> 42ns
> #* PHC0                          0   3   377     6    +50ns[ +102ns] +/-   
> 86ns
> #* PHC0                          0   3   377    11    +50ns[  +75ns] +/-   
> 52ns
> #* PHC0                          0   3   377     6    +50ns[ +116ns] +/-  
> 100ns
> #* PHC0                          0   3   377    10    +46ns[  +65ns] +/-   
> 79ns
> #* PHC0                          0   3   377     7    -38ns[  -51ns] +/-   
> 29ns
> #* PHC0                          0   3   377    10    -11ns[  -12ns] +/-   
> 32ns
> #* PHC0                          0   3   377     7    -31ns[  -32ns] +/-   
> 99ns
> #* PHC0                          0   3   377    10   +222ns[ +238ns] +/-   
> 58ns
> #* PHC0                          0   3   377     6   +185ns[ +207ns] +/-   
> 39ns
> #* PHC0                          0   3   377    10   -392ns[ -394ns] +/-  
> 118ns
> #* PHC0                          0   3   377     6     -9ns[  -50ns] +/-   
> 35ns
> #* PHC0                          0   3   377    10   -346ns[ -355ns] +/-  
> 111ns
> 
> 
> Do you still want to drop it in favour of simplicity?
> 

This is the output of "chronyc sources". See section "Time sources"
of https://chrony.tuxfamily.org/doc/2.4/chronyc.html.


Reply via email to