On Wed, 25 Jan 2017 16:25:20 +0100
Frederic Weisbecker <fweis...@gmail.com> wrote:

> On Mon, Jan 23, 2017 at 10:44:56AM +0100, Martin Schwidefsky wrote:
> > On Sun, 22 Jan 2017 19:20:13 +0100
> > Frederic Weisbecker <fweis...@gmail.com> wrote:
> >   
> > > The s390 clock has a higher granularity than nanoseconds. 1 nanosec
> > > equals 4.096 in s390 cputime_t. Therefore we leak a remainder while
> > > flushing the cputime through cputime_to_nsecs().
> > > 
> > > For more precision, make sure we keep that remainder on cputime
> > > accumulators for later accounting.
> > > 
> > > Reported-by: Martin Schwidefsky <schwidef...@de.ibm.com>
> > > Cc: Benjamin Herrenschmidt <b...@kernel.crashing.org>
> > > Cc: Paul Mackerras <pau...@samba.org>
> > > Cc: Michael Ellerman <m...@ellerman.id.au>
> > > Cc: Heiko Carstens <heiko.carst...@de.ibm.com>
> > > Cc: Martin Schwidefsky <schwidef...@de.ibm.com>
> > > Cc: Tony Luck <tony.l...@intel.com>
> > > Cc: Fenghua Yu <fenghua...@intel.com>
> > > Cc: Peter Zijlstra <pet...@infradead.org>
> > > Cc: Rik van Riel <r...@redhat.com>
> > > Cc: Thomas Gleixner <t...@linutronix.de>
> > > Cc: Ingo Molnar <mi...@kernel.org>
> > > Cc: Stanislaw Gruszka <sgrus...@redhat.com>
> > > Cc: Wanpeng Li <wanpeng...@hotmail.com>
> > > Signed-off-by: Frederic Weisbecker <fweis...@gmail.com>  
> > 
> > NAK. Good intention but the patch is just broken. with 36 of the 37
> > patches applied all looks good but the last one completely breaks the
> > accounting for s390. This is from an idle system:
> > 
> > top - 10:39:33 up 0 min,  1 user,  load average: 0,00, 0,00, 0,00
> > Tasks: 106 total,   1 running, 105 sleeping,   0 stopped,   0 zombie
> > %Cpu0  :  8,9 us, 21,6 sy,  0,0 ni,  0,0 id,  0,0 wa, 10,8 hi,  4,3 si, 
> > 54,4 st
> > %Cpu1  :  0,0 us, 23,5 sy,  0,0 ni,  0,0 id,  0,0 wa, 19,0 hi, 13,1 si, 
> > 44,3 st
> > %Cpu2  :  0,0 us, 30,3 sy,  0,0 ni,  0,0 id,  0,0 wa, 14,7 hi, 14,8 si, 
> > 40,2 st
> > KiB Mem :  1009304 total,   818808 free,    57284 used,   133212 buff/cache
> > KiB Swap:  1048556 total,  1048556 free,        0 used.   917356 avail Mem  
> 
> Oh ok. I must have done something wrong.
> 
> > 
> > There is another issue that affects precision, there is no s390 specific
> > version of cputime_to_nsecs. The generic version uses cputime_to_usecs
> > and mulitplies by 1000 to get nano-seconds. That already looses precision.  
> 
> That's right. And that's the point of this patch. I'm not sure we can have a
> more precise version of cputime_to_nsecs() if 1 nsec == 4.096 cputime_t

I am thinking about a version that does the calculation (cputime * 1000) >> 12
in several steps, like this

  nsecs = (((((cputime * 5) >> 3) * 5) >> 3) * 5) >> 3

To multiply with 5 is a shift by 2 and an add. In total that would gives us
6 shifts and 3 adds for the conversion.

-- 
blue skies,
   Martin.

"Reality continues to ruin my life." - Calvin.

Reply via email to