On Thu, Apr 11, 2013 at 07:03:58AM -0500, Robin Holt wrote:
> On Thu, Apr 11, 2013 at 02:00:27PM +0200, Ingo Molnar wrote:
> > 
> > * Robin Holt <h...@sgi.com> wrote:
> > 
> > > > Ok, so it looks profilable.
> > > > 
> > > > The result above is not surprising: most CPUs sit in idle and don't do 
> > > > anything, 
> > > > while the loop goes on, right?
> > > > 
> > > > The interesting thing to profile would be the parallel bring-down, with 
> > > > the 
> > > > simplest global lock solution you mentioned. In that case most CPUs 
> > > > should be 
> > > > doing 'something' all the time - maybe spinning on the lock, maybe 
> > > > something else, 
> > > > right?
> > > 
> > > Again, mostly looks idle.
> > 
> > Forgot to suggest:
> > 
> >   perf record -a /sbin/reboot
> 
> I used perf record -a /sbin/reboot -f -d -n

OK.  Looking at Russ' patch, I understand now why it is looking idle.
We are still serially doing the DOWN_PREPARE, etc.  All those other cpus
are still sitting idle.

Can we call the __cpu_down functions from an smp_call_function()?

Robin
> 
> Robin
> > 
> > ... to capture remote CPU activity too.
> > 
> > > Events: 5M cycles
> > >     31.69%          swapper  [kernel.kallsyms]        [k] 
> > > update_cfs_rq_blocked_load
> > >     14.22%          swapper  [kernel.kallsyms]        [k] load_balance
> > >     12.95%          swapper  [kernel.kallsyms]        [k] ktime_get
> > >      4.64%          swapper  [kernel.kallsyms]        [k] idle_cpu
> > >      3.46%          swapper  [kernel.kallsyms]        [k] uv_read_rtc
> > >      2.26%          swapper  [kernel.kallsyms]        [k] 
> > > ktime_get_update_offsets
> > >      2.25%          swapper  [kernel.kallsyms]        [k] 
> > > rcu_check_callbacks
> > >      1.72%          swapper  [kernel.kallsyms]        [k] 
> > > _raw_spin_lock_irqsave
> > >      1.57%          swapper  [kernel.kallsyms]        [k] 
> > > native_write_msr_safe
> > >      1.53%          swapper  [kernel.kallsyms]        [k] native_safe_halt
> > >      1.52%          swapper  [kernel.kallsyms]        [k] 
> > > apic_timer_interrupt
> > >      1.52%          swapper  [kernel.kallsyms]        [k] 
> > > update_blocked_averages
> > >      1.51%          swapper  [kernel.kallsyms]        [k] 
> > > __lock_text_start
> > >      1.48%          swapper  [kernel.kallsyms]        [k] 
> > > rcu_process_gp_end
> > >      1.40%          swapper  [kernel.kallsyms]        [k] 
> > > rcu_process_callbacks
> > >      1.19%           reboot  [kernel.kallsyms]        [k] 
> > > kmem_cache_alloc_node
> > >      0.63%          swapper  [kernel.kallsyms]        [k] 
> > > check_for_new_grace_period
> > >      0.58%          swapper  [kernel.kallsyms]        [k] 
> > > rebalance_domains
> > >      0.55%          swapper  [kernel.kallsyms]        [k] cpumask_next_and
> > >      0.54%          swapper  [kernel.kallsyms]        [k] 
> > > __tick_nohz_idle_enter
> > >      0.53%          swapper  [kernel.kallsyms]        [k] 
> > > perf_adjust_freq_unthr_context
> > >      0.49%          swapper  [kernel.kallsyms]        [k] _raw_spin_lock
> > 
> > If even perf record -a shows a mostly idle system, then the overhead must 
> > be in 
> > sleep/wakeup latencies - for that the next step would be to figure out 
> > where all 
> > the waiting happens, for example via call-graph context-switch profiling:
> > 
> >    perf stat --null perf record -a -g -e sched:sched_switch /sbin/reboot
> > 
> > (the perf stat --null will tell us the runtime of the whole operation.)
> > 
> > Thanks,
> > 
> >     Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to