On Mon, Sep 09, 2013 at 04:21:55PM +0200, Peter Zijlstra wrote:
> On Mon, Sep 09, 2013 at 06:23:43AM -0700, Paul E. McKenney wrote:
> > Peter, in the general case, you are quite correct.  But this is a special
> > case where it really does work.
> > 
> > The key point here is that preemption and migration cannot move a task
> > from a CPU to which RCU is paying attention to a CPU that RCU is ignoring.
> 
> But there's no constraint placed on the migration mask (aka
> task_struct::cpus_allowed) and therefore it can move it thusly.
> 
> What you're trying to say is that by the time the task is running on
> another cpu, that cpu's state will match the state of the previous cpu,
> no?

Yep!  Might be a better way to put it as well.

> > So yes, by the time the task sees the return value from rcu_is_cpu_idle(),
> > that task might be running on some other CPU.  But that is OK, because
> > if RCU was paying attention to the old CPU, then RCU must also be paying
> > attention to the new CPU.
> 
> OK, fair enough.
> 
> > Here is an example of how this works:
> > 
> > 1.  Some task running on a CPU 0 (which RCU is paying attention to)
> >     calls rcu_is_cpu_idle(), which disables preemption, checks the
> >     per-CPU variable, sets ret to zero, then enables preemption.
> > 
> >     At this point, the task is preempted by some high-priority task.
> > 
> > 2.  CPU 1 is currently idle, so RCU is -not- paying attention to it.
> >     However, it is decided that our low-priority task should migrate
> >     to CPU 1.
> > 
> > 3.  CPU 1 is sent an IPI, which forces this CPU out of idle.  This
> >     causes rcu_idle_exit() to be called, which causes RCU to start
> >     paying attention to CPU 1.
> 
> Just a nit, we typically try to avoid using IPIs to wake idle CPUs,
> doesn't change the story much though.

K, if I get to this level of detail in the comments, I will leave IPIs
out, and just say that the CPU is forced out of idle.

> > 4.  CPU 1 switches to the low-priority task, which now sees the
> >     return value of rcu_is_cpu_idle().  Now, this return value did
> >     in fact reflect the old state of CPU 0, and the state of CPU 0
> >     might have changed.  (For example, the high-priority task might
> >     have blocked, so that CPU 0 is now idle, which in turn would
> >     mean that RCU is no longer paying attention to it, so that
> >     if rcu_is_cpu_idle() was called right now, it would return
> >     true rather than the false return computed in step 1 above.)
> > 
> > 5.  But that is OK.  Because of the way RCU and idle interact,
> >     if a call from a given task to rcu_is_cpu_idle() returned false
> >     some time in the past, a call from that same task will also
> >     return false right now.
> > 
> > So yes, in general it is wrong to disable preemption, grab the value
> > of a per-CPU variable, re-enable preemption, and then return the result.
> > But there are a number of special cases where it is OK, and this is
> > one of them.
> 
> Right, worthy of comments though :-)

No argument there!

Now if we can agree on the naming and the exact per-CPU incantation...  ;-)

                                                        Thanx, Paul

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to