Am 9/28/2024 um 11:15 PM schrieb Alan Stern:
On Sat, Sep 28, 2024 at 11:55:22AM -0400, Mathieu Desnoyers wrote:
On 2024-09-28 17:49, Alan Stern wrote:
Isn't it true that on strongly ordered CPUs, a compiler barrier is
sufficient to prevent the rcu_dereference() problem?  So the whole idea
behind ptr_eq() is that it prevents the problem on all CPUs.

Correct. But given that we have ptr_eq(), it's good to show how it
equally prevents the compiler from reordering address-dependent loads
(comparison with constant) *and* prevents the compiler from using
one pointer rather than the other (comparison between two non-constant
pointers) which affects speculation on weakly-ordered CPUs.

I don't see how these two things differ from each other.  In the
comparison-with-a-constant case, how is the compiler reordering
anything?  Isn't it just using the constant address rather than the
loaded pointer and thereby breaking the address dependency?

I also currently don't see any major difference between the constant and register case. The point is that the address is known before loading into b, and hence the compiler + hardware can speculatively load *b before loading into b.

The only difference is how far before loading into b the address is known.

Best wishes,
  jonas


Reply via email to