On 6/25/25 12:09 PM, Boqun Feng wrote:
On Wed, Jun 25, 2025 at 11:52:04AM -0400, Waiman Long wrote:
[...]
+/*
+ * Acquire a hazptr slot and begin the hazard pointer critical section.
+ *
+ * Must be called with preemption disabled, and preemption must remain disabled
+ * until shazptr_clear().
+ */
+static inline struct shazptr_guard shazptr_acquire(void *ptr)
+{
+ struct shazptr_guard guard = {
+ /* Preemption is disabled. */
+ .slot = this_cpu_ptr(&shazptr_slots),
+ .use_wildcard = false,
+ };
+
+ if (likely(!READ_ONCE(*guard.slot))) {
+ WRITE_ONCE(*guard.slot, ptr);
+ } else {
+ guard.use_wildcard = true;
+ WRITE_ONCE(*guard.slot, SHAZPTR_WILDCARD);
+ }
Is it correct to assume that shazptr cannot be used in a mixed context
environment on the same CPU like a task context and an interrupt context
trying to acquire it simultaneously because the current check isn't atomic
with respect to that?
I think the current implementation actually support mixed context usage,
let see (assuming we start in a task context):
if (likely(!READ_ONCE(*guard.slot))) {
if an interrupt happens here, it's fine because the slot is still empty,
as long as the interrupt will eventually clear the slot.
WRITE_ONCE(*guard.slot, ptr);
if an interrupt happens here, it's fine because the interrupt would
notice that the slot is already occupied, hence the interrupt will use a
wildcard, and because it uses a wild, it won't clear the slot after it
returns. However the task context's shazptr_clear() will eventually
clear the slot because its guard's .use_wildcard is false.
} else {
if an interrupt happens here, it's fine because of the same: interrupt
will use wildcard, and it will not clear the slot, and some
shazptr_clear() in the task context will eventually clear it.
guard.use_wildcard = true;
WRITE_ONCE(*guard.slot, SHAZPTR_WILDCARD);
if an interrupt happens here, it's fine because of the same.
}
It's similar to why rcu_read_lock() can be just a non-atomic inc.
You are right.
+
+ smp_mb(); /* Synchronize with smp_mb() at synchronize_shazptr(). */
+
+ return guard;
+}
+
+static inline void shazptr_clear(struct shazptr_guard guard)
+{
+ /* Only clear the slot when the outermost guard is released */
+ if (likely(!guard.use_wildcard))
+ smp_store_release(guard.slot, NULL); /* Pair with ACQUIRE at
synchronize_shazptr() */
+}
Is it better to name it shazptr_release() to be conformant with our current
locking convention?
Maybe, but I will need to think about slot reusing between
shazptr_acquire() and shazptr_release(), in the general hazptr API,
you can hazptr_alloc() a slot, use it and hazptr_clear() and then
use it again, eventually hazptr_free(). I would like to keep both hazptr
APIs consistent as well. Thanks!
Thanks for the explanation. Maybe we can reuse the general hazptr API
names (alloc/clear/free) even though shazptr_free() will be a no-op for
now. Just that the current acquire/clear naming looks odd to me.
Thanks,
Longman