In rwsem_wake(), we can safely check the wait_list to see if waiters
are present without lock when there are spinners to fall back on in
case we miss a waiter.  The advantage is that we can save a pair of
spin_lock/unlock calls when the wait_list is empty. This translates
to a reduction in latency and hence slightly better performance.

The raw_spin_trylock_irqsave() call is also being replaced by a
raw_spin_is_locked() to reduce the overhead of irqsave/irqrestore
especially when the trylock fails.

On a 2-socket 36-core x86-64 E5-2699 v3 system, a rwsem microbenchmark
was run with 36 locking threads doing 1 million writer lock/unlock
operations each, the resulting locking rates (avg of 3 runs) on a
4.11-rc1 based kernel were 4,923 Mop/s and 5,136 Mop/s without and
with the patch respectively. That was an increase of about 4%.

On the same system, a 36-thread fio direct IO random write test to
the same 2GB file on a XFS formatted ramdisk was run. The aggregated
bandwidth was 1564.4 MB/s and 1593.0 MB/s before and after the
patch. That was an increase of about 2%.

Signed-off-by: Waiman Long <long...@redhat.com>
---
 kernel/locking/rwsem-xadd.c | 16 +++++++++++-----
 1 file changed, 11 insertions(+), 5 deletions(-)

diff --git a/kernel/locking/rwsem-xadd.c b/kernel/locking/rwsem-xadd.c
index 34e727f..f31dd61b 100644
--- a/kernel/locking/rwsem-xadd.c
+++ b/kernel/locking/rwsem-xadd.c
@@ -587,7 +587,7 @@ struct rw_semaphore *rwsem_wake(struct rw_semaphore *sem)
 
        /*
         * If a spinner is present, it is not necessary to do the wakeup.
-        * Try to do wakeup only if the trylock succeeds to minimize
+        * Try to do wakeup only if the wait_lock is free to minimize
         * spinlock contention which may introduce too much delay in the
         * unlock operation.
         *
@@ -595,7 +595,7 @@ struct rw_semaphore *rwsem_wake(struct rw_semaphore *sem)
         *    ---------------           -----------------------
         * [S]   osq_unlock()           [L]   osq
         *       MB                           RMB
-        * [RmW] rwsem_try_write_lock() [RmW] spin_trylock(wait_lock)
+        * [RmW] rwsem_try_write_lock() [RmW] spin_is_locked(wait_lock)
         *
         * Here, it is important to make sure that there won't be a missed
         * wakeup while the rwsem is free and the only spinning writer goes
@@ -611,12 +611,18 @@ struct rw_semaphore *rwsem_wake(struct rw_semaphore *sem)
                 * state is consulted before reading the wait_lock.
                 */
                smp_rmb();
-               if (!raw_spin_trylock_irqsave(&sem->wait_lock, flags))
+
+               /*
+                * Normally checking wait_list without wait_lock isn't safe
+                * as we may miss an incoming waiter. With spinners present,
+                * however, we have someone to fall back on in case that
+                * happens.
+                */
+               if (list_empty(&sem->wait_list) ||
+                   raw_spin_is_locked(&sem->wait_lock))
                        return sem;
-               goto locked;
        }
        raw_spin_lock_irqsave(&sem->wait_lock, flags);
-locked:
 
        if (!list_empty(&sem->wait_list))
                __rwsem_mark_wake(sem, RWSEM_WAKE_ANY, &wake_q);
-- 
1.8.3.1

Reply via email to