On Wed, 2014-04-30 at 12:00 +0200, Peter Zijlstra wrote:
> On Mon, Apr 28, 2014 at 03:09:01PM -0700, Davidlohr Bueso wrote:
> >  __visible
> >  struct rw_semaphore __sched *rwsem_down_write_failed(struct rw_semaphore 
> > *sem)
> >  {
> > -   long count, adjustment = -RWSEM_ACTIVE_WRITE_BIAS;
> > +   long count;
> >     struct rwsem_waiter waiter;
> >     struct task_struct *tsk = current;
> > +   bool waiting = true;
> > +
> > +   /* undo write bias from down_write operation, stop active locking */
> > +   count = rwsem_atomic_update(-RWSEM_ACTIVE_WRITE_BIAS, sem);
> > +
> > +   /* do optimistic spinning and steal lock if possible */
> > +   if (rwsem_optimistic_spin(sem))
> > +           goto done;
> 
> Why done, why not return? Afaict there's not yet been a change to the
> state.

Right.

> >  
> >     /* set up my own style of waitqueue */
> >     waiter.task = tsk;
> > @@ -204,34 +382,29 @@ struct rw_semaphore __sched 
> > *rwsem_down_write_failed(struct rw_semaphore *sem)
> >  
> >     raw_spin_lock_irq(&sem->wait_lock);
> >     if (list_empty(&sem->wait_list))
> > -           adjustment += RWSEM_WAITING_BIAS;
> > +           waiting = false;
> >     list_add_tail(&waiter.list, &sem->wait_list);
> >  
> >     /* we're now waiting on the lock, but no longer actively locking */
> > -   count = rwsem_atomic_update(adjustment, sem);
> > +   if (waiting)
> > +           count = ACCESS_ONCE(sem->count);
> > +   else
> > +           count = rwsem_atomic_update(RWSEM_WAITING_BIAS, sem);
> > +
> 
> Is there a reason we must delay this? Why not do away with the waiting
> variable and do it where we check the list_empty() ?

Yeah, that would simplify things, afaict.

> 
> If there is a reason -- eg. we must order the list op vs the count op,
> then there's a comment missing.

There is no such reason.

> 
> > -   /* If there were already threads queued before us and there are no
> > +   /*
> > +    * If there were already threads queued before us and there are no
> >      * active writers, the lock must be read owned; so we try to wake
> > -    * any read locks that were queued ahead of us. */
> > -   if (count > RWSEM_WAITING_BIAS &&
> > -       adjustment == -RWSEM_ACTIVE_WRITE_BIAS)
> > +    * any read locks that were queued ahead of us.
> > +    */
> > +   if ((count > RWSEM_WAITING_BIAS) && waiting)
> >             sem = __rwsem_do_wake(sem, RWSEM_WAKE_READERS);
> >  
> >     /* wait until we successfully acquire the lock */
> >     set_task_state(tsk, TASK_UNINTERRUPTIBLE);
> 
> We should really use set_current_state(), there is no way tsk is
> anything other than current, and using set_task_state() implies we're
> changing someone else's state.
> 
> >     while (true) {
> > -           if (!(count & RWSEM_ACTIVE_MASK)) {
> > -                   /* Try acquiring the write lock. */
> > -                   count = RWSEM_ACTIVE_WRITE_BIAS;
> > -                   if (!list_is_singular(&sem->wait_list))
> > -                           count += RWSEM_WAITING_BIAS;
> > -
> > -                   if (sem->count == RWSEM_WAITING_BIAS &&
> > -                       cmpxchg(&sem->count, RWSEM_WAITING_BIAS, count) ==
> > -                                                   RWSEM_WAITING_BIAS)
> > -                           break;
> > -           }
> > -
> > +           if (rwsem_try_write_lock(count, sem))
> > +                   break;
> >             raw_spin_unlock_irq(&sem->wait_lock);
> >  
> >             /* Block until there are no active lockers. */
> > @@ -245,8 +418,8 @@ struct rw_semaphore __sched 
> > *rwsem_down_write_failed(struct rw_semaphore *sem)
> >  
> >     list_del(&waiter.list);
> >     raw_spin_unlock_irq(&sem->wait_lock);
> > +done:
> >     tsk->state = TASK_RUNNING;
> 
> __set_current_state(TASK_RUNNING);
> 
> Also, I would really expect this to be done right after the wait loop,
> not outside of the lock.

Sure.

> > -
> >     return sem;
> >  }
> 
> Otherwise this looks ok I suppose.

Thanks for the review!

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to