Oleg,

On Sun, 2005-08-21 at 14:59 +0400, Oleg Nesterov wrote:
> CPU_0                                 CPU_1
> 
> release_task:                         posix_timer_fn:
>       write_lock(tasklist);                   lock(timer->it_lock);
> 
>       exit_timers:                            send_sigxxx();
>               lock(timer->it_lock)                    read_lock(tasklist);
>                               
> Deadlock.
> 
> Dear cc-list, what do you think?

The patch below on top of your patch should solve this. We don't need
tasklist_lock to check p->flags. As you pointed out p cannot be invalid
in send_sigqueue as it's protected by get_task_struct() in
create_timer()

For send_group_sigqueue it's protected by exit_itimers() waiting for
k_itimer.it_lock.

It still does not solve the ugly dependency on tasklist_lock but at
least the race and the deadlock are fixed.

tglx

Signed-off-by: Thomas Gleixner <[EMAIL PROTECTED]>


--- linux-2.6.13-rc6.signal/kernel/signal.oleg.c        2005-08-21 
23:07:03.000000000 +0200
+++ linux-2.6.13-rc6.signal/kernel/signal.c     2005-08-21 23:09:07.000000000 
+0200
@@ -1381,11 +1381,14 @@ send_sigqueue(int sig, struct sigqueue *
        int ret = 0;
 
        BUG_ON(!(q->flags & SIGQUEUE_PREALLOC));
-       read_lock(&tasklist_lock);
 
-       if (unlikely(p->flags & PF_EXITING)) {
-               ret = -1;
-               goto out_err;
+retry:
+       if (unlikely(p->flags & PF_EXITING))
+               return -1;
+
+       if ((unlikely(!read_trylock(&tasklist_lock))) {
+               cpu_relax();
+               goto retry;
        }
 
        spin_lock_irqsave(&p->sighand->siglock, flags);
@@ -1414,7 +1417,6 @@ send_sigqueue(int sig, struct sigqueue *
 
 out:
        spin_unlock_irqrestore(&p->sighand->siglock, flags);
-out_err:
        read_unlock(&tasklist_lock);
 
        return ret;
@@ -1427,7 +1429,14 @@ send_group_sigqueue(int sig, struct sigq
        int ret = 0;
 
        BUG_ON(!(q->flags & SIGQUEUE_PREALLOC));
-       read_lock(&tasklist_lock);
+retry:
+       if (unlikely(p->flags & PF_EXITING))
+               return -1;
+
+       if (unlikely(!read_trylock(&tasklist_lock))) {
+               cpu_relax();
+               goto retry;
+       }
        spin_lock_irqsave(&p->sighand->siglock, flags);
        handle_stop_signal(sig, p);
 
--- linux-2.6.13-rc6.signal/kernel/posix-timers.oleg.c  2005-08-21 
23:09:58.000000000 +0200
+++ linux-2.6.13-rc6.signal/kernel/posix-timers.c       2005-08-21 
23:19:42.000000000 +0200
@@ -501,7 +501,8 @@ static void posix_timer_fn(unsigned long
                        remove_from_abslist(timr);
                }
 
-               if (posix_timer_event(timr, si_private))
+               /* Do not rearm the timer, when we are exiting */
+               if (posix_timer_event(timr, si_private) > 0)
                        /*
                         * signal was not sent because of sig_ignor
                         * we will not get a call back to restart it AND


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Reply via email to