Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-10-04 Thread John Hughes

On 28/09/11 11:09, John Hughes wrote:



I think you're right about the problems now being userspace.

I feel that the hang after thaw problems I'm seeing are caused by 
krb5 - it seems to like to get itself into a situation where it has no 
krb5 ticket so it can't access my home directory which prevents the 
unlock window popping up so it can't get my password and ask for a new 
krb5 ticket.


If I log on on another screen or serial terminal then the unlock 
dialogue pops up.


I'm going to continue testing today but so far my feeling is that the 
patch works as advertised - nfs4 shares no longer prevent suspend from 
working.


I think my hangs (system not restarting from freeze, especially when 
first starting in the morning) are probably related to the situation 
described in Ubuntu bug https://bugs.launchpad.net/ubuntu/+bug/794112 
and have nothing to do with Jeff Layton's freezer patches.


I've seen the error message described in the Ubuntu bug: "Error: state 
manager encountered RPCSEC_GSS session expired against NFSv4 server".  
The bug report hints at a userspace cause, and I'm running a 3.1 kernel 
with a Debian squeeze userspace.


I'll update the nfs utils and see whether that improves things.



--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-28 Thread John Hughes

On 27/09/11 20:27, Jeff Layton wrote:

On Mon, 26 Sep 2011 14:31:32 -0400
Jeff Layton jlay...@redhat.com wrote:

   


Not sure what happened here. Looks like the freeze phase of the
suspend worked. My inclination is to think that this is not due to my
patch.

What might be helpful is to try suspending without NFS being mounted.
If that doesn't work either, then maybe back out the patch and try
again without any NFS mounts.
 

Yeah, after looking over the code I really think the instant wakeup
thing is an unrelated problem. This patchset is really intended to fix
the problem where the freezer is unable to freeze tasks that are stuck
waiting on replies from NFS and CIFS servers.

If you aren't seeing these sorts of errors when trying to suspend, then
this is probably a different problem:

  https://bugzilla.redhat.com/show_bug.cgi?id=717735#c0

It would be nice to have some successful tests with this set though
before I propose it, but that might not be possible given the general
fragility of the userspace suspend code :-/
   

I'm testing away.

I think you're right about the problems now being userspace.

I feel that the hang after thaw problems I'm seeing are caused by krb5 
- it seems to like to get itself into a situation where it has no krb5 
ticket so it can't access my home directory which prevents the unlock 
window popping up so it can't get my password and ask for a new krb5 ticket.


If I log on on another screen or serial terminal then the unlock 
dialogue pops up.


I'm going to continue testing today but so far my feeling is that the 
patch works as advertised - nfs4 shares no longer prevent suspend from 
working.



John would you be able to test the things I mentioned in my prior
reply? In particular, seeing whether you're able to suspend
consistently without nfs mounts and without those patches in place? If
that doesn't work either, then I'll be inclined to think that this is
unrelated.
   


So far (I'll confirm tomorrow)

1. I can suspend without nfs mounts
2. I can suspend with nfs mounts if they are not my home directory
3. suspending with nfs mounts of the home directory using
echo mem > /sys/power/state works
4. suspending with nfs mounts of the home directory using pm-suspend 
usually works
5. suspending with nfs mounts of the home directory using gnome 
shutdown/suspend menu item often hangs.
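
The distinction drawn in the list above, whether the home directory itself sits on an NFS mount, can be checked mechanically. A minimal Python sketch, assuming the standard /proc/mounts field layout (device, mount point, filesystem type, ...); the function name nfs_mount_for is hypothetical and does not come from this thread:

```python
import os

# Filesystem types treated as NFS (assumption: NFSv4 mounts show up as "nfs4").
NFS_TYPES = {"nfs", "nfs4"}


def nfs_mount_for(path, mounts_text):
    """Return the NFS mount point that contains `path`, or None.

    `mounts_text` is text in /proc/mounts format. The longest matching
    mount point wins; on a tie the later entry wins (overmounts).
    """
    path = os.path.normpath(path)
    best, best_type = None, None
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip malformed lines
        mp, fstype = os.path.normpath(fields[1]), fields[2]
        # Match the mount point itself or anything below it.
        if path == mp or path.startswith(mp.rstrip("/") + "/"):
            if best is None or len(mp) >= len(best):
                best, best_type = mp, fstype
    return best if best_type in NFS_TYPES else None
```

On a live system one might call it as nfs_mount_for(os.path.expanduser("~"), open("/proc/mounts").read()); a non-None result corresponds to cases 3 to 5.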



The problems are much more visible on my klunky old Optiplex GX270 
systems (2.6GHz P4) than my nice new Optiplex 390 (Core I3).








Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-28 Thread Jeff Layton
On Wed, 28 Sep 2011 11:09:33 +0200
John Hughes j...@calva.com wrote:

 On 27/09/11 20:27, Jeff Layton wrote:
  On Mon, 26 Sep 2011 14:31:32 -0400
  Jeff Layton jlay...@redhat.com wrote:
 
 
 
  Not sure what happened here. Looks like the freeze phase of the
  suspend worked. My inclination is to think that this is not due to my
  patch.
 
  What might be helpful is to try suspending without NFS being mounted.
  If that doesn't work either, then maybe back out the patch and try
  again without any NFS mounts.
   
  Yeah, after looking over the code I really think the instant wakeup
  thing is an unrelated problem. This patchset is really intended to fix
  the problem where the freezer is unable to freeze tasks that are stuck
  waiting on replies from NFS and CIFS servers.
 
  If you aren't seeing these sorts of errors when trying to suspend, then
  this is probably a different problem:
 
https://bugzilla.redhat.com/show_bug.cgi?id=717735#c0
 
  It would be nice to have some successful tests with this set though
  before I propose it, but that might not be possible given the general
  fragility of the userspace suspend code :-/
 
 I'm testing away.
 
 I think you're right about the problems now being userspace.
 
 I feel that the hang after thaw problems I'm seeing are caused by krb5 
 - it seems to like to get itself into a situation where it has no krb5 
 ticket so it can't access my home directory which prevents the unlock 
 window popping up so it can't get my password and ask for a new krb5 ticket.
 

Maybe... krb5 tickets are usually stored in /tmp, so it's probably not
the lack of that. It is however likely that the GUI requires access to
some files in your homedir, which might be problematic. Still though,
it seems like as long as you have a valid ticket, the resume ought to
work (assuming that rpc.gssd is back up and running).

 If I log on on another screen or serial terminal then the unlock 
 dialogue pops up.
 

That certainly sounds like you're needing to reestablish your
credcache. Are you putting the krb5 credcache in the keyring or
memory or something?
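
For reference, the credential cache type asked about here is normally encoded in the KRB5CCNAME environment variable as TYPE:residual. A rough Python sketch, assuming MIT krb5 conventions and assuming the traditional default of a file cache at /tmp/krb5cc_<uid> when the variable is unset; describe_ccache is a hypothetical helper, not part of any Kerberos library:

```python
def describe_ccache(ccname, uid):
    """Split a KRB5CCNAME-style value into (cache_type, residual).

    Assumption: an unset value means the traditional file cache in /tmp,
    and a value without an upper-case TYPE: prefix is a plain file path.
    """
    if not ccname:
        return ("FILE", "/tmp/krb5cc_%d" % uid)
    prefix, sep, residual = ccname.partition(":")
    if sep and prefix.isalpha() and prefix.isupper():
        return (prefix, residual)
    return ("FILE", ccname)
```

Called as describe_ccache(os.environ.get("KRB5CCNAME"), os.getuid()), a FILE result under /tmp would match the first guess above, while KEYRING or MEMORY would point at the second.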

 I'm going to continue testing today but so far my feeling is that the 
 patch works as advertised - nfs4 shares no longer prevent suspend from 
 working.
 
  John would you be able to test the things I mentioned in my prior
  reply? In particular, seeing whether you're able to suspend
  consistently without nfs mounts and without those patches in place? If
  that doesn't work either, then I'll be inclined to think that this is
  unrelated.
 
 
 So far (I'll confirm tomorrow)
 
 1. I can suspend without nfs mounts
 2. I can suspend with nfs mounts if they are not my home directory
 3. suspending with nfs mounts of the home directory using
  echo mem > /sys/power/state works
 4. suspending with nfs mounts of the home directory using pm-suspend 
 usually works
 5. suspending with nfs mounts of the home directory using gnome 
 shutdown/suspend menu item often hangs.
 
 
 The problems are much more visible on my klunky old Optiplex GX270 
 systems (2.6GHz P4) than my nice new Optiplex 390 (Core I3).
 


Nice, thanks for testing it. I'll go ahead and send these out later
today.

-- 
Jeff Layton jlay...@redhat.com






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-27 Thread Jeff Layton
On Mon, 26 Sep 2011 14:31:32 -0400
Jeff Layton jlay...@redhat.com wrote:

 On Mon, 26 Sep 2011 17:52:58 +0200
 John Hughes j...@calva.com wrote:
 
  On 23/09/11 17:09, Jeff Layton wrote:
   I went ahead and attached the latest one to the RHBZ above. If you can
   grab it from there and test it, that would be great.
  
  
  Ok, I'm now testing the current patch on my Dell GX270 (Pentium-4, ugh) 
  system.
  
  Things are not working too well.
  
  Sometimes the system suspends with no problems, but sometimes it seems 
  to suspend and wake up instantly:
  
  Sep 26 17:27:06 bovic kernel: [  410.158736] PM: Syncing filesystems ... 
  done.
  Sep 26 17:27:06 bovic kernel: [  410.161113] PM: Preparing system for mem 
  sleep
  Sep 26 17:27:22 bovic kernel: [  410.213835] Freezing user space processes 
  ... (elapsed 0.01 seconds) done.
  Sep 26 17:27:22 bovic kernel: [  410.228074] Freezing remaining freezable 
  tasks ... (elapsed 0.01 seconds) done.
 
 Looks like the freeze worked...
 
  Sep 26 17:27:22 bovic kernel: [  410.244045] PM: Entering mem sleep
  Sep 26 17:27:22 bovic kernel: [  410.244065] Suspending console(s) (use 
  no_console_suspend to debug)
  [...]
  Sep 26 17:27:22 bovic kernel: [  410.926483] PM: suspend of devices 
  complete after 682.208 msecs
  Sep 26 17:27:22 bovic kernel: [  410.940182] PM: late suspend of devices 
  complete after 13.691 msecs
  Sep 26 17:27:22 bovic kernel: [  410.940248] ACPI: Preparing to enter 
  system sleep state S3
  Sep 26 17:27:22 bovic kernel: [  410.940935] PM: Saving platform NVS memory
  Sep 26 17:27:22 bovic kernel: [  410.940972] Disabling non-boot CPUs ...
  Sep 26 17:27:22 bovic kernel: [  411.044037] CPU 1 is now offline
  Sep 26 17:27:22 bovic kernel: [  411.044557] ACPI: Low-level resume complete
  Sep 26 17:27:22 bovic kernel: [  411.044557] PM: Restoring platform NVS 
  memory
  Sep 26 17:27:22 bovic kernel: [  411.044557] Enabling non-boot CPUs ...
  Sep 26 17:27:22 bovic kernel: [  411.044557] Booting Node 0 Processor 1 
  APIC 0x1
  Sep 26 17:27:22 bovic kernel: [  411.044557] smpboot cpu 1: start_ip = 9b000
  Sep 26 17:27:22 bovic kernel: [  410.942509] Initializing CPU#1
  Sep 26 17:27:22 bovic kernel: [  410.942509] Calibrating delay loop 
  (skipped) already calibrated this CPU
  Sep 26 17:27:22 bovic kernel: [  411.076635] NMI watchdog enabled, takes 
  one hw-pmu counter.
  Sep 26 17:27:22 bovic kernel: [  411.080018] Switched to NOHz mode on CPU #1
  Sep 26 17:27:22 bovic kernel: [  411.080077] CPU1 is up
  Sep 26 17:27:22 bovic kernel: [  411.080453] ACPI: Waking up from system 
  sleep state S3
  
  And after the wake up things seem not to work right - doing a dmesg on 
  a serial console hangs for example.
  
 
 Not sure what happened here. Looks like the freeze phase of the
 suspend worked. My inclination is to think that this is not due to my
 patch.
 
 What might be helpful is to try suspending without NFS being mounted.
 If that doesn't work either, then maybe back out the patch and try
 again without any NFS mounts.

Yeah, after looking over the code I really think the instant wakeup
thing is an unrelated problem. This patchset is really intended to fix
the problem where the freezer is unable to freeze tasks that are stuck
waiting on replies from NFS and CIFS servers.

If you aren't seeing these sorts of errors when trying to suspend, then
this is probably a different problem:

 https://bugzilla.redhat.com/show_bug.cgi?id=717735#c0

It would be nice to have some successful tests with this set though
before I propose it, but that might not be possible given the general
fragility of the userspace suspend code :-/

John would you be able to test the things I mentioned in my prior
reply? In particular, seeing whether you're able to suspend
consistently without nfs mounts and without those patches in place? If
that doesn't work either, then I'll be inclined to think that this is
unrelated.

-- 
Jeff Layton jlay...@redhat.com






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-26 Thread John Hughes

On 23/09/11 17:09, Jeff Layton wrote:

I went ahead and attached the latest one to the RHBZ above. If you can
grab it from there and test it, that would be great.
   


Ok, I'm now testing the current patch on my Dell GX270 (Pentium-4, ugh) 
system.


Things are not working too well.

Sometimes the system suspends with no problems, but sometimes it seems 
to suspend and wake up instantly:


Sep 26 17:27:06 bovic kernel: [  410.158736] PM: Syncing filesystems ... done.
Sep 26 17:27:06 bovic kernel: [  410.161113] PM: Preparing system for mem sleep
Sep 26 17:27:22 bovic kernel: [  410.213835] Freezing user space processes ... (elapsed 0.01 seconds) done.
Sep 26 17:27:22 bovic kernel: [  410.228074] Freezing remaining freezable tasks ... (elapsed 0.01 seconds) done.
Sep 26 17:27:22 bovic kernel: [  410.244045] PM: Entering mem sleep
Sep 26 17:27:22 bovic kernel: [  410.244065] Suspending console(s) (use no_console_suspend to debug)
[...]
Sep 26 17:27:22 bovic kernel: [  410.926483] PM: suspend of devices complete after 682.208 msecs
Sep 26 17:27:22 bovic kernel: [  410.940182] PM: late suspend of devices complete after 13.691 msecs
Sep 26 17:27:22 bovic kernel: [  410.940248] ACPI: Preparing to enter system sleep state S3
Sep 26 17:27:22 bovic kernel: [  410.940935] PM: Saving platform NVS memory
Sep 26 17:27:22 bovic kernel: [  410.940972] Disabling non-boot CPUs ...
Sep 26 17:27:22 bovic kernel: [  411.044037] CPU 1 is now offline
Sep 26 17:27:22 bovic kernel: [  411.044557] ACPI: Low-level resume complete
Sep 26 17:27:22 bovic kernel: [  411.044557] PM: Restoring platform NVS memory
Sep 26 17:27:22 bovic kernel: [  411.044557] Enabling non-boot CPUs ...
Sep 26 17:27:22 bovic kernel: [  411.044557] Booting Node 0 Processor 1 APIC 0x1
Sep 26 17:27:22 bovic kernel: [  411.044557] smpboot cpu 1: start_ip = 9b000
Sep 26 17:27:22 bovic kernel: [  410.942509] Initializing CPU#1
Sep 26 17:27:22 bovic kernel: [  410.942509] Calibrating delay loop (skipped) already calibrated this CPU
Sep 26 17:27:22 bovic kernel: [  411.076635] NMI watchdog enabled, takes one hw-pmu counter.
Sep 26 17:27:22 bovic kernel: [  411.080018] Switched to NOHz mode on CPU #1
Sep 26 17:27:22 bovic kernel: [  411.080077] CPU1 is up
Sep 26 17:27:22 bovic kernel: [  411.080453] ACPI: Waking up from system sleep state S3

And after the wake up things seem not to work right - doing a dmesg on 
a serial console hangs for example.








Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-26 Thread Jeff Layton
On Mon, 26 Sep 2011 17:52:58 +0200
John Hughes j...@calva.com wrote:

 On 23/09/11 17:09, Jeff Layton wrote:
  I went ahead and attached the latest one to the RHBZ above. If you can
  grab it from there and test it, that would be great.
 
 
 Ok, I'm now testing the current patch on my Dell GX270 (Pentium-4, ugh) 
 system.
 
 Things are not working too well.
 
 Sometimes the system suspends with no problems, but sometimes it seems 
 to suspend and wake up instantly:
 
 Sep 26 17:27:06 bovic kernel: [  410.158736] PM: Syncing filesystems ... done.
 Sep 26 17:27:06 bovic kernel: [  410.161113] PM: Preparing system for mem 
 sleep
 Sep 26 17:27:22 bovic kernel: [  410.213835] Freezing user space processes 
 ... (elapsed 0.01 seconds) done.
 Sep 26 17:27:22 bovic kernel: [  410.228074] Freezing remaining freezable 
 tasks ... (elapsed 0.01 seconds) done.

Looks like the freeze worked...

 Sep 26 17:27:22 bovic kernel: [  410.244045] PM: Entering mem sleep
 Sep 26 17:27:22 bovic kernel: [  410.244065] Suspending console(s) (use 
 no_console_suspend to debug)
 [...]
 Sep 26 17:27:22 bovic kernel: [  410.926483] PM: suspend of devices complete 
 after 682.208 msecs
 Sep 26 17:27:22 bovic kernel: [  410.940182] PM: late suspend of devices 
 complete after 13.691 msecs
 Sep 26 17:27:22 bovic kernel: [  410.940248] ACPI: Preparing to enter system 
 sleep state S3
 Sep 26 17:27:22 bovic kernel: [  410.940935] PM: Saving platform NVS memory
 Sep 26 17:27:22 bovic kernel: [  410.940972] Disabling non-boot CPUs ...
 Sep 26 17:27:22 bovic kernel: [  411.044037] CPU 1 is now offline
 Sep 26 17:27:22 bovic kernel: [  411.044557] ACPI: Low-level resume complete
 Sep 26 17:27:22 bovic kernel: [  411.044557] PM: Restoring platform NVS memory
 Sep 26 17:27:22 bovic kernel: [  411.044557] Enabling non-boot CPUs ...
 Sep 26 17:27:22 bovic kernel: [  411.044557] Booting Node 0 Processor 1 APIC 
 0x1
 Sep 26 17:27:22 bovic kernel: [  411.044557] smpboot cpu 1: start_ip = 9b000
 Sep 26 17:27:22 bovic kernel: [  410.942509] Initializing CPU#1
 Sep 26 17:27:22 bovic kernel: [  410.942509] Calibrating delay loop (skipped) 
 already calibrated this CPU
 Sep 26 17:27:22 bovic kernel: [  411.076635] NMI watchdog enabled, takes one 
 hw-pmu counter.
 Sep 26 17:27:22 bovic kernel: [  411.080018] Switched to NOHz mode on CPU #1
 Sep 26 17:27:22 bovic kernel: [  411.080077] CPU1 is up
 Sep 26 17:27:22 bovic kernel: [  411.080453] ACPI: Waking up from system 
 sleep state S3
 
 And after the wake up things seem not to work right - doing a dmesg on 
 a serial console hangs for example.
 

Not sure what happened here. Looks like the freeze phase of the
suspend worked. My inclination is to think that this is not due to my
patch.

What might be helpful is to try suspending without NFS being mounted.
If that doesn't work either, then maybe back out the patch and try
again without any NFS mounts.
-- 
Jeff Layton jlay...@redhat.com






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-24 Thread Jeff Layton
On Fri, 23 Sep 2011 19:58:45 -0500
Jonathan Nieder jrnie...@gmail.com wrote:

 found 642409 linux-2.6/2.6.32-35squeeze2
 found 642409 linux-2.6/3.0.0-4
 found 642409 linux-2.6/3.1.0~rc4-1~experimental.1
 tags 642409 + upstream
 quit
 
 Hi Jeff and John,
 
 John Hughes wrote:
 
  Well, after minor modifications (needed to include freezer.h in
  fs/nfs/inode.c and net/sunrpc/sched.c) I've installed  Jeff Layton's patch
  from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7
  kernel...
 
  ... and it seems to work.
 
 Thanks for writing and testing it.  Has this patch visited the
 linux-nfs@, linux-cifs@, or linux-pm@ list?

Not yet. I was planning to post the revised version next week after
John tested it.

-- 
Jeff Layton jlay...@redhat.com






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-24 Thread Jonathan Nieder
Jeff Layton wrote:
 Jonathan Nieder jrnie...@gmail.com wrote:

 Has this patch visited the
 linux-nfs@, linux-cifs@, or linux-pm@ list?

 Not yet. I was planning to post the revised version next week after
 John tested it.

Ah, that makes sense.  Sorry for the noise.






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes

On 22/09/11 19:34, Jonathan Nieder wrote:

Hi John,

John Hughes wrote:

   

On different machines (Dell Optiplex GX270, GX280, 390) with different
versions of the Kernel (x86 2.6.35, amd64 3.0.0) suspend from the kernel
fails if a user whose home directory is on a krb5/nfs4 server is logged in.

The kernel seems to be unable to freeze user tasks doing nfs ops.
 

What are the oldest and newest kernels you've experienced this with
   


I'm seeing it with:

Linux version 2.6.32-5-686 (Debian 2.6.32-35squeeze2) (da...@debian.org) 
(gcc version 4.3.5 (Debian 4.3.5-4) ) #1 SMP Fri Sep 9 20:51:05 UTC 2011


(My original bug report was wrong, I mis-read 2.6.32-5 as 2.6.35).

Also on:

Linux version 3.0.0-1-amd64 (Debian 3.0.0-4) (b...@decadent.org.uk) (gcc 
version 4.5.3 (Debian 4.5.3-9) ) #1 SMP Tue Sep 20 07:03:13 UTC 2011


(installed on squeeze)






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes

On 22/09/11 19:42, Jonathan Nieder wrote:

Thanks!  That bug log has a message:

| seems to be fixed in 3.1.0-0.rc6.git0.0.fc16.x86_64

and I don't see any relevant patches in the Fedora tree at
http://pkgs.fedoraproject.org/gitweb/?p=kernel.git;a=tree.  So it
could be worth testing 3.1-rc6 from experimental, too.
   


Ah, I didn't see that message.  I'll try that.









Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes

On 23/09/11 09:31, John Hughes wrote:

On 22/09/11 19:42, Jonathan Nieder wrote:

So it
could be worth testing 3.1-rc6 from experimental, too.


Ah, I didn't see that message.  I'll try that.


Nothing later than 3.1-rc4 in experimental that I can see.

And that behaves in exactly the same way.







Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes

On 22/09/11 19:42, Jonathan Nieder wrote:

So it
could be worth testing 3.1-rc6 from experimental, too.
   


I built my own kernel from github (3.1-rc7).

Linux version 3.1.0-rc7 (john@medic) (gcc version 4.4.5 (Debian 4.4.5-8) 
) #1 SMP Fri Sep 23 12:15:59 CEST 2011


Same problem.

[   95.568860] PM: Syncing filesystems ... done.
[   95.569423] PM: Preparing system for mem sleep
[   95.798364] Freezing user space processes ...
[  115.809770] Freezing of tasks failed after 20.01 seconds (3 tasks refusing to freeze, wq_busy=0):
[  115.809789] nm-applet   D 880125306970 0  2230   2122 0x0084
[  115.809795]  880125306970 0086 88012ec12f80 880125377790
[  115.809801]  00012f80 88012465ffd8 88012465ffd8 00012f80
[  115.809813]  880125306970 88012465e010 88012ec12f80 81044162
[  115.809816] Call Trace:
[  115.809822]  [81044162] ? ttwu_do_wakeup+0x51/0xc8
[  115.809832]  [a03137dc] ? rpc_queue_empty+0x26/0x26 [sunrpc]
[  115.809837]  [a031380b] ? rpc_wait_bit_killable+0x2f/0x33 [sunrpc]
[  115.809840]  [81346f79] ? __wait_on_bit+0x3e/0x6f
[  115.809841]  [81347018] ? out_of_line_wait_on_bit+0x6e/0x77
[  115.809845]  [a03137dc] ? rpc_queue_empty+0x26/0x26 [sunrpc]
[  115.809848]  [81063bf7] ? autoremove_wake_function+0x2a/0x2a
[  115.809852]  [a030db02] ? rpc_run_task+0x71/0x79 [sunrpc]
[  115.809860]  [a038242f] ? nfs4_run_open_task+0xde/0x10b [nfs]
[  115.809867]  [a039174d] ? nfs4_get_open_state+0x154/0x1ac [nfs]
[  115.809873]  [a0383041] ? nfs4_do_open+0x11a/0x33a [nfs]
[  115.809878]  [a0314b14] ? put_rpccred+0x35/0x11b [sunrpc]
[  115.809883]  [a038327c] ? nfs4_atomic_open+0x1b/0x33 [nfs]
[  115.809887]  [a036cd2b] ? nfs_open_revalidate+0x113/0x1db [nfs]
[  115.809890]  [8110b174] ? walk_component+0x26c/0x3cb
[  115.809892]  [8110b3c8] ? do_last+0xf5/0x56a
[  115.809895]  [8110bdaa] ? path_openat+0xce/0x31f
[  115.809897]  [8110c0c8] ? do_filp_open+0x2c/0x72
[  115.809899]  [81115a34] ? alloc_fd+0x69/0x10b
[  115.809901]  [810ff715] ? do_sys_open+0x61/0xe8
[  115.809904]  [8134d252] ? system_call_fastpath+0x16/0x1b
...
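
When comparing kernels it can help to pull just the headline facts out of a trace like the one above: how long the freezer waited and which tasks refused to freeze. A minimal Python sketch, assuming the message wording shown in this log (other kernel versions may word it differently); summarize_freeze_failure is a hypothetical helper name:

```python
import re

# Headline line; \s+ between words tolerates mail-wrapped log lines.
FAILED = re.compile(
    r"Freezing\s+of\s+tasks\s+failed\s+after\s+([\d.]+)\s+seconds\s+"
    r"\((\d+)\s+tasks\s+refusing\s+to\s+freeze"
)
# Task header line: "] <comm>   <state letter> <hex value> ..."
TASK = re.compile(r"\]\s+(\S+)\s+([A-Z])\s+[0-9a-f]+\s")


def summarize_freeze_failure(log_text):
    """Return a dict describing the first freezer failure, or None."""
    m = FAILED.search(log_text)
    if m is None:
        return None
    # Only look at task headers that follow the failure message.
    tasks = TASK.findall(log_text[m.end():])
    return {
        "seconds": float(m.group(1)),
        "refusing": int(m.group(2)),
        "tasks": tasks,  # list of (command name, state letter)
    }
```

Fed the excerpt above, the sketch would report nm-applet in state D (uninterruptible sleep), which fits the rpc_wait_bit_killable frames in its call trace.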







Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes
Well, after minor modifications (needed to include freezer.h in 
fs/nfs/inode.c and net/sunrpc/sched.c) I've installed  Jeff Layton's 
patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 
3.1-rc7 kernel...


... and it seems to work.

Or at least it works a lot better.

I've suspended and restarted many times without problems.

This is on the Dell Optiplex 390 (x86_64 processor).

I will build a kernel and try on the Optiplex GX270 (x86 processor).







Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread Jeff Layton
On Fri, 23 Sep 2011 16:30:36 +0200
John Hughes j...@calva.com wrote:

 Well, after minor modifications (needed to include freezer.h in 
 fs/nfs/inode.c and net/sunrpc/sched.c) I've installed  Jeff Layton's 
 patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 
 3.1-rc7 kernel...
 
 ... and it seems to work.
 
 Or at least it works a lot better.
 
 I've suspended and restarted many times without problems.
 
 This is on the Dell Optiplex 390 (x86_64 processor).
 
 I will build a kernel and try on the Optiplex GX270 (x86 processor).
 

Thanks for testing them. I actually have a revised version that
includes the right header and also fixes some other cases where the NFS
code can sleep like this. I'll see if I can get something together for
3.2.

-- 
Jeff Layton jlay...@redhat.com






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes

On 23/09/11 16:40, Jeff Layton wrote:

On Fri, 23 Sep 2011 16:30:36 +0200
John Hughes j...@calva.com wrote:

   

Well, after minor modifications (needed to include freezer.h in
fs/nfs/inode.c and net/sunrpc/sched.c) I've installed  Jeff Layton's
patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my
3.1-rc7 kernel...

 

Thanks for testing them. I actually have a revised version that
includes the right header and also fixes some other cases where the NFS
code can sleep like this. I'll see if I can get something together for
3.2.
   


Could you send me the revised version - I thought I saw one problem when 
I started testing, but wrote it off to having the wrong kernel.


Anyway, thanks for the patch.








Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread Jeff Layton
On Fri, 23 Sep 2011 17:01:20 +0200
John Hughes j...@calva.com wrote:

 On 23/09/11 16:40, Jeff Layton wrote:
  On Fri, 23 Sep 2011 16:30:36 +0200
  John Hughes j...@calva.com wrote:
 
 
  Well, after minor modifications (needed to include freezer.h in
  fs/nfs/inode.c and net/sunrpc/sched.c) I've installed  Jeff Layton's
  patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my
  3.1-rc7 kernel...
 
   
  Thanks for testing them. I actually have a revised version that
  includes the right header and also fixes some other cases where the NFS
  code can sleep like this. I'll see if I can get something together for
  3.2.
 
 
 Could you send me the revised version - I thought I saw one problem when 
 I started testing, but wrote it off to having the wrong kernel.
 
 Anyway, thanks for the patch.
 
 

I went ahead and attached the latest one to the RHBZ above. If you can
grab it from there and test it, that would be great.

Thanks,
-- 
Jeff Layton jlay...@redhat.com






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread John Hughes

On 23/09/11 17:09, Jeff Layton wrote:

I went ahead and attached the latest one to the RHBZ above. If you can
grab it from there and test it, that would be great.
   


Ok, I'll try that Monday.







Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-23 Thread Jonathan Nieder
found 642409 linux-2.6/2.6.32-35squeeze2
found 642409 linux-2.6/3.0.0-4
found 642409 linux-2.6/3.1.0~rc4-1~experimental.1
tags 642409 + upstream
quit

Hi Jeff and John,

John Hughes wrote:

 Well, after minor modifications (needed to include freezer.h in
 fs/nfs/inode.c and net/sunrpc/sched.c) I've installed  Jeff Layton's patch
 from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7
 kernel...

 ... and it seems to work.

Thanks for writing and testing it.  Has this patch visited the
linux-nfs@, linux-cifs@, or linux-pm@ list?






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-22 Thread John Hughes
Package: linux-2.6
Severity: normal


On different machines (Dell Optiplex GX270, GX280, 390) with different
versions of the Kernel (x86 2.6.35, amd64 3.0.0) suspend from the kernel
fails if a user whose home directory is on a krb5/nfs4 server is logged in.

The kernel seems to be unable to freeze user tasks doing nfs ops.

Here's a bit of syslog that looks relevant:


Sep 22 12:49:45 medic kernel: [85320.038047] PM: Syncing filesystems ... done.
Sep 22 12:49:45 medic kernel: [85320.039687] PM: Preparing system for mem sleep
Sep 22 12:50:05 medic kernel: [85320.238666] Freezing user space processes ... 
Sep 22 12:50:05 medic kernel: [85340.250049] Freezing of tasks failed after 
20.01 seconds (3 tasks refusing to freeze, wq_busy=0):
Sep 22 12:50:05 medic kernel: [85340.250070] nm-applet   D 88012666f060 
0  4428   4332 0x0084
Sep 22 12:50:05 medic kernel: [85340.250076]  88012666f060 0082 
0003 
Sep 22 12:50:05 medic kernel: [85340.250089]  880126ee8870 00012800 
880127e0ffd8 880127e0ffd8
Sep 22 12:50:05 medic kernel: [85340.250092]  00012800 88012666f060 
00012800 00012800
Sep 22 12:50:05 medic kernel: [85340.250095] Call Trace:
Sep 22 12:50:05 medic kernel: [85340.250101]  [81071f28] ? 
arch_local_irq_save+0x14/0x1d
Sep 22 12:50:05 medic kernel: [85340.250112]  [a030723a] ? 
rpc_queue_empty+0x28/0x28 [sunrpc]
Sep 22 12:50:05 medic kernel: [85340.250116]  [a030726c] ? 
rpc_wait_bit_killable+0x32/0x36 [sunrpc]
Sep 22 12:50:05 medic kernel: [85340.250120]  [81335dec] ? 
__wait_on_bit+0x3e/0x71
Sep 22 12:50:05 medic kernel: [85340.250122]  [81335e8d] ? 
out_of_line_wait_on_bit+0x6e/0x77
Sep 22 12:50:05 medic kernel: [85340.250126]  [a030723a] ? 
rpc_queue_empty+0x28/0x28 [sunrpc]
Sep 22 12:50:05 medic kernel: [85340.250129]  [810600ad] ? 
autoremove_wake_function+0x2a/0x2a
Sep 22 12:50:05 medic kernel: [85340.250133]  [a0301c7a] ? 
rpc_run_task+0xcc/0xd4 [sunrpc]
Sep 22 12:50:05 medic kernel: [85340.250141]  [a0367e6a] ? 
nfs4_run_open_task+0xdc/0x109 [nfs]
Sep 22 12:50:05 medic kernel: [85340.250149]  [a03728be] ? 
nfs4_get_open_state+0xdc/0x130 [nfs]
Sep 22 12:50:05 medic kernel: [85340.250155]  [a036abd8] ? 
nfs4_do_open.clone.26+0x149/0x38a [nfs]
Sep 22 12:50:05 medic kernel: [85340.250160]  [a036ae38] ? 
nfs4_atomic_open+0x1f/0x37 [nfs]
Sep 22 12:50:05 medic kernel: [85340.250165]  [a0353a94] ? 
nfs_open_revalidate+0xee/0x19d [nfs]
Sep 22 12:50:05 medic kernel: [85340.250167]  [811051c9] ? 
walk_component+0x249/0x3ce
Sep 22 12:50:05 medic kernel: [85340.250169]  [81103b49] ? 
exec_permission+0x1f/0x7d
Sep 22 12:50:05 medic kernel: [85340.250171]  [8110620a] ? 
link_path_walk+0x91/0x442
Sep 22 12:50:05 medic kernel: [85340.250173]  [811057c6] ? 
do_last+0xee/0x584
Sep 22 12:50:05 medic kernel: [85340.250175]  [81106ece] ? 
path_openat+0xc7/0x349
Sep 22 12:50:05 medic kernel: [85340.250180]  [a035ccd4] ? 
atomic_inc+0x4/0x4 [nfs]
Sep 22 12:50:05 medic kernel: [85340.250183]  [810b6965] ? 
__filemap_fdatawrite_range+0x4b/0x50
Sep 22 12:50:05 medic kernel: [85340.250186]  [8103840a] ? 
should_resched+0x5/0x24
Sep 22 12:50:05 medic kernel: [85340.250188]  [8110717c] ? 
do_filp_open+0x2c/0x72
Sep 22 12:50:05 medic kernel: [85340.250190]  [813358d7] ? 
_cond_resched+0x9/0x20
Sep 22 12:50:05 medic kernel: [85340.250193]  [811aca91] ? 
__strncpy_from_user+0x19/0x4a
Sep 22 12:50:05 medic kernel: [85340.250195]  [811104ec] ? 
alloc_fd+0x69/0x110
Sep 22 12:50:05 medic kernel: [85340.250198]  [810fb41c] ? 
do_sys_open+0x5f/0xe6
Sep 22 12:50:05 medic kernel: [85340.250200]  [8133bd12] ? 
system_call_fastpath+0x16/0x1b
Sep 22 12:50:05 medic kernel: [85340.250203] gnome-screensav D 880127134400 
0  4447  1 0x0084
Sep 22 12:50:05 medic kernel: [85340.250205]  880127134400 0086 
88012ec52800 88012ec52800
Sep 22 12:50:05 medic kernel: [85340.250207]  880126b095d0 00012800 
88011ff39fd8 88011ff39fd8
Sep 22 12:50:05 medic kernel: [85340.250210]  00012800 880127134400 
00012800 00012800
Sep 22 12:50:05 medic kernel: [85340.250212] Call Trace:
Sep 22 12:50:05 medic kernel: [85340.250214]  [81071f28] ? 
arch_local_irq_save+0x14/0x1d
Sep 22 12:50:05 medic kernel: [85340.250219]  [a030723a] ? 
rpc_queue_empty+0x28/0x28 [sunrpc]
Sep 22 12:50:05 medic kernel: [85340.250223]  [a030726c] ? 
rpc_wait_bit_killable+0x32/0x36 [sunrpc]
Sep 22 12:50:05 medic kernel: [85340.250225]  [81335dec] ? 
__wait_on_bit+0x3e/0x71
Sep 22 12:50:05 medic kernel: [85340.250227]  [81335e8d] ? 
out_of_line_wait_on_bit+0x6e/0x77
Sep 22 12:50:05 medic kernel: [85340.250231]  [a030723a] ? 
rpc_queue_empty+0x28/0x28 

Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-22 Thread Jonathan Nieder
Hi John,

John Hughes wrote:

 On different machines (Dell Optiplex GX270, GX280, 390) with different
 versions of the Kernel (x86 2.6.35, amd64 3.0.0) suspend from the kernel
 fails if a user whose home directory is on a krb5/nfs4 server is logged in.

 The kernel seems to be unable to freeze user tasks doing nfs ops.

What are the oldest and newest kernels you've experienced this with
(you can get kernel versions from the string starting with Debian in
/proc/version, or from /var/log/dpkg.log)?  Is this reproducible with
a squeeze (2.6.32.y) kernel (or in other words, is it a regression)?

Thanks for writing,
Jonathan






Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in

2011-09-22 Thread Jonathan Nieder
John Hughes wrote:

 Redhat has a similar sounding bug:
 https://bugzilla.redhat.com/show_bug.cgi?id=717735

 With a patch.

 I'll see if I can test it.

Thanks!  That bug log has a message:

| seems to be fixed in 3.1.0-0.rc6.git0.0.fc16.x86_64

and I don't see any relevant patches in the Fedora tree at
http://pkgs.fedoraproject.org/gitweb/?p=kernel.git;a=tree.  So it
could be worth testing 3.1-rc6 from experimental, too.


