Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 28/09/11 11:09, John Hughes wrote: I think you're right about the problems now being userspace. I feel that the hang after thaw problems I'm seeing are caused by krb5 - it seems to like to get itself into a situation where it has no krb5 ticket so it can't access my home directory which prevents the unlock window popping up so it can't get my password and ask for a new krb5 ticket. If I log on on another screen or serial terminal then the unlock dialogue pops up. I'm going to continue testing today but so far my feeling is that the patch works as advertised - nfs4 shares no longer prevent suspend from working. I think my hangs (system not restarting from freeze, especialy when first starting in the morning) are probably related to the situation described in Ubuntu bug https://bugs.launchpad.net/ubuntu/+bug/794112 and nothing to do with Jeff Layton's freezer patches. I've seen the error message described in the Ubuntu bug: Error: state manager encountered RPCSEC_GSS session expired against NFSv4 server. The bug report hints Error: state manager encountered RPCSEC_GSS session expired against NFSv4 server and I'm running a 3.1 kernel with a Debian squeeze userspace. I'll update the nfs utils and see whether that improves things. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 27/09/11 20:27, Jeff Layton wrote: On Mon, 26 Sep 2011 14:31:32 -0400 Jeff Laytonjlay...@redhat.com wrote: Not sure what happened here. Looks like the freeze phase of the suspend worked. My inclination is to think that this is not due to my patch. What might be helpful is to try suspending without NFS being mounted. If that doesn't work either, then maybe back out the patch and try again without any NFS mounts. Yeah, after looking over the code I really think the instant wakeup thing is an unrelated problem. This patchset is really intended to fix the problem where the freezer is unable to freeze tasks that are stuck waiting on replies from NFS and CIFS servers. If you aren't seeing these sorts of errors when trying to suspend, then this is probably a different problem: https://bugzilla.redhat.com/show_bug.cgi?id=717735#c0 It would be nice to have some successful tests with this set though before I propose it, but that might not be possible given the general fragility of the userspace suspend code :-/ I'm testing away. I think you're right about the problems now being userspace. I feel that the hang after thaw problems I'm seeing are caused by krb5 - it seems to like to get itself into a situation where it has no krb5 ticket so it can't access my home directory which prevents the unlock window popping up so it can't get my password and ask for a new krb5 ticket. If I log on on another screen or serial terminal then the unlock dialogue pops up. I'm going to continue testing today but so far my feeling is that the patch works as advertised - nfs4 shares no longer prevent suspend from working. John would you be able to test the things I mentioned in my prior reply? In particular, seeing whether you're able to suspend consistently without nfs mounts and without those patches in place? If that doesn't work either, then I'll be inclined to think that this is unrelated. So far (I'll confirm tomorrow) 1. I can suspend without nfs mounts 2. I can suspend with nfs mounts if they are not my home directory 3. suspending with nfs mounts of the home directory using echo mem /sys/power/state works 4. suspending with nfs mounts of the home directory using pm-suspend usualy works 5. suspending with nfs mounts of the home directory using gnome shutdown/suspend menu item often hangs. The problems are much more visible on my klunky old Optiplex GX270 systems (2.6GHz P4) than my nice new Optiplex 390 (Core I3). -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On Wed, 28 Sep 2011 11:09:33 +0200 John Hughes j...@calva.com wrote: On 27/09/11 20:27, Jeff Layton wrote: On Mon, 26 Sep 2011 14:31:32 -0400 Jeff Laytonjlay...@redhat.com wrote: Not sure what happened here. Looks like the freeze phase of the suspend worked. My inclination is to think that this is not due to my patch. What might be helpful is to try suspending without NFS being mounted. If that doesn't work either, then maybe back out the patch and try again without any NFS mounts. Yeah, after looking over the code I really think the instant wakeup thing is an unrelated problem. This patchset is really intended to fix the problem where the freezer is unable to freeze tasks that are stuck waiting on replies from NFS and CIFS servers. If you aren't seeing these sorts of errors when trying to suspend, then this is probably a different problem: https://bugzilla.redhat.com/show_bug.cgi?id=717735#c0 It would be nice to have some successful tests with this set though before I propose it, but that might not be possible given the general fragility of the userspace suspend code :-/ I'm testing away. I think you're right about the problems now being userspace. I feel that the hang after thaw problems I'm seeing are caused by krb5 - it seems to like to get itself into a situation where it has no krb5 ticket so it can't access my home directory which prevents the unlock window popping up so it can't get my password and ask for a new krb5 ticket. Maybe... krb5 tickets are usually stored in /tmp, so it's probably not the lack of that. It is however likely that the GUI requires access to some files in your homedir, which might be problematic. Still though, it seems like as long as you have a valid ticket, the resume ought to work (assuming that rpc.gssd is back up and running). If I log on on another screen or serial terminal then the unlock dialogue pops up. That certainly sounds like you're needing to reestablish your credcache. Are you putting the krb5 credcache in the keyring or memory or something? I'm going to continue testing today but so far my feeling is that the patch works as advertised - nfs4 shares no longer prevent suspend from working. John would you be able to test the things I mentioned in my prior reply? In particular, seeing whether you're able to suspend consistently without nfs mounts and without those patches in place? If that doesn't work either, then I'll be inclined to think that this is unrelated. So far (I'll confirm tomorrow) 1. I can suspend without nfs mounts 2. I can suspend with nfs mounts if they are not my home directory 3. suspending with nfs mounts of the home directory using echo mem /sys/power/state works 4. suspending with nfs mounts of the home directory using pm-suspend usualy works 5. suspending with nfs mounts of the home directory using gnome shutdown/suspend menu item often hangs. The problems are much more visible on my klunky old Optiplex GX270 systems (2.6GHz P4) than my nice new Optiplex 390 (Core I3). Nice, thanks for testing it. I'll go ahead and send these out later today. -- Jeff Layton jlay...@redhat.com -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On Mon, 26 Sep 2011 14:31:32 -0400 Jeff Layton jlay...@redhat.com wrote: On Mon, 26 Sep 2011 17:52:58 +0200 John Hughes j...@calva.com wrote: On 23/09/11 17:09, Jeff Layton wrote: I went ahead and attached the latest one to the RHBZ above. If you can grab it from there and test it, that would be great. Ok, I'm now testing the current patch on my Dell GX270 (Pentium-4, ugh) system. Things are not working too well. Sometimes the system suspends with no problems, but sometimes it seems to suspend and wake up instantly: Sep 26 17:27:06 bovic kernel: [ 410.158736] PM: Syncing filesystems ... done. Sep 26 17:27:06 bovic kernel: [ 410.161113] PM: Preparing system for mem sleep Sep 26 17:27:22 bovic kernel: [ 410.213835] Freezing user space processes ... (elapsed 0.01 seconds) done. Sep 26 17:27:22 bovic kernel: [ 410.228074] Freezing remaining freezable tasks ... (elapsed 0.01 seconds) done. Looks like the freeze worked... Sep 26 17:27:22 bovic kernel: [ 410.244045] PM: Entering mem sleep Sep 26 17:27:22 bovic kernel: [ 410.244065] Suspending console(s) (use no_console_suspend to debug) [...] Sep 26 17:27:22 bovic kernel: [ 410.926483] PM: suspend of devices complete after 682.208 msecs Sep 26 17:27:22 bovic kernel: [ 410.940182] PM: late suspend of devices complete after 13.691 msecs Sep 26 17:27:22 bovic kernel: [ 410.940248] ACPI: Preparing to enter system sleep state S3 Sep 26 17:27:22 bovic kernel: [ 410.940935] PM: Saving platform NVS memory Sep 26 17:27:22 bovic kernel: [ 410.940972] Disabling non-boot CPUs ... Sep 26 17:27:22 bovic kernel: [ 411.044037] CPU 1 is now offline Sep 26 17:27:22 bovic kernel: [ 411.044557] ACPI: Low-level resume complete Sep 26 17:27:22 bovic kernel: [ 411.044557] PM: Restoring platform NVS memory Sep 26 17:27:22 bovic kernel: [ 411.044557] Enabling non-boot CPUs ... Sep 26 17:27:22 bovic kernel: [ 411.044557] Booting Node 0 Processor 1 APIC 0x1 Sep 26 17:27:22 bovic kernel: [ 411.044557] smpboot cpu 1: start_ip = 9b000 Sep 26 17:27:22 bovic kernel: [ 410.942509] Initializing CPU#1 Sep 26 17:27:22 bovic kernel: [ 410.942509] Calibrating delay loop (skipped) already calibrated this CPU Sep 26 17:27:22 bovic kernel: [ 411.076635] NMI watchdog enabled, takes one hw-pmu counter. Sep 26 17:27:22 bovic kernel: [ 411.080018] Switched to NOHz mode on CPU #1 Sep 26 17:27:22 bovic kernel: [ 411.080077] CPU1 is up Sep 26 17:27:22 bovic kernel: [ 411.080453] ACPI: Waking up from system sleep state S3 And after the wake up things seem not to work right - doing a dmesg on a serial console hangs for example. Not sure what happened here. Looks like the freeze phase of the suspend worked. My inclination is to think that this is not due to my patch. What might be helpful is to try suspending without NFS being mounted. If that doesn't work either, then maybe back out the patch and try again without any NFS mounts. Yeah, after looking over the code I really think the instant wakeup thing is an unrelated problem. This patchset is really intended to fix the problem where the freezer is unable to freeze tasks that are stuck waiting on replies from NFS and CIFS servers. If you aren't seeing these sorts of errors when trying to suspend, then this is probably a different problem: https://bugzilla.redhat.com/show_bug.cgi?id=717735#c0 It would be nice to have some successful tests with this set though before I propose it, but that might not be possible given the general fragility of the userspace suspend code :-/ John would you be able to test the things I mentioned in my prior reply? In particular, seeing whether you're able to suspend consistently without nfs mounts and without those patches in place? If that doesn't work either, then I'll be inclined to think that this is unrelated. -- Jeff Layton jlay...@redhat.com -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 23/09/11 17:09, Jeff Layton wrote: I went ahead and attached the latest one to the RHBZ above. If you can grab it from there and test it, that would be great. Ok, I'm now testing the current patch on my Dell GX270 (Pentium-4, ugh) system. Things are not working too well. Sometimes the system suspends with no problems, but sometimes it seems to suspend and wake up instantly: Sep 26 17:27:06 bovic kernel: [ 410.158736] PM: Syncing filesystems ... done. Sep 26 17:27:06 bovic kernel: [ 410.161113] PM: Preparing system for mem sleep Sep 26 17:27:22 bovic kernel: [ 410.213835] Freezing user space processes ... (elapsed 0.01 seconds) done. Sep 26 17:27:22 bovic kernel: [ 410.228074] Freezing remaining freezable tasks ... (elapsed 0.01 seconds) done. Sep 26 17:27:22 bovic kernel: [ 410.244045] PM: Entering mem sleep Sep 26 17:27:22 bovic kernel: [ 410.244065] Suspending console(s) (use no_console_suspend to debug) [...] Sep 26 17:27:22 bovic kernel: [ 410.926483] PM: suspend of devices complete after 682.208 msecs Sep 26 17:27:22 bovic kernel: [ 410.940182] PM: late suspend of devices complete after 13.691 msecs Sep 26 17:27:22 bovic kernel: [ 410.940248] ACPI: Preparing to enter system sleep state S3 Sep 26 17:27:22 bovic kernel: [ 410.940935] PM: Saving platform NVS memory Sep 26 17:27:22 bovic kernel: [ 410.940972] Disabling non-boot CPUs ... Sep 26 17:27:22 bovic kernel: [ 411.044037] CPU 1 is now offline Sep 26 17:27:22 bovic kernel: [ 411.044557] ACPI: Low-level resume complete Sep 26 17:27:22 bovic kernel: [ 411.044557] PM: Restoring platform NVS memory Sep 26 17:27:22 bovic kernel: [ 411.044557] Enabling non-boot CPUs ... Sep 26 17:27:22 bovic kernel: [ 411.044557] Booting Node 0 Processor 1 APIC 0x1 Sep 26 17:27:22 bovic kernel: [ 411.044557] smpboot cpu 1: start_ip = 9b000 Sep 26 17:27:22 bovic kernel: [ 410.942509] Initializing CPU#1 Sep 26 17:27:22 bovic kernel: [ 410.942509] Calibrating delay loop (skipped) already calibrated this CPU Sep 26 17:27:22 bovic kernel: [ 411.076635] NMI watchdog enabled, takes one hw-pmu counter. Sep 26 17:27:22 bovic kernel: [ 411.080018] Switched to NOHz mode on CPU #1 Sep 26 17:27:22 bovic kernel: [ 411.080077] CPU1 is up Sep 26 17:27:22 bovic kernel: [ 411.080453] ACPI: Waking up from system sleep state S3 And after the wake up things seem not to work right - doing a dmesg on a serial console hangs for example. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On Mon, 26 Sep 2011 17:52:58 +0200 John Hughes j...@calva.com wrote: On 23/09/11 17:09, Jeff Layton wrote: I went ahead and attached the latest one to the RHBZ above. If you can grab it from there and test it, that would be great. Ok, I'm now testing the current patch on my Dell GX270 (Pentium-4, ugh) system. Things are not working too well. Sometimes the system suspends with no problems, but sometimes it seems to suspend and wake up instantly: Sep 26 17:27:06 bovic kernel: [ 410.158736] PM: Syncing filesystems ... done. Sep 26 17:27:06 bovic kernel: [ 410.161113] PM: Preparing system for mem sleep Sep 26 17:27:22 bovic kernel: [ 410.213835] Freezing user space processes ... (elapsed 0.01 seconds) done. Sep 26 17:27:22 bovic kernel: [ 410.228074] Freezing remaining freezable tasks ... (elapsed 0.01 seconds) done. Looks like the freeze worked... Sep 26 17:27:22 bovic kernel: [ 410.244045] PM: Entering mem sleep Sep 26 17:27:22 bovic kernel: [ 410.244065] Suspending console(s) (use no_console_suspend to debug) [...] Sep 26 17:27:22 bovic kernel: [ 410.926483] PM: suspend of devices complete after 682.208 msecs Sep 26 17:27:22 bovic kernel: [ 410.940182] PM: late suspend of devices complete after 13.691 msecs Sep 26 17:27:22 bovic kernel: [ 410.940248] ACPI: Preparing to enter system sleep state S3 Sep 26 17:27:22 bovic kernel: [ 410.940935] PM: Saving platform NVS memory Sep 26 17:27:22 bovic kernel: [ 410.940972] Disabling non-boot CPUs ... Sep 26 17:27:22 bovic kernel: [ 411.044037] CPU 1 is now offline Sep 26 17:27:22 bovic kernel: [ 411.044557] ACPI: Low-level resume complete Sep 26 17:27:22 bovic kernel: [ 411.044557] PM: Restoring platform NVS memory Sep 26 17:27:22 bovic kernel: [ 411.044557] Enabling non-boot CPUs ... Sep 26 17:27:22 bovic kernel: [ 411.044557] Booting Node 0 Processor 1 APIC 0x1 Sep 26 17:27:22 bovic kernel: [ 411.044557] smpboot cpu 1: start_ip = 9b000 Sep 26 17:27:22 bovic kernel: [ 410.942509] Initializing CPU#1 Sep 26 17:27:22 bovic kernel: [ 410.942509] Calibrating delay loop (skipped) already calibrated this CPU Sep 26 17:27:22 bovic kernel: [ 411.076635] NMI watchdog enabled, takes one hw-pmu counter. Sep 26 17:27:22 bovic kernel: [ 411.080018] Switched to NOHz mode on CPU #1 Sep 26 17:27:22 bovic kernel: [ 411.080077] CPU1 is up Sep 26 17:27:22 bovic kernel: [ 411.080453] ACPI: Waking up from system sleep state S3 And after the wake up things seem not to work right - doing a dmesg on a serial console hangs for example. Not sure what happened here. Looks like the freeze phase of the suspend worked. My inclination is to think that this is not due to my patch. What might be helpful is to try suspending without NFS being mounted. If that doesn't work either, then maybe back out the patch and try again without any NFS mounts. -- Jeff Layton jlay...@redhat.com -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On Fri, 23 Sep 2011 19:58:45 -0500 Jonathan Nieder jrnie...@gmail.com wrote: found 642409 linux-2.6/2.6.32-35squeeze2 found 642409 linux-2.6/3.0.0-4 found 642409 linux-2.6/3.1.0~rc4-1~experimental.1 tags 642409 + upstream quit Hi Jeff and John, John Hughes wrote: Well, after minor modifications (needed to include freezer.h in fs/nfs/inode.c and net/sunrpc/sched.c) I've installed Jeff Layton's patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7 kernel... ... and it seems to work. Thanks for writing and testing it. Has this patch visited the linux-nfs@, linux-cifs@, or linux-pm@ list? Not yet. I was planning to post the revised version next week after John tested it. -- Jeff Layton jlay...@redhat.com -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
Jeff Layton wrote: Jonathan Nieder jrnie...@gmail.com wrote: Has this patch visited the linux-nfs@, linux-cifs@, or linux-pm@ list? Not yet. I was planning to post the revised version next week after John tested it. Ah, that makes sense. Sorry for the noise. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 22/09/11 19:34, Jonathan Nieder wrote: Hi John, John Hughes wrote: On different machines (Dell Optiplex GX270, GX280, 390) with different versions of the Kernel (x86 2.6.35, amd64 3.0.0) suspend from the kernel fails if a user who's home directory is on a krb5/nfs4 server. The kernel seems to be unable to freeze user tasks doing nfs ops. What are the oldest and newest kernels you've experienced this with I'm seeing it with: Linux version 2.6.32-5-686 (Debian 2.6.32-35squeeze2) (da...@debian.org) (gcc version 4.3.5 (Debian 4.3.5-4) ) #1 SMP Fri Sep 9 20:51:05 UTC 2011 (My original bug report was wrong, I mis-read 2.6.32-5 as 2.6.35). Also on: Linux version 3.0.0-1-amd64 (Debian 3.0.0-4) (b...@decadent.org.uk) (gcc version 4.5.3 (Debian 4.5.3-9) ) #1 SMP Tue Sep 20 07:03:13 UTC 2011 (installed on squeeze) -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 22/09/11 19:42, Jonathan Nieder wrote: Thanks! That bug log has a message: | seems to be fixed in 3.1.0-0.rc6.git0.0.fc16.x86_64 and I don't see any relevant patches in the Fedora tree at http://pkgs.fedoraproject.org/gitweb/?p=kernel.git;a=tree. So it could be worth testing 3.1-rc6 from experimental, too. Ah, I didn't see that message. I'll try that. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 23/09/11 09:31, John Hughes wrote: On 22/09/11 19:42, Jonathan Nieder wrote: So it could be worth testing 3.1-rc6 from experimental, too. Ah, I didn't see that message. I'll try that. Nothing later than 3.1-rc4 in experimental that I can see. And that behaves in exactly the same way. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 22/09/11 19:42, Jonathan Nieder wrote: So it could be worth testing 3.1-rc6 from experimental, too. I built my own kernel from github (3.1-rc7). Linux version 3.1.0-rc7 (john@medic) (gcc version 4.4.5 (Debian 4.4.5-8) ) #1 SMP Fri Sep 23 12:15:59 CEST 2011 Same problem. [ 95.568860] PM: Syncing filesystems ... done. [ 95.569423] PM: Preparing system for mem sleep [ 95.798364] Freezing user space processes ... [ 115.809770] Freezing of tasks failed after 20.01 seconds (3 tasks refusing to freeze, wq_busy=0): [ 115.809789] nm-applet D 880125306970 0 2230 2122 0x0084 [ 115.809795] 880125306970 0086 88012ec12f80 880125377790 [ 115.809801] 00012f80 88012465ffd8 88012465ffd8 00012f80 [ 115.809813] 880125306970 88012465e010 88012ec12f80 81044162 [ 115.809816] Call Trace: [ 115.809822] [81044162] ? ttwu_do_wakeup+0x51/0xc8 [ 115.809832] [a03137dc] ? rpc_queue_empty+0x26/0x26 [sunrpc] [ 115.809837] [a031380b] ? rpc_wait_bit_killable+0x2f/0x33 [sunrpc] [ 115.809840] [81346f79] ? __wait_on_bit+0x3e/0x6f [ 115.809841] [81347018] ? out_of_line_wait_on_bit+0x6e/0x77 [ 115.809845] [a03137dc] ? rpc_queue_empty+0x26/0x26 [sunrpc] [ 115.809848] [81063bf7] ? autoremove_wake_function+0x2a/0x2a [ 115.809852] [a030db02] ? rpc_run_task+0x71/0x79 [sunrpc] [ 115.809860] [a038242f] ? nfs4_run_open_task+0xde/0x10b [nfs] [ 115.809867] [a039174d] ? nfs4_get_open_state+0x154/0x1ac [nfs] [ 115.809873] [a0383041] ? nfs4_do_open+0x11a/0x33a [nfs] [ 115.809878] [a0314b14] ? put_rpccred+0x35/0x11b [sunrpc] [ 115.809883] [a038327c] ? nfs4_atomic_open+0x1b/0x33 [nfs] [ 115.809887] [a036cd2b] ? nfs_open_revalidate+0x113/0x1db [nfs] [ 115.809890] [8110b174] ? walk_component+0x26c/0x3cb [ 115.809892] [8110b3c8] ? do_last+0xf5/0x56a [ 115.809895] [8110bdaa] ? path_openat+0xce/0x31f [ 115.809897] [8110c0c8] ? do_filp_open+0x2c/0x72 [ 115.809899] [81115a34] ? alloc_fd+0x69/0x10b [ 115.809901] [810ff715] ? do_sys_open+0x61/0xe8 [ 115.809904] [8134d252] ? system_call_fastpath+0x16/0x1b ... -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
Well, after minor modifications (needed to include freezer.h in fs/nfs/inode.c and net/sunrpc/sched.c) I've installed Jeff Layton's patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7 kernel... ... and it seems to work. Or at least it works a lot better. I've suspended and restarted many times without problems. This is on the Dell Optiplex 390 (x86_64 processor). I will build a kernel and try on the Optiplex GX270 (x86 processor). -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On Fri, 23 Sep 2011 16:30:36 +0200 John Hughes j...@calva.com wrote: Well, after minor modifications (needed to include freezer.h in fs/nfs/inode.c and net/sunrpc/sched.c) I've installed Jeff Layton's patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7 kernel... ... and it seems to work. Or at least it works a lot better. I've suspended and restarted many times without problems. This is on the Dell Optiplex 390 (x86_64 processor). I will build a kernel and try on the Optiplex GX270 (x86 processor). Thanks for testing them. I actually have a revised version that includes the right header and also fixes some other cases where the NFS code can sleep like this. I'll see if I can get something together for 3.2. -- Jeff Layton jlay...@redhat.com -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 23/09/11 16:40, Jeff Layton wrote: On Fri, 23 Sep 2011 16:30:36 +0200 John Hughesj...@calva.com wrote: Well, after minor modifications (needed to include freezer.h in fs/nfs/inode.c and net/sunrpc/sched.c) I've installed Jeff Layton's patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7 kernel... Thanks for testing them. I actually have a revised version that includes the right header and also fixes some other cases where the NFS code can sleep like this. I'll see if I can get something together for 3.2. Could you send me the revised version - I thought I saw one problem when I started testing, but wrote it off to having the wrong kernel. Anyway, thanks for the patch. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On Fri, 23 Sep 2011 17:01:20 +0200 John Hughes j...@calva.com wrote: On 23/09/11 16:40, Jeff Layton wrote: On Fri, 23 Sep 2011 16:30:36 +0200 John Hughesj...@calva.com wrote: Well, after minor modifications (needed to include freezer.h in fs/nfs/inode.c and net/sunrpc/sched.c) I've installed Jeff Layton's patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7 kernel... Thanks for testing them. I actually have a revised version that includes the right header and also fixes some other cases where the NFS code can sleep like this. I'll see if I can get something together for 3.2. Could you send me the revised version - I thought I saw one problem when I started testing, but wrote it off to having the wrong kernel. Anyway, thanks for the patch. I went ahead and attached the latest one to the RHBZ above. If you can grab it from there and test it, that would be great. Thanks, -- Jeff Layton jlay...@redhat.com -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
On 23/09/11 17:09, Jeff Layton wrote: I went ahead and attached the latest one to the RHBZ above. If you can grab it from there and test it, that would be great. Ok, I'll try that Monday. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
found 642409 linux-2.6/2.6.32-35squeeze2 found 642409 linux-2.6/3.0.0-4 found 642409 linux-2.6/3.1.0~rc4-1~experimental.1 tags 642409 + upstream quit Hi Jeff and John, John Hughes wrote: Well, after minor modifications (needed to include freezer.h in fs/nfs/inode.c and net/sunrpc/sched.c) I've installed Jeff Layton's patch from https://bugzilla.redhat.com/show_bug.cgi?id=717735 in my 3.1-rc7 kernel... ... and it seems to work. Thanks for writing and testing it. Has this patch visited the linux-nfs@, linux-cifs@, or linux-pm@ list? -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
Package: linux-2.6 Severity: normal On different machines (Dell Optiplex GX270, GX280, 390) with different versions of the Kernel (x86 2.6.35, amd64 3.0.0) suspend from the kernel fails if a user who's home directory is on a krb5/nfs4 server. The kernel seems to be unable to freeze user tasks doing nfs ops. Here's a bit of syslog that looks relevant: Sep 22 12:49:45 medic kernel: [85320.038047] PM: Syncing filesystems ... done. Sep 22 12:49:45 medic kernel: [85320.039687] PM: Preparing system for mem sleep Sep 22 12:50:05 medic kernel: [85320.238666] Freezing user space processes ... Sep 22 12:50:05 medic kernel: [85340.250049] Freezing of tasks failed after 20.01 seconds (3 tasks refusing to freeze, wq_busy=0): Sep 22 12:50:05 medic kernel: [85340.250070] nm-applet D 88012666f060 0 4428 4332 0x0084 Sep 22 12:50:05 medic kernel: [85340.250076] 88012666f060 0082 0003 Sep 22 12:50:05 medic kernel: [85340.250089] 880126ee8870 00012800 880127e0ffd8 880127e0ffd8 Sep 22 12:50:05 medic kernel: [85340.250092] 00012800 88012666f060 00012800 00012800 Sep 22 12:50:05 medic kernel: [85340.250095] Call Trace: Sep 22 12:50:05 medic kernel: [85340.250101] [81071f28] ? arch_local_irq_save+0x14/0x1d Sep 22 12:50:05 medic kernel: [85340.250112] [a030723a] ? rpc_queue_empty+0x28/0x28 [sunrpc] Sep 22 12:50:05 medic kernel: [85340.250116] [a030726c] ? rpc_wait_bit_killable+0x32/0x36 [sunrpc] Sep 22 12:50:05 medic kernel: [85340.250120] [81335dec] ? __wait_on_bit+0x3e/0x71 Sep 22 12:50:05 medic kernel: [85340.250122] [81335e8d] ? out_of_line_wait_on_bit+0x6e/0x77 Sep 22 12:50:05 medic kernel: [85340.250126] [a030723a] ? rpc_queue_empty+0x28/0x28 [sunrpc] Sep 22 12:50:05 medic kernel: [85340.250129] [810600ad] ? autoremove_wake_function+0x2a/0x2a Sep 22 12:50:05 medic kernel: [85340.250133] [a0301c7a] ? rpc_run_task+0xcc/0xd4 [sunrpc] Sep 22 12:50:05 medic kernel: [85340.250141] [a0367e6a] ? nfs4_run_open_task+0xdc/0x109 [nfs] Sep 22 12:50:05 medic kernel: [85340.250149] [a03728be] ? nfs4_get_open_state+0xdc/0x130 [nfs] Sep 22 12:50:05 medic kernel: [85340.250155] [a036abd8] ? nfs4_do_open.clone.26+0x149/0x38a [nfs] Sep 22 12:50:05 medic kernel: [85340.250160] [a036ae38] ? nfs4_atomic_open+0x1f/0x37 [nfs] Sep 22 12:50:05 medic kernel: [85340.250165] [a0353a94] ? nfs_open_revalidate+0xee/0x19d [nfs] Sep 22 12:50:05 medic kernel: [85340.250167] [811051c9] ? walk_component+0x249/0x3ce Sep 22 12:50:05 medic kernel: [85340.250169] [81103b49] ? exec_permission+0x1f/0x7d Sep 22 12:50:05 medic kernel: [85340.250171] [8110620a] ? link_path_walk+0x91/0x442 Sep 22 12:50:05 medic kernel: [85340.250173] [811057c6] ? do_last+0xee/0x584 Sep 22 12:50:05 medic kernel: [85340.250175] [81106ece] ? path_openat+0xc7/0x349 Sep 22 12:50:05 medic kernel: [85340.250180] [a035ccd4] ? atomic_inc+0x4/0x4 [nfs] Sep 22 12:50:05 medic kernel: [85340.250183] [810b6965] ? __filemap_fdatawrite_range+0x4b/0x50 Sep 22 12:50:05 medic kernel: [85340.250186] [8103840a] ? should_resched+0x5/0x24 Sep 22 12:50:05 medic kernel: [85340.250188] [8110717c] ? do_filp_open+0x2c/0x72 Sep 22 12:50:05 medic kernel: [85340.250190] [813358d7] ? _cond_resched+0x9/0x20 Sep 22 12:50:05 medic kernel: [85340.250193] [811aca91] ? __strncpy_from_user+0x19/0x4a Sep 22 12:50:05 medic kernel: [85340.250195] [811104ec] ? alloc_fd+0x69/0x110 Sep 22 12:50:05 medic kernel: [85340.250198] [810fb41c] ? do_sys_open+0x5f/0xe6 Sep 22 12:50:05 medic kernel: [85340.250200] [8133bd12] ? system_call_fastpath+0x16/0x1b Sep 22 12:50:05 medic kernel: [85340.250203] gnome-screensav D 880127134400 0 4447 1 0x0084 Sep 22 12:50:05 medic kernel: [85340.250205] 880127134400 0086 88012ec52800 88012ec52800 Sep 22 12:50:05 medic kernel: [85340.250207] 880126b095d0 00012800 88011ff39fd8 88011ff39fd8 Sep 22 12:50:05 medic kernel: [85340.250210] 00012800 880127134400 00012800 00012800 Sep 22 12:50:05 medic kernel: [85340.250212] Call Trace: Sep 22 12:50:05 medic kernel: [85340.250214] [81071f28] ? arch_local_irq_save+0x14/0x1d Sep 22 12:50:05 medic kernel: [85340.250219] [a030723a] ? rpc_queue_empty+0x28/0x28 [sunrpc] Sep 22 12:50:05 medic kernel: [85340.250223] [a030726c] ? rpc_wait_bit_killable+0x32/0x36 [sunrpc] Sep 22 12:50:05 medic kernel: [85340.250225] [81335dec] ? __wait_on_bit+0x3e/0x71 Sep 22 12:50:05 medic kernel: [85340.250227] [81335e8d] ? out_of_line_wait_on_bit+0x6e/0x77 Sep 22 12:50:05 medic kernel: [85340.250231] [a030723a] ? rpc_queue_empty+0x28/0x28
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
Hi John, John Hughes wrote: On different machines (Dell Optiplex GX270, GX280, 390) with different versions of the Kernel (x86 2.6.35, amd64 3.0.0) suspend from the kernel fails if a user who's home directory is on a krb5/nfs4 server. The kernel seems to be unable to freeze user tasks doing nfs ops. What are the oldest and newest kernels you've experienced this with (you can get kernel versions from the string starting with Debian in /proc/version, or from /var/log/dpkg.log)? Is this reproducible with a squeeze (2.6.32.y) kernel (or in other words, is it a regression)? Thanks for writing, Jonathan -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#642409: linux-2.6: suspend from gnome shutdown fails when krb5/nfs4 user logged in
John Hughes wrote: Redhat has a similar sounding bug: https://bugzilla.redhat.com/show_bug.cgi?id=717735 With a patch. I'll see if I can test it. Thanks! That bug log has a message: | seems to be fixed in 3.1.0-0.rc6.git0.0.fc16.x86_64 and I don't see any relevant patches in the Fedora tree at http://pkgs.fedoraproject.org/gitweb/?p=kernel.git;a=tree. So it could be worth testing 3.1-rc6 from experimental, too. -- To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org