Public bug reported:

Binary package hint: nfs-common

I have a little 2-computer network using NFS.
The client is ubuntu 8.04.1, the server is the current Debian testing
distribution.

When I share files with nfs4, the connection sometimes hangs
so you can do "ls n" (where "n" is a NFS mountpoint) and it
hangs forever.    It typically runs for tens of minutes with light use
before it freezes.   When it freezes, the kernel is still running,
everything is running that does not try to access files mounted
over NFS.

I know it's not a server freeze because I can connect
to the same NFS server, same exported file system from
another computer, and it'll work  (at least for a while).
I've seen this both ways:   where Ubuntu freezes, but
Debian can access itself via NFS and where Debian freezes
but Ubuntu can access Debian via NFS.

The equivalent configuration works reliably with NFSv3.

When it's frozen, my /var/log/syslog on the Ubuntu client
side rapidly fills up with error messages:

Aug 16 20:04:47 kitchen kernel: [10854.865221] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 1
Aug 16 20:04:47 kitchen kernel: [10854.866133] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 10017
Aug 16 20:04:47 kitchen kernel: [10854.866849] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 10017
Aug 16 20:04:47 kitchen kernel: [10854.867614] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 1
Aug 16 20:04:47 kitchen kernel: [10854.867995] NFSv4 callback: too many open 
TCP sockets, consider increasing the number of nfsd threads
Aug 16 20:04:47 kitchen kernel: [10854.868003] NFSv4 callback: last TCP connect 
from 192.168.3.2, port=42971
Aug 16 20:04:47 kitchen kernel: [10854.869477] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 1
Aug 16 20:04:47 kitchen kernel: [10854.870381] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 10017
Aug 16 20:04:47 kitchen kernel: [10854.871131] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 10017
Aug 16 20:04:47 kitchen kernel: [10854.871874] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 1
Aug 16 20:04:47 kitchen kernel: [10854.872880] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 1
Aug 16 20:04:47 kitchen kernel: [10854.873780] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 10017
Aug 16 20:04:47 kitchen kernel: [10854.874491] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 10017
...
Aug 16 20:04:47 kitchen kernel: [10854.971314] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.972318] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.974021] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.975075] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.976335] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.977571] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.978437] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.979118] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.979788] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.980474] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
Aug 16 20:04:47 kitchen kernel: [10854.983996] Error: state recovery failed on 
NFSv4 server 192.168.3.1 with error 22
...


It's mostly error 22.

This goes on for thousands and thousands of lines, at the rate of 1000
lines per second!


$ lsb_release -rd
Description:    Ubuntu 8.04.1
Release:        8.04
$ 

$ apt-cache policy nfs-common
nfs-common:
  Installed: 1:1.1.2-2ubuntu2.1
  Candidate: 1:1.1.2-2ubuntu2.1
  Version table:
 *** 1:1.1.2-2ubuntu2.1 0
        500 http://gb.archive.ubuntu.com hardy-updates/main Packages
        100 /var/lib/dpkg/status
     1:1.1.2-2ubuntu2 0
        500 http://gb.archive.ubuntu.com hardy/main Packages
$


Relevant lines from /etc/fstab:  (NFS4 is currently commented out,
but it was active earlier).

# desk.local:/gpk /home/gpk/n nfs4      bg,intr         0       0
desk.local:/export/big/gpk /home/gpk/n nfs      bg,intr         0       3
# desk.local:/MyDocuments/gpk /home/gpk/MyDocuments nfs4        bg,intr         
0       0
desk.local:/export/big/MyDocuments/gpk /home/gpk/MyDocuments nfs        bg,intr 
        0       3

Here's /etc/exports on the server (again, the NFS4 lines are currently 
commented out,
but they were active a little while ago):

# /export/big 192.168.2.2(rw,fsid=0,root_squash,subtree_check) 
127.0.0.1(rw,fsid=0,root_squash,subtree_check) 
192.168.3.2(rw,fsid=0,root_squash,subtree_check)
/export/big 192.168.2.2(rw,root_squash,subtree_check) 
127.0.0.1(rw,root_squash,subtree_check) 
192.168.3.2(rw,root_squash,subtree_check)
[EMAIL PROTECTED]:~$ 

ssh, web, and ping connections between the two machines work nicely.
It's a standard wired network, specified in /etc/networks/interfaces .

** Affects: nfs-utils (Ubuntu)
     Importance: Undecided
         Status: New

-- 
nfs4 occasionally hangs, lots of errors in syslog
https://bugs.launchpad.net/bugs/258651
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to