Re: NFS-Locking problem with 6.4/7.1-RELEASE

2009-01-19 Thread Garrett Cooper
On Sun, Jan 18, 2009 at 2:15 AM, Matthias Schuendehuette m...@snafu.de wrote:
 Hello,

 I operate two FreeBSD-Servers in a Windows- and HP-UX Environment. One is a
 SAMBA-Server as a gateway between the Windows and the Unix world, the other
 is NFS-Server for the HP-UX 11i v1 Workstations. Both are HP ProLiants DL380
 with additional external disks on SmartRAID Controllers.

 Since the HP-UX Workstations and their disks are becoming quite old, I
 started to move the home-directories to the FreeBSD Server, which worked
 quite well with 6.3-RELEASE so far.


 Brave as I am, I updated the servers to 6.4-RELEASE, and since then the users
 on the HP-UX machines with their homedirs on the FreeBSD-Server have been
 locked... :-(

 I tried to find out what was happening, and these are my results:

 When a user logs in on an HP-UX machine, his '.profile' file is opened and
 read/executed, but it seems that it cannot be closed any more. So if the
 last line in the '.profile' is echo foo bar, you *can* see foo bar on the
 screen, but then nothing more happens; the machine is locked.

 I recorded such a session with 'tcpdump' and looked at the dump... the only
 noticeable things are *bursts* of NLM V4 CANCEL_MSG calls on the same
 filehandle, e.g.:

 V4 CANCEL_MSG Call FH:0x644201fe svid: pos:0-0

 This line is repeated 7 times with various values for 'svid'.

 I'm no NFS specialist at all, so I cannot tell you more :-/ But I can supply
 the dump (if needed); it's 92KB, so the size should not be a problem...

 BTW: I tried this with and without kernel support for NFS-Locking - no
 difference. I also tried the new replacement server with FreeBSD
 7.1-RELEASE: Just the same problems, with and without kernel support.

 I hope someone is willing to work on that issue...

 As mentioned, a new, non-production server is available at the moment, so
 tests are easily possible.

 TIA

 Matthew

What OS and what NFS version are the HP-UX servers running? Have
you checked /var/log/messages on the clients and on the server for
helpful messages?
Thanks,
-Garrett


Re: NFS-Locking problem with 6.4/7.1-RELEASE

2009-01-19 Thread Matthias Schuendehuette

Hi Garrett,

On 19.01.2009 at 22:29, Garrett Cooper wrote:


   What OS and what NFS version are the HP-UX servers running?


The OS is HP-UX 11.11, a.k.a. HP-UX 11i v1.

The NFS version is NFSv3 (of course ;-) via TCP.


   Have you checked /var/log/messages on the clients and on the server
   for helpful messages?


No, I'll look tomorrow. I tested today against 7.1-STABLE (as of today),
but no change in behaviour so far.

In the meantime I found PR kern/130628 and that could be the
problem here too.

I started 'wireshark' on the NFS-server today and it showed endless
requests from the client to release a lock and equally endless
error-replies...

At least I would expect that an error like the one in kern/130628 would
look like what I observed today, but I may be wrong.


Thanks for your reply - Matthew

--
Ciao/BSD - Matthias

Matthias Schuendehuette    msch [at] snafu.de, Berlin (Germany)





NFS-Locking problem with 6.4/7.1-RELEASE

2009-01-18 Thread Matthias Schuendehuette

Hello,

I operate two FreeBSD-Servers in a Windows- and HP-UX Environment. One  
is a SAMBA-Server as a gateway between the Windows and the Unix world,  
the other is NFS-Server for the HP-UX 11i v1 Workstations. Both are HP  
ProLiants DL380 with additional external disks on SmartRAID Controllers.


Since the HP-UX Workstations and their disks are becoming quite old, I
started to move the home-directories to the FreeBSD Server, which
worked quite well with 6.3-RELEASE so far.



Brave as I am, I updated the servers to 6.4-RELEASE, and since then the
users on the HP-UX machines with their homedirs on the FreeBSD-Server
have been locked... :-(


I tried to find out what was happening, and these are my results:

When a user logs in on an HP-UX machine, his '.profile' file is opened
and read/executed, but it seems that it cannot be closed any more. So
if the last line in the '.profile' is echo foo bar, you *can* see
foo bar on the screen, but then nothing more happens; the
machine is locked.


I recorded such a session with 'tcpdump' and looked at the dump... the
only noticeable things are *bursts* of NLM V4 CANCEL_MSG calls on the same
filehandle, e.g.:


V4 CANCEL_MSG Call FH:0x644201fe svid: pos:0-0

This line is repeated 7 times with various values for 'svid'.
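
A minimal sketch, assuming the dump has been printed as text with one summary line per packet in the shape quoted above: it counts CANCEL_MSG calls per filehandle so that bursts on a single handle stand out. The function name count_cancels and the regular expression are illustrative assumptions, and real tcpdump or wireshark text output may be laid out differently.

```python
import re
from collections import Counter

# Assumed line shape, taken from the summary line quoted above:
#   V4 CANCEL_MSG Call FH:0x644201fe svid: pos:0-0
# Only the filehandle after "FH:" is captured; svid and pos are ignored.
CANCEL_RE = re.compile(r"V4 CANCEL_MSG Call FH:(0x[0-9a-fA-F]+)")


def count_cancels(lines):
    """Return a Counter mapping filehandle -> number of CANCEL_MSG calls."""
    counts = Counter()
    for line in lines:
        match = CANCEL_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts
```

Given a text export of the capture (the file name is hypothetical), count_cancels(open("dump.txt")).most_common() would list the filehandles with the most cancel calls first.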

I'm no NFS specialist at all, so I cannot tell you more :-/ But I can
supply the dump (if needed); it's 92KB, so the size should not be a
problem...

BTW: I tried this with and without kernel support for NFS-Locking - no
difference. I also tried the new replacement server with FreeBSD
7.1-RELEASE: Just the same problems, with and without kernel support.


I hope someone is willing to work on that issue...

As mentioned, a new, non-production server is available at the moment,
so tests are easily possible.


TIA

Matthew

--
Ciao/BSD - Matthias

Matthias Schuendehuette    msch [at] snafu.de, Berlin (Germany)