On Sun, Jun 7, 2009 at 5:02 PM, Russ Allbery<r...@stanford.edu> wrote: > Jim Rowan <j...@qualcomm.com> writes: >> On Jun 7, 2009, at 1:32 PM, Russ Allbery wrote: > >>> We tracked a similar problem down to user programs who were trying to >>> access directories to which they didn't have permission (in our case >>> because their tokens had expired). Sufficient pounding on such >>> directories will trigger the Rx backoff handling in the file server >>> and start delaying Rx calls from that client, which can result in the >>> client deciding the file server is down or no longer responding. > >> How did you track that down? > > Correlating system kernel messages with error messages from the web > server.
simple way is "watch tcpdump for aborts coming from the fileserver with decreasing frequency over time while at the same time the client retransmits" -- Derrick _______________________________________________ OpenAFS-info mailing list OpenAFS-info@openafs.org https://lists.openafs.org/mailman/listinfo/openafs-info