On Sun, Jun 7, 2009 at 5:02 PM, Russ Allbery<r...@stanford.edu> wrote:
> Jim Rowan <j...@qualcomm.com> writes:
>> On Jun 7, 2009, at 1:32 PM, Russ Allbery wrote:
>
>>> We tracked a similar problem down to user programs who were trying to
>>> access directories to which they didn't have permission (in our case
>>> because their tokens had expired).  Sufficient pounding on such
>>> directories will trigger the Rx backoff handling in the file server
>>> and start delaying Rx calls from that client, which can result in the
>>> client deciding the file server is down or no longer responding.
>
>> How did you track that down?
>
> Correlating system kernel messages with error messages from the web
> server.

A simple way is to watch tcpdump for aborts coming from the fileserver
with decreasing frequency over time while, at the same time, the client
keeps retransmitting.
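
For what it's worth, the "decreasing frequency" part can be checked
mechanically once you have the abort timestamps in hand. Below is a rough
sketch, not anything from the OpenAFS tree: it assumes you have already
pulled the arrival times (in seconds) of the abort packets out of a capture
by whatever means you like, and the function name and thresholds are made up
for illustration. It only tests whether the gaps between successive aborts
mostly grow, which is the pattern you would expect if the fileserver is
backing off.

```python
from typing import Sequence


def gaps_are_growing(
    timestamps: Sequence[float],
    min_events: int = 4,          # hypothetical threshold: too few aborts proves nothing
    min_fraction: float = 0.75,   # hypothetical threshold: share of gaps that must grow
) -> bool:
    """Return True if the gaps between successive events mostly increase.

    `timestamps` are arrival times in seconds of abort packets seen from
    the fileserver; how they are extracted from the capture is up to you.
    """
    # At least three events are needed to compare one gap with the next.
    if len(timestamps) < max(min_events, 3):
        return False

    ordered = sorted(timestamps)
    # Time between each abort and the one that follows it.
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    # Compare each gap with the following one and count those that grew.
    pairs = list(zip(gaps, gaps[1:]))
    growing = sum(1 for prev, cur in pairs if cur > prev)
    return growing / len(pairs) >= min_fraction


if __name__ == "__main__":
    # Aborts spaced 1s, 2s, 4s, 8s apart: consistent with a backoff.
    print(gaps_are_growing([0.0, 1.0, 3.0, 7.0, 15.0]))
    # Aborts at a steady 1s interval: no sign of backoff.
    print(gaps_are_growing([0.0, 1.0, 2.0, 3.0, 4.0]))
```

A positive result here is only a hint. You would still want to confirm in
the same capture that the client is retransmitting during those gaps.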


-- 
Derrick
_______________________________________________
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info
