Re: [OpenAFS] OpenAFS 1.8.4 Linux kernel BUG

2020-05-01 Chris Cooke
Hi Ben,

That's a nice idea; thanks a lot. But it seems that the disks are OK? I've just queried the RAID controller and it seems that neither of this machine's disks has flagged a SMART error. The controller does a "patrol read" test of both disks once a week, on top of their normal use.

Chris

Re: [OpenAFS] OpenAFS 1.8.4 Linux kernel BUG

2020-05-01 Chris Cooke
Hi Ben,

Many thanks for looking at this.

> On 4 Apr 2020, at 04:35, Benjamin Kaduk wrote:
>
> Had this machine been running for a long time
> without restart or needing to flush the (AFS) cache?

It had been up for 29 days at that point. I can't find any mention of periodic flushes of the cach

Re: Re: [OpenAFS] Clients are blocked with error code -3 of RXAFSCB_ProbeUuid

2020-05-01 Benjamin Kaduk
On Tue, Apr 28, 2020 at 10:30:50AM +0800, huangql wrote:
> Hello Ben,
>
> Thank you for your reply.
>
> Actually, our farm experiences this issue for some time. And we spent a lot
> of time to figure out it. We found when there is large IO throughput to
> consume the network bandwidth and there