[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-01 Thread Loic Tortay
On 01/05/2023 11:35, Frank Schilder wrote: Hi all, I think we might be hitting a known problem (https://tracker.ceph.com/issues/57244). I don't want to fail the mds yet, because we have troubles with older kclients that miss the mds restart and hold on to cache entries referring to the killed

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-02 Thread MARTEL Arnaud
Hi,
Or you can query the MDS(s) with:
    ceph tell mds.* dump inode 2>/dev/null | grep path
for example:
    user@server:~$ ceph tell mds.* dump inode 1099836155033 2>/dev/null | grep path
    "path": "/ec42/default/joliot/gipsi/gpu_burn.sif",
    "stray_prior_path": "",
Arnaud
On 01/05/2023 15:07
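The `grep path` step above can also be done in a script, for instance when the inode number comes from a health warning and the path should be logged. The following is only a minimal sketch: it assumes the command output has already been captured as text and looks like the sample quoted in the message. The function name `extract_paths` is hypothetical and not part of any Ceph tooling. Unlike `grep path`, it deliberately skips `stray_prior_path` and empty values.

```python
import json
import re
from typing import List

# Matches a JSON key named exactly "path" followed by a JSON string value.
# The opening quote directly before `path` keeps "stray_prior_path" from matching.
_PATH_FIELD = re.compile(r'"path":\s*("(?:[^"\\]|\\.)*")')


def extract_paths(dump_output: str) -> List[str]:
    """Return every non-empty "path" value found in captured dump-inode text."""
    paths = []
    for match in _PATH_FIELD.finditer(dump_output):
        value = json.loads(match.group(1))  # decode JSON string escapes
        if value:
            paths.append(value)
    return paths


# Sample text copied from the output quoted in the message above.
sample = (
    '"path": "/ec42/default/joliot/gipsi/gpu_burn.sif",\n'
    '"stray_prior_path": "",\n'
)
print(extract_paths(sample))
```

Working on captured text rather than calling `ceph` directly keeps the sketch independent of how the command is run (for example over SSH or from a monitoring job).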

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-02 Thread Frank Schilder
m S14
From: MARTEL Arnaud
Sent: Tuesday, May 2, 2023 11:20 AM
To: Frank Schilder; ceph-users@ceph.io
Subject: Re: [ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc
Hi, Or you can query the MDS(s)

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-04 Thread Xiubo Li
On 5/1/23 17:35, Frank Schilder wrote: Hi all, I think we might be hitting a known problem (https://tracker.ceph.com/issues/57244). I don't want to fail the mds yet, because we have troubles with older kclients that miss the mds restart and hold on to cache entries referring to the killed in

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-09 Thread Frank Schilder
Dear Xiubo, both issues will cause problems, the one reported in the subject (https://tracker.ceph.com/issues/57244) and the potential follow-up on MDS restart (https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/LYY7TBK63XPR6X6TD7372I2YEPJO2L6F). Either one will cause compute jobs

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-09 Thread Xiubo Li
On 5/9/23 16:23, Frank Schilder wrote: Dear Xiubo, both issues will cause problems, the one reported in the subject (https://tracker.ceph.com/issues/57244) and the potential follow-up on MDS restart (https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/LYY7TBK63XPR6X6TD7372I2YEPJO

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-10 Thread Frank Schilder
Hi Xiubo.
> IMO evicting the corresponding client could also resolve this issue
> instead of restarting the MDS.
Yes, it can get rid of the stuck caps release request, but it will also make any process accessing the file system crash. After a client eviction we usually have to reboot the server

[ceph-users] Re: client isn't responding to mclientcaps(revoke), pending pAsLsXsFsc issued pAsLsXsFsc

2023-05-11 Thread Xiubo Li
On 5/10/23 19:35, Frank Schilder wrote:
> Hi Xiubo.
>> IMO evicting the corresponding client could also resolve this issue instead of restarting the MDS.
> Yes, it can get rid of the stuck caps release request, but it will also make any process accessing the file system crash. After a client evicti