[ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning

2021-11-22 Thread 胡 玮文
onn...@redhat.com> 发送时间: 2021年11月20日 3:20 收件人: 胡 玮文<mailto:huw...@outlook.com> 抄送: Dan van der Ster<mailto:d...@vanderster.com>; ceph-users@ceph.io<mailto:ceph-users@ceph.io> 主题: Re: [ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning On Fri, Nov 19, 2021 at 2:14 AM 胡 玮文 wrot

[ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning

2021-11-19 Thread Patrick Donnelly
On Fri, Nov 19, 2021 at 2:14 AM 胡 玮文 wrote: > > Thanks Dan, > > I choose one of the stuck client to investigate, as shown below, it currently > holds ~269700 caps, which is pretty high with no obvious reason. I cannot > understand most of the output, and failed to find any documents about it. >

[ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning

2021-11-18 Thread 胡 玮文
Hi Patrick, One of the stuck client has num_caps at around 269700, and well above the number of files opened on the client (about 9k). See my reply to Dan for details. So I don't think this warning is simply caused by "mds_min_caps_working_set" being set too low. > -邮件原件- > 发件人: Patric

[ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning

2021-11-18 Thread 胡 玮文
Thanks Dan, I choose one of the stuck client to investigate, as shown below, it currently holds ~269700 caps, which is pretty high with no obvious reason. I cannot understand most of the output, and failed to find any documents about it. # ceph tell mds.cephfs.gpu018.ovxvoz client ls id=7915658

[ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning

2021-11-18 Thread Patrick Donnelly
On Thu, Nov 18, 2021 at 12:36 AM 胡 玮文 wrote: > > Hi all, > > We are consistently seeing the MDS_CLIENT_RECALL warning in our cluster, it > seems harmless, but we cannot get HEALTH_OK, which is annoying. > > The clients that are reported failing to respond to cache pressure are > constantly chang

[ceph-users] Re: Annoying MDS_CLIENT_RECALL Warning

2021-11-18 Thread Dan van der Ster
Hi, We sometimes have similar stuck client recall warnings. To debug you can try: (1) ceph health detail that will show you the client ids which are generating the warning. (e.g. 1234) (2) ceph tell mds.* client ls id=1234 this will show lots of client statistics for the session. Notabl