[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-13 Thread Frank Schilder
From: Frank Schilder Sent: 07 September 2021 15:30 To: Dan van der Ster; Patrick Donnelly Cc: ceph-users Subject: [ceph-users] Re: MDS daemons stuck in resolve, please help Hi Dan and Patrick, I collected some additional information trying the following: delete a snapshot, add a snapshot. My

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-07 Thread Frank Schilder
service window and try it out without load on it. Or upgrade first :) > ... I've cc'd Patrick. Thanks a lot! It would be really good if we could resolve the mystery of extra snapshots in pool con-fs2-data2. Best regards, ========= Frank Schilder AIT Risø Campus Bygning 109, rum

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-07 Thread Frank Schilder
, rum S14 From: Dan van der Ster Sent: 07 September 2021 14:20:58 To: Frank Schilder; Patrick Donnelly Cc: ceph-users Subject: Re: [ceph-users] Re: MDS daemons stuck in resolve, please help Hi, On Tue, Sep 7, 2021 at 1:55 PM Frank Schilder wrote: > > Hi Dan, >

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-07 Thread Frank Schilder
Subject: Re: [ceph-users] Re: MDS daemons stuck in resolve, please help Hi Frank, That's unfortunate! Most of those options relax warnings and relax when a client is considered having too many caps. The option mds_recall_max_caps might be CPU intensive -- the MDS would be busy recalling caps if indeed

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-07 Thread Dan van der Ster
> Maybe you could point one of the ceph fs devs to this problem? Yeah I certainly can't add more to clarify what you asked above; I simply don't know the snapshot code enough to speculate what might be going wrong here. I've cc'd Patrick. Cheers, dan > > Thanks and best regards, > ===

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-06 Thread Frank Schilder
Ster Cc: ceph-users Subject: [ceph-users] Re: MDS daemons stuck in resolve, please help Hi Dan, I'm running mimic latest version. Thanks for the link to the PR, this looks good. Directory pinning does not work in mimic, I had another case on that. The required xattribs are not implemented

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-09-06 Thread Dan van der Ster
and best regards, > ========= > Frank Schilder > AIT Risø Campus > Bygning 109, rum S14 > > > From: Dan van der Ster > Sent: 31 August 2021 15:26:17 > To: Frank Schilder > Cc: ceph-users > Subject: Re: [ceph-u

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-08-31 Thread Frank Schilder
that. Thanks and best regards, = Frank Schilder AIT Risø Campus Bygning 109, rum S14 From: Dan van der Ster Sent: 31 August 2021 15:26:17 To: Frank Schilder Cc: ceph-users Subject: Re: [ceph-users] Re: MDS daemons stuck in resolve, please help

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-08-31 Thread Frank Schilder
Bygning 109, rum S14 From: Frank Schilder Sent: 30 August 2021 21:37:18 To: ceph-users Subject: [ceph-users] Re: MDS daemons stuck in resolve, please help The MDS cluster came back up again, but I lost a number of standby MDS daemons. I cleared the OSD

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-08-31 Thread Dan van der Ster
Campus > Bygning 109, rum S14 > > ________________ > From: Frank Schilder > Sent: 30 August 2021 21:37:18 > To: ceph-users > Subject: [ceph-users] Re: MDS daemons stuck in resolve, please help > > The MDS cluster came back up again, but I lost a number

[ceph-users] Re: MDS daemons stuck in resolve, please help

2021-08-30 Thread Frank Schilder
The MDS cluster came back up again, but I lost a number of standby MDS daemons. I cleared the OSD blacklist, but they do not show up as stand-by daemons again. The daemon itself is running, but does not seem to re-join the cluster. The log shows: 2021-08-30 21:32:34.896 7fc9e22f8700 1