[ceph-users] MDS lost, Filesystem degraded and wont mount

2020-12-04 Thread Anton Aleksandrov
Hello community, we are on ceph 13.2.8 - today something happenned with one MDS and cephs status tells, that filesystem is degraded. It won't mount either. I have take server with MDS, that was not working down. There are 2 more MDS servers, but they stay in "rejoin" state. Also only 1 is show

[ceph-users] Re: MDS lost, Filesystem degraded and wont mount

2020-12-04 Thread Anton Aleksandrov
MDS while it is rejoining. Is that single MDS running out of memory during the rejoin phase? -- dan On Fri, Dec 4, 2020 at 10:49 AM Anton Aleksandrov wrote: Hello community, we are on ceph 13.2.8 - today something happenned with one MDS and cephs status tells, that filesystem is degraded. It won&#x

[ceph-users] Re: MDS lost, Filesystem degraded and wont mount

2020-12-04 Thread Anton Aleksandrov
l Stop all MDS, then: # rados -p cephfs_metadata_pool rm mds0_openfiles.0 then start one MDS. -- Dan On Fri, Dec 4, 2020 at 11:05 AM Anton Aleksandrov wrote: Yes, MDS eats all memory+swap, stays like this for a moment and then frees memory. mds_beacon_grace was already set to 1800 Also on o