[ceph-users] MDS Bug/Problem

2018-03-23 Thread Perrin, Christopher (zimkop1)
Hi, Last week out MDSs started failing one after another, and could not be started anymore. After a lot of tinkering I found out that MDSs crashed after trying to rejoin the Cluster. The only Solution I found that, let them start again was resetting the journal vie cephfs-journal-tool. Now I ha

Re: [ceph-users] MDS Bug/Problem

2018-03-23 Thread John Spray
On Fri, Mar 23, 2018 at 7:45 PM, Perrin, Christopher (zimkop1) wrote: > Hi, > > Last week out MDSs started failing one after another, and could not be > started anymore. After a lot of tinkering I found out that MDSs crashed after > trying to rejoin the Cluster. The only Solution I found that, l

Re: [ceph-users] MDS Bug/Problem

2018-03-24 Thread Yan, Zheng
On Fri, Mar 23, 2018 at 7:45 PM, Perrin, Christopher (zimkop1) wrote: > Hi, > > Last week out MDSs started failing one after another, and could not be > started anymore. After a lot of tinkering I found out that MDSs crashed after > trying to rejoin the Cluster. The only Solution I found that, l

Re: [ceph-users] MDS Bug/Problem

2018-03-28 Thread Perrin, Christopher (zimkop1)
Hi It is Possible that I have extracted the wrong log message. I will look into that. What happened is that out of the blue all MDSs started failing. Only after many failed stating attempts with the OSDs blocking "old" messages I reset the journal. After the MDSs where running again we had sever