Something in the MDS log is bad or is poking at a bug in the code. Can
you turn on MDS debugging and restart a daemon and put that log
somewhere accessible?
debug mds = 20
debug journaler = 20
debug ms = 1
-Greg

On Mon, Oct 15, 2012 at 10:02 AM, Nick Couchman <nick.couch...@seakr.com> wrote:
> Well, both of my MDSs seem to be down right now, and then continually 
> segfault (every time I try to start them) with the following:
>
> ceph-mdsmon-a:~ # ceph-mds -n mds.b -c /etc/ceph/ceph.conf -f
> starting mds.b at :/0
> *** Caught signal (Segmentation fault) **
>  in thread 7fbe0d61d700
>  ceph version 0.48.1argonaut (commit:a7ad701b9bd479f20429f19e6fea7373ca6bba7c)
>  1: ceph-mds() [0x7ef83a]
>  2: (()+0xfd00) [0x7fbe15a0cd00]
>  3: (ESession::replay(MDS*)+0x3ea) [0x4dcfea]
>  4: (MDLog::_replay_thread()+0x6b6) [0x6a2446]
>  5: (MDLog::ReplayThread::entry()+0xd) [0x4cf5ed]
>  6: (()+0x7f05) [0x7fbe15a04f05]
>  7: (clone()+0x6d) [0x7fbe14bc410d]
> 2012-10-15 10:57:35.449161 7fbe0d61d700 -1 *** Caught signal (Segmentation 
> fault) **
>  in thread 7fbe0d61d700
>
>  ceph version 0.48.1argonaut (commit:a7ad701b9bd479f20429f19e6fea7373ca6bba7c)
>  1: ceph-mds() [0x7ef83a]
>  2: (()+0xfd00) [0x7fbe15a0cd00]
>  3: (ESession::replay(MDS*)+0x3ea) [0x4dcfea]
>  4: (MDLog::_replay_thread()+0x6b6) [0x6a2446]
>  5: (MDLog::ReplayThread::entry()+0xd) [0x4cf5ed]
>  6: (()+0x7f05) [0x7fbe15a04f05]
>  7: (clone()+0x6d) [0x7fbe14bc410d]
>  NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to 
> interpret this.
>
>      0> 2012-10-15 10:57:35.449161 7fbe0d61d700 -1 *** Caught signal 
> (Segmentation fault) **
>  in thread 7fbe0d61d700
>
>  ceph version 0.48.1argonaut (commit:a7ad701b9bd479f20429f19e6fea7373ca6bba7c)
>  1: ceph-mds() [0x7ef83a]
>  2: (()+0xfd00) [0x7fbe15a0cd00]
>  3: (ESession::replay(MDS*)+0x3ea) [0x4dcfea]
>  4: (MDLog::_replay_thread()+0x6b6) [0x6a2446]
>  5: (MDLog::ReplayThread::entry()+0xd) [0x4cf5ed]
>  6: (()+0x7f05) [0x7fbe15a04f05]
>  7: (clone()+0x6d) [0x7fbe14bc410d]
>  NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to 
> interpret this.
>
> Segmentation fault
>
> Anyone have any hints on recovering?  I'm running 0.48.1argonaut - I can 
> attempt to upgrade to 0.48.2 and see if that helps, but I figured if anyone 
> can offer any insight as to what to do to get the replay to run without 
> segfaulting?
>
>
>
> --------
> This e-mail may contain confidential and privileged material for the sole use 
> of the intended recipient.  If this email is not intended for you, or you are 
> not responsible for the delivery of this message to the intended recipient, 
> please note that this message may contain SEAKR Engineering (SEAKR) 
> Privileged/Proprietary Information.  In such a case, you are strictly 
> prohibited from downloading, photocopying, distributing or otherwise using 
> this message, its contents or attachments in any way.  If you have received 
> this message in error, please notify us immediately by replying to this 
> e-mail and delete the message from your mailbox.  Information contained in 
> this message that does not relate to the business of SEAKR is neither 
> endorsed by nor attributable to SEAKR.
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to