On Fri, 17 May 2013, Joe Ryner wrote:
> Hi All,
> 
> I have had an issue recently while working on my ceph clusters.  The 
> following issue seems to be true on bobtail and cuttlefish.  I have two 
> production clusters in two different data centers and a test cluster.  We are 
> using ceph to run virtual machines.  I use rbd as block devices for sanlock.
> 
> I am running Fedora 18.
> 
> I have been moving monitors around and in the process I got the cluster 
> out of quorum, so ceph stopped responding.  During this time I decided 
> to reboot a ceph node that performs an rbd map during startup.  The 
> system boots ok but the service script that is performing the rbd map 
> doesn't finish and eventually the system will OOPS and then finally 
> panic.  I was able to disable the rbd map during boot and finally got 
> the cluster back in quorum and everything settled down nicely.

What kernel version?  Are you using cephx authentication?  If you could 
open a bug at tracker.ceph.com that would be most helpful!

> Question, has anyone seen this behavior of crashing/panic?  I have seen this 
> happen on both of my production clusters.
> Secondly, the ceph command hangs when the cluster is out of quorum, is there 
> a timeout available?

Not currently.  You can do this yourself with 'timeout 120 ...' with any 
recent coreutils.

Thanks-
sage
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to