Re: [ceph-users] Monitor failure after series of traumatic network failures

2015-03-24, Greg Chavez
This was excellent advice. It should be on some official Ceph troubleshooting page. It takes a while for the monitors to deal with new info, but it works. Thanks again!
--Greg
On Wed, Mar 18, 2015 at 5:24 PM, Sage Weil wrote:
> On Wed, 18 Mar 2015, Greg Chavez wrote:
> > We have a cuttlefish (0

Re: [ceph-users] Monitor failure after series of traumatic network failures

2015-03-18, Sage Weil
On Wed, 18 Mar 2015, Greg Chavez wrote:
> We have a cuttlefish (0.61.9) 192-OSD cluster that has lost network
> availability several times since this past Thursday and whose nodes were all
> rebooted twice (hastily and inadvisably each time). The final reboot, which
> was supposed to be "the last t

[ceph-users] Monitor failure after series of traumatic network failures

2015-03-18, Greg Chavez
We have a cuttlefish (0.61.9) 192-OSD cluster that has lost network availability several times since this past Thursday and whose nodes were all rebooted twice (hastily and inadvisably each time). The final reboot, which was supposed to be "the last thing" before recovery according to our data cent