Re: [ceph-users] Recovery after datacenter outage

Gregory Farnum Fri, 22 Jun 2018 13:48:01 -0700

On Fri, Jun 22, 2018 at 2:26 AM Christian Zunker
<christian.zunker@codecentric.cloud> wrote:


> Hi List,
>
> we are running a ceph cluster (12.2.5) as backend to our OpenStack cloud.
>
> Yesterday our datacenter had a power outage. As this wouldn't be enough,
> we also had a separated ceph cluster because of networking problems.
>
> First of all thanks a lot to the ceph developers. After the network was
> back to normal, ceph recovered itself. You saved us from a lot of downtime,
> lack of sleep and insanity.
>
> Now to our problem/question:
> After ceph recovered, we tried to bring up our VMs. They have cinder
> volumes saved in ceph. All VMs didn't start because of I/O problems during
> start:
> [    4.393246] JBD2: recovery failed
> [    4.395949] EXT4-fs (vda1): error loading journal
> [    4.400811] VFS: Dirty inode writeback failed for block device vda1
> (err=-5).
> mount: mounting /dev/vda1 on /root failed: Input/output error
> done.
> Begin: Running /scripts/local-bottom ... done.
> Begin: Running /scripts/init-bottom ... mount: mounting /dev on /root/dev
> failed: No such file or directory
>
> We tried to recover the disk with different methods, but all failed
> because of different reasons. What helped us at the end was a rebuild on
> the object map of each image:
> rbd object-map rebuild volumes/<uuid>
>
> From what we understood, object-map is a feature for ceph internal
> speedup. How can this lead to I/O errors in our VMs?
> Is this the expected way for a recovery?
> Did we miss something?
> Is there any documentation describing what leads to invalid object-maps
> and how to recover? (We did not find a doc on that topic...)
>

An object map definitely shouldn't lead to IO errors in your VMs; in fact I
thought it auto-repaired itself if necessary. Maybe the RBD guys can chime
in here about probable causes of trouble.

My *guess* is that perhaps your VMs or QEMU were configured to ignore
barriers or some similar thing, so that when the power failed a write was
"lost" as it got written to a new RBD object but not committed into the
object map, but the FS or database journal recorded it as complete. I can't
be sure about that though.
-Greg


>
>
> regards
> Christian
> _______________________________________________
> ceph-users mailing list
> ceph-users@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>

_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Re: [ceph-users] Recovery after datacenter outage

Reply via email to