Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication]

Geert Jansen Tue, 29 May 2012 09:20:20 -0700

Hi,

On 05/24/2012 04:19 PM, Paolo Bonzini wrote:

Here is how the bitmaps are handled when doing I/O on the source:
- after writing to the source:
   - clear bit in the volatile in-flight bitmap
   - set bit in the persistent dirty bitmap

- after flushing the source:
   - msync the persistent bitmap to disk


Here is how the bitmaps are handled in the drive-mirror coroutine:
- before reading from the source:
   - set bit in the volatile in-flight bitmap

- after writing to the target:
   - if the dirty count will become zero, flush the target
   - if the bit is still set in the in-flight bitmap, clear bit in the
     persistent dirty bitmap
   - clear bit in the volatile in-flight bitmap
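
To check my reading of the rules above, here is a minimal Python sketch of the two bitmaps. It is not QEMU code: the class `MirrorBitmaps`, its method names, and the `flush_target` callback are hypothetical. Plain lists stand in for the bitmaps, and a copy of the list stands in for the msync.

```python
class MirrorBitmaps:
    """Toy model of the volatile in-flight bitmap and the persistent dirty bitmap."""

    def __init__(self, nbits):
        self.in_flight = [False] * nbits  # volatile, lost on crash
        self.dirty = [False] * nbits      # stand-in for the mmap'ed persistent bitmap
        self.persisted = [False] * nbits  # what would be on disk after the last msync

    # Guest I/O path on the source.
    def after_source_write(self, bit):
        self.in_flight[bit] = False  # any copy the mirror holds is now stale
        self.dirty[bit] = True

    def after_source_flush(self):
        # Assumption: a real implementation would msync the bitmap here.
        self.persisted = list(self.dirty)

    # drive-mirror coroutine.
    def before_source_read(self, bit):
        self.in_flight[bit] = True

    def after_target_write(self, bit, flush_target):
        still_valid = self.in_flight[bit]
        # The dirty count only reaches zero if this is the last dirty bit
        # and no guest write raced with the copy.
        if still_valid and self.dirty[bit] and sum(self.dirty) == 1:
            flush_target()
        if still_valid:
            self.dirty[bit] = False
        self.in_flight[bit] = False
```

If a guest write lands between `before_source_read` and `after_target_write`, the in-flight bit is already clear, so the dirty bit stays set and the block is copied again later.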


I have a few questions, apologies if some of these are obvious..

I assume the target can be any QEMU block driver, including e.g. NBD? A networked block driver would be required for a continuous replication solution.

Does the drive-mirror coroutine send the writes to the target in the same order as they are sent to the source? I assume so.

Does the drive-mirror coroutine require that writes are acknowledged? I'd assume so, as you mention that the bit from the persistent bitmap is cleared after a write, so you'd need to know the write arrived; otherwise you cannot safely clear the bit.

If the two above are true (sending in order, and requiring acknowledgment of writes by the target), then I assume there is a need to keep an in-memory list of the I/Os that still need to be sent to the target? That list could get too large if e.g. the target cannot keep up or becomes unavailable. When this happens, the dirty bitmap is needed to re-establish synchronized state between the two images.
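
To make the scenario concrete, here is a rough Python sketch of what I have in mind. It illustrates the question and is not a description of the actual design: `OrderedMirrorQueue`, `max_pending`, and the fallback policy are all my assumptions.

```python
from collections import deque


class OrderedMirrorQueue:
    """Bounded in-order list of unacknowledged writes, with a dirty-set fallback."""

    def __init__(self, max_pending):
        self.max_pending = max_pending  # hypothetical limit on queued writes
        self.pending = deque()          # (block, data) in source write order
        self.dirty = set()              # stand-in for the dirty bitmap
        self.in_sync = True             # False once ordering has been given up

    def guest_write(self, block, data):
        if self.in_sync and len(self.pending) < self.max_pending:
            self.pending.append((block, data))
            return
        if self.in_sync:
            # The target cannot keep up: drop the ordered list and remember
            # only which blocks differ.
            self.in_sync = False
            self.dirty.update(b for b, _ in self.pending)
            self.pending.clear()
        self.dirty.add(block)

    def target_ack(self):
        # The oldest queued write has been acknowledged by the target.
        return self.pending.popleft()
```

Once `in_sync` is false the write order is gone, so only the set of dirty blocks is left to drive a re-sync.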

For this re-sync, I think there will be two phases. The first phase would send blocks marked as dirty by the bitmap. I assume these would be sent in arbitrary order, not the order in which they were written to the source, right?

After the copy phase is done, in order to avoid race conditions, the bitmap should be reset and mirroring should start directly and atomically. Is that currently handled by your design?

Also, the target would probably need some kind of signal that the copy has ended and that we are now mirroring, because this is when writes are in order again, and therefore only in this phase can the solution provide crash-consistent protection. In the copy phase no crash consistency can be provided, if I am not mistaken.

Finally, again if I am not mistaken, I think that the scenario where synchronization is lost with the target is exactly the same as when you need to do an initial copy, except that in the latter case all bits in the bitmap are set, right?
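
Put differently, I would expect something like the following Python sketch to cover both cases. The function names and the list-of-blocks model are hypothetical and ignore concurrent guest writes.

```python
def resync(source, target, dirty):
    """Copy every block marked dirty from source to target, in arbitrary order."""
    copied = 0
    while dirty:
        block = dirty.pop()  # set.pop() returns an arbitrary element
        target[block] = source[block]
        copied += 1
    return copied


def initial_copy(source, target):
    """An initial copy is a re-sync that starts with every bit set."""
    return resync(source, target, set(range(len(source))))
```

For example, `resync(src, dst, {1, 3})` copies only blocks 1 and 3, while `initial_copy(src, dst)` copies all of them through the same code path.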


Regards,
Geert
