Re: [Qemu-devel] [RFC PATCH 0/5] asynchronous migration state change handlers

Yonit Halperin Wed, 06 Jun 2012 02:13:11 -0700

Hi,

I would like to add some more points to Gerd's explanation:
On 06/05/2012 04:15 PM, Gerd Hoffmann wrote:

Hi,

Absolutely not.  This is hideously ugly and affects a bunch of code.

Spice is *not* getting a hook in migration where it gets to add
arbitrary amounts of downtime to the migration traffic.  That's a
terrible idea.

I'd like to be more constructive in my response, but you aren't
explaining the problem well enough for me to offer an alternative
solution.  You need to find another way to solve this problem.

Actually, this is not the first time we address you with this issues.For example:http://lists.gnu.org/archive/html/qemu-devel/2012-03/msg01805.html (Thefirst part of the above discussion is not directly related to thecurrent one). I'll try to explain in more details:

As Gerd mentioned, migrating the spice connection smoothly requires thesrc server to keep running and send/receive data to/from the client,after migration has already completed, till the client completelytransfers to the target. The suggested patch series only delays themigration state change from ACTIVE to COMPLETED/ERROR/CANCELED, tillspice signals it has completed its part in migration.As I see it, if spice connection does exists, its migration should betreated as a non separate part of the whole migration process, and thus,the migration state shouldn't change from ACTIVE, till spice hascompleted its part. Hence, I don't think we should have a qmp event forsignaling libvirt about spice migration.

The second challenge we are facing, which I addressed in the "plans"part of the cover-letter, and on which I think you (anthony) actuallyreplied, is how to tackle migrating spice data from the src server tothe target server. Such data can be usb/smartcard packets sent from adevice connected on the client, to the server, and that haven't reachedthe device. Or partial data that has been read from a guest characterdevice and that haven't been sent to the client. Other data can beinternal server-client state data we would wish to keep on the server inorder to avoid establishing the connection to the target from scratch,and possibly also suffer from a slower responsiveness at start.In the cover-letter I suggested to transfer spice migration data via thevmstate infrastructure. The other alternative which we also discussed inthe link above, is to transfer the data via the client. The latter alsorequires holding the src process alive after migration completion, inorder to manage to complete transferring the data from the src to theclient.The vmstate option has the advantages of faster data transfer (src->dst,instead of src->client->dst), and in addition employing an alreadyexisting reliable mechanism for data migration. The disadvantage is thatin order to have an updated vmstate we need to communicate with spiceclient and get all in-flight data before saving the vmstate. So, we caneither busy wait on the relevant fds during the pre_save of thevmstates, or have async pre_save, so that the main loop will be active(but I think that it can be risky once the non-live phase started), orhave an async notifier for changing from live to non-live phase, (spicewill be able to update the vmstates during this notification handler).Of course, we would in any case use a timeout in order to prevent toolong delay.

To summarize, since we can still use the client to transfer data fromthe src to the target (instead of using vmstate), the major requirementof spice, is to keep the src running after migration has completed.


Yonit.


Very short version:  The requirement is simply to not kill qemu on the
source side until the source spice-server has finished session handover
to the target spice-server.

Long version:  spice-client connects automatically to the target
machine, so the user ideally doesn't notice that his virtual machine was
just migrated over to another host.

Today this happens via "switch-host", which is a simple message asking
the spice client to connect to the new host.

We want move to "seamless migration" model where we don't start over
from scratch, but hand over the session from the source to the target.
Advantage is that various state cached in spice-client will stay valid
and doesn't need to be retransmitted.  It also requires a handshake
between spice-servers on source and target.  libvirt killing qemu on the
source host before the handshake is done isn't exactly helpful.

[ Side note: In theory this issue exists even today: in case the data
   pipe to the client is full spice-server will queue up the switch-host
   message and qemu might be killed before it is sent out.  In practice
   it doesn't happen though because it goes through the low-traffic main
   channel so the socket buffers usually have enougth space. ]

So, the big question is how to tackle the issue?

Option (1): Wait until spice-server is done before signaling completion
to libvirt.  This is what this patch series implements.

Advantage is that it is completely transparent for libvirt, thats why I
like it.

Disadvantage is that it indeed adds a small delay for the spice-server
handshake.  The target qemu doesn't process main loop events while the
incoming migration is running, and because of that the spice-server
handshake doesn't run in parallel with the final stage of vm migration,
which it could in theory.

BTW: There will be no "arbitrary amounts of downtime".  Seamless spice
client migration is pretty pointless if it doesn't finish within a
fraction of a second, so we can go with a very short timeout there.

Option (2): Add a new QMP event which is emmitted when spice-server is
done, then make libvirt wait for it before killing qemu.

Obvious disadvantage is that it requires libvirt changes.

Option (3): Your suggestion?

thanks,
   Gerd

Re: [Qemu-devel] [RFC PATCH 0/5] asynchronous migration state change handlers

Reply via email to