On 01/26/2011 10:38 AM, Avi Kivity wrote:
On 01/26/2011 06:28 PM, Anthony Liguori wrote:
On 01/26/2011 10:13 AM, Avi Kivity wrote:
Serializing against a global mutex has the advantage that it can be
treated as a global lock that is decomposed into fine-grained locks.
For example, we can start the code conversion from an explicit async
model to a threaded sync model by converting the mutex into a
shared/exclusive lock. Operations like read and write take the lock
for shared access (and take a fine-grained mutex on the metadata
cache entry), while operations like creating a snapshot take the lock
for exclusive access. That doesn't work with freeze/thaw.
The trouble with this is that you increase the amount of re-entrance
whereas freeze/thaw doesn't.
The code from the beginning of the request to where the mutex is
acquired will be executed for every single request even while
requests are blocked at the mutex acquisition.
It's just a few instructions.
With freeze/thaw, you freeze the queue and prevent any request from
starting until you thaw. You only thaw and return control to allow
another request to execute when you begin executing an asynchronous
I/O callback.
What do you actually save? The longjmp() to the coroutine code,
linking in to the mutex wait queue, and another switch back to the
main coroutine? Given that we don't expect to block often, it seems
hardly a cost worth optimizing.
It's a matter of correctness, not optimization.
Consider the following example:
coroutine {
    l2 = find_l2(offset);
    // allocates but does not increment max cluster offset
    l2[l2_offset(offset)] = alloc_cluster();

    co_mutex_lock(&lock);
    write_l2(l2);
    co_mutex_unlock(&lock);

    l1[l1_offset(offset)] = l2;

    co_mutex_lock(&lock);
    write_l1(l1);
    co_mutex_unlock(&lock);

    commit_cluster(l2[l2_offset(offset)]);
}
This code is incorrect. The code before the first co_mutex_lock() may
be executed a dozen times before anyone finishes this routine.
That means the same cluster is reused multiple times.
This code was correct before we introduced coroutines because we
effectively had one big lock around the whole thing.
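For reference, restoring the big-lock semantics in the example above would mean widening the critical section so that cluster allocation and the commit also happen under the lock (a sketch in the same pseudocode, using the same hypothetical co_mutex_* primitives):

```
coroutine {
    co_mutex_lock(&lock);
    l2 = find_l2(offset);
    l2[l2_offset(offset)] = alloc_cluster();
    write_l2(l2);
    l1[l1_offset(offset)] = l2;
    write_l1(l1);
    commit_cluster(l2[l2_offset(offset)]);
    co_mutex_unlock(&lock);
}
```

With the lock held across the whole sequence, a second request cannot observe the allocated-but-uncommitted cluster, so no cluster is handed out twice.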
Regards,
Anthony Liguori
I think my previous example was wrong, you really want to do:
qcow2_aio_writev() {
    coroutine {
        freeze();
        sync_io(); // existing qcow2 code
        thaw();
        // existing non I/O code
        bdrv_aio_writev(callback); // no explicit freeze/thaw needed
    }
}
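A bare-bones sketch of what freeze/thaw could mean here (hypothetical, single-threaded: while frozen, newly submitted requests queue up instead of starting, and thaw() drains the queue):

```c
#include <assert.h>
#include <stdbool.h>

#define MAXQ 16

static bool frozen;
static void (*pending[MAXQ])(void);
static int npending;
static int completed;

static void do_request(void)
{
    completed++;
}

/* Entry point for new requests: run immediately, or park
 * the request if the queue is frozen. */
static void submit(void (*fn)(void))
{
    if (frozen) {
        pending[npending++] = fn;
    } else {
        fn();
    }
}

static void freeze(void)
{
    frozen = true;
}

/* Unfreeze and run everything that queued up in the meantime. */
static void thaw(void)
{
    frozen = false;
    for (int i = 0; i < npending; i++) {
        pending[i]();
    }
    npending = 0;
}
```

This shows why freeze/thaw introduces no new re-entrance: nothing between freeze() and thaw() ever yields to another request.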
This is equivalent to our existing code because no new re-entrance is
introduced. The only re-entrancy points are in the
bdrv_aio_{readv,writev} calls.
This requires you to know which code is sync, and which code is
async. My conversion allows you to wrap the code blindly with a
mutex, and have it do the right thing automatically. This is most
useful where the code can be sync or async depending on data (which is
the case for qcow2).
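To illustrate the data-dependent case (a sketch in the thread's pseudocode; the function and helper names are hypothetical): whether a metadata lookup blocks depends on whether the L2 table happens to be cached, and the mutex handles both cases without the caller knowing which one occurred.

```
qcow2_get_cluster_offset(offset) {
    co_mutex_lock(&lock);
    if (!l2_cached(offset)) {
        // blocks: this coroutine yields, other requests run,
        // but they stop at the co_mutex_lock() above
        load_l2_from_disk(offset);
    }
    ret = l2_lookup(offset);
    co_mutex_unlock(&lock);
    return ret;
}
```

In the cached case the lock is uncontended and the code behaves like the old synchronous path; in the uncached case the yield point is covered by the mutex automatically.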