> > > > scsi_disk_emulate_command() can be called from a coroutine;
> > > > when req->cmd.buf[0] is ALLOW_MEDIUM_REMOVAL, the synchronous
> > > > blk_lock_medium() is called, which hits assert(!qemu_in_coroutine()),
> > > > and crashes:
> > > > 
> > > >         qemu-system-hppa: block/block-gen.c:1692: blk_lock_medium:
> > > >         Assertion `!qemu_in_coroutine()' failed.
> > > > 
> > > > blk_eject() has the same problem, it can be called from coroutine,
> > > > because the same vtable entry (SCSIReqOps.send_command,
> > > > scsi_disk_emulate_command() here) calls blk_eject when req->cmd.buf[0]
> > > > is START_STOP.
> > > > 
> > > > Fix by switching to co_wrapper_mixed for blk_lock_medium() and
> > > > blk_eject() instead of just co_wrapper.
> > > > 
> > > > Signed-off-by: Boudewijn van der Heide <[email protected]>
> > > 
> > > Yes, that will fix your immediate symptoms. I'm not completely sure if
> > > that's the level where we want to fix things, though, which is why I
> > > added Paolo and Stefan to Cc.
> > > > ---
> > > > Observed crash on fedora qemu-10.1.5-1.fc43 on x86_64 host using 
> > > > qemu-system-hppa.
> > > > 
> > > > trace:
> > > > 
> > > > Thread 1 (...):
> > > >         # 5 __assert_fail (...) at assert.c
> > > >         # 6 blk_lock_medium (...) at block/block-gen.c
> > > >         # 7 scsi_disk_emulate_command (...) at ../hw/scsi/scsi-disk.c
> > > >         # 8 scsi_req_enqueue (...) at ../hw/scsi/scsi-bus.c
> > > >         # 9 lsi_do_command (...) at ../hw/scsi/lsi53c895a.c
> > > >         # 10 lsi_execute_script (...) at ../hw/scsi/lsi53c895a.c
> > > >         # 11 scsi_read_complete_noio (...) at ../hw/scsi/scsi-disk.c
> > > >         # 12 blk_aio_complete (...) at ../block/block-backend.c
> > > >         # 14 blk_aio_read_entry (...) at ../block/block-backend.c
> > > >         # 15 in coroutine_trampoline (...) at 
> > > > ../util/coroutine-ucontext.c
> > > > 
> > > > blk_eject() has the same path, but req->cmd.buf[0] is START_STOP instead
> > > > of ALLOW_MEDIUM_REMOVAL inside scsi_disk_emulate_command(),
> > > > so fix that aswell.
> > > 
> > > This would be useful information to have in the commit message proper.

I can add that in a later revision if you would want to, do you mean the trace,
or the comment, or both?

> > > 
> > > The step from #10 to #11 seems to be a rather big one that left out
> > > intermediate function calls, so I'm not sure if I fully understand it.
> > > 
> > > Either way, I don't think lsi_execute_script() and the functions called
> > > by it were ever written with the intention to run in a coroutine. I'm
> > > almost sure that problems can be in more than just the two functions
> > > you're changing here. So my question is mostly, should the path involve
> > > a BH somewhere to break out of coroutine context, instead of making one
> > > or two specific cases work?
> > 
> > The code path in this bug is from the block layer (blk_aio_read_entry ->
> > blk_aio_complete) to scsi-disk (scsi_read_complete_noio) into the LSI
> > device (probably lsi_command_complete -> lsi_resume_script ->
> > lsi_execute_script).

IIUC, from what I can see,
we can trace it manually to this
(note; I reversed it to make it more readable):

         # 11 scsi_read_complete_noio (...) at ../hw/scsi/scsi-disk.c

         scsi_read_complete_noio() ->
         scsi_req_data(&r->req, r->qiov.size);
         r->req->bus->info->transfer_data
             // from the &r->req in scsi_req_data() call; 
             // void (*transfer_data) vtable ->

         .transfer_data = lsi_transfer_data,  
             // inside scsibusinfo lsi_scsi_info, for our lsi here ->

         lsi_transfer_data() ->
         lsi_resume_script() ->
         lsi_execute_script() ->
         # 10 lsi_execute_script (...) at ../hw/scsi/lsi53c895a.c

Also for the BH breakout path idea, IMHO it would be a bit hacky.
I tested something local, but I had to defer mid-execution script
to a cb and it would be mimicking essentially what the co_wrapper_mixed
generated code would do.

> > I'm not sure if the LSI device, which is unaware of coroutines, should
> > have to work around this. It seems cleaner for blk_aio_preadv() to
> > invoke cb() from outside coroutine context since that API does not say
> > anything about coroutine contexts.

I'm also not sure if LSI device should handle/intereact with this.

Also I'm not sure that blk_aio_preadv()/blk_aio_prwv()
is the right place for this, we would be changing the API semantics there, no?
blk_aio_preadv() calls blk_aio_prwv(), which takes CoroutineEntry co_entry
and then calls qemu_coroutine_create().

> > On the other hand, there will be a performance overhead for leaving
> > coroutine context even when it would be fine to run in a coroutine...
> 
> Yes, I don't think it should be done by the block layer when most
> callbacks are just fine with running in a coroutine. I was thinking that
> maybe something in the SCSI layer should do it, or even scsi-disk
> specifically. Though of course, if virtio-scsi doesn't have the problem
> (does it?), taking the performance overhead for fixing LSI is probably
> also not what we want.
> 
> Another, more radical solution, could be to move SCSI requests as a
> whole into coroutines, changing the assumption everywhere from
> no_coroutine_fn to coroutine_fn without bothering with mixed functions
> at all. The SCSI state machine is quite complex and I often find the
> control flow in the current callback-based code hard to understand, so I
> think this has the potential to improve the code, though of course it
> would also be a big change.

I think I agree that refactoring SCSI requests all into coroutines
would be a good future approach, however what if there is some path
for SCSI state machine that really can't handle coroutines somewhere
and so then up in the chain you make it no_coroutine_fn,
we would still be leaving other downstream paths which are coroutine safe off 
the table
and we would be in the same situation as we currently are.

Maybe it's worth considering adding some compile time checking
for the coroutine wrapper generator. 
This would ensure coroutine API being upheld at compile-time.
Though, this might be a pretty big undertaking, with potentially
some compiler specific hacks.

> Or, of course, we just take this patch, and keep fixing instances of
> the problem as they come up.

We can do both IMO, for shorter term we can patch the offending paths
like I did in my patch here and also formulate a long term approach.

> Kevin

Thanks,
Boudewijn

> > > 
> > > > ---
> > > >  include/system/block-backend-io.h | 4 ++--
> > > >  1 file changed, 2 insertions(+), 2 deletions(-)
> > > > 
> > > > diff --git a/include/system/block-backend-io.h 
> > > > b/include/system/block-backend-io.h
> > > > index fd84723d9d..7368ad5c09 100644
> > > > --- a/include/system/block-backend-io.h
> > > > +++ b/include/system/block-backend-io.h
> > > > @@ -81,10 +81,10 @@ bool coroutine_fn GRAPH_RDLOCK 
> > > > blk_co_is_available(BlockBackend *blk);
> > > >  bool co_wrapper_mixed_bdrv_rdlock blk_is_available(BlockBackend *blk);
> > > >  
> > > >  void coroutine_fn blk_co_lock_medium(BlockBackend *blk, bool locked);
> > > > -void co_wrapper blk_lock_medium(BlockBackend *blk, bool locked);
> > > > +void co_wrapper_mixed blk_lock_medium(BlockBackend *blk, bool locked);
> > > >  
> > > >  void coroutine_fn blk_co_eject(BlockBackend *blk, bool eject_flag);
> > > > -void co_wrapper blk_eject(BlockBackend *blk, bool eject_flag);
> > > > +void co_wrapper_mixed blk_eject(BlockBackend *blk, bool eject_flag);
> > > >  
> > > >  int64_t coroutine_fn blk_co_getlength(BlockBackend *blk);
> > > >  int64_t co_wrapper_mixed blk_getlength(BlockBackend *blk);
> > > > -- 
> > > > 2.54.0
> > > > 
> > >

Reply via email to