Re: [PATCH v5 30/31] crypto: delegate permission functions to JobDriver .pre_run

Hanna Reitz Thu, 23 Dec 2021 09:16:03 -0800

On 20.12.21 16:47, Emanuele Giuseppe Esposito wrote:

On 17/12/2021 13:29, Hanna Reitz wrote:
On 24.11.21 07:44, Emanuele Giuseppe Esposito wrote:
block_crypto_amend_options_generic_luks uses the block layer
permission API, therefore it should be called with the BQL held.

However, the same function is being called ib two BlockDriver
s/ ib / by /
callbacks: bdrv_amend_options (under BQL) and bdrv_co_amend (I/O).

The latter is I/O because it is invoked by block/amend.c's
blockdev_amend_run(), a .run callback of the amend JobDriver

Therefore we want to 1) change block_crypto_amend_options_generic_luks
to use the permission API only when the BQL is held, and
2) use the .pre_run JobDriver callback to check for
permissions before switching to the job aiocontext. This has also
the benefit of applying the same permission operation to all
amend implementations, not only luks.

Signed-off-by: Emanuele Giuseppe Esposito <eespo...@redhat.com>
---
  block/amend.c  | 20 ++++++++++++++++++++
  block/crypto.c | 18 ++++++++++++------
  2 files changed, 32 insertions(+), 6 deletions(-)

diff --git a/block/amend.c b/block/amend.c
index 392df9ef83..fba6add51a 100644
--- a/block/amend.c
+++ b/block/amend.c
@@ -53,10 +53,30 @@ static int coroutine_fn blockdev_amend_run(Job*job, Error **errp)
      return ret;
  }
+static int blockdev_amend_refresh_perms(Job *job, Error **errp)
+{
+    BlockdevAmendJob *s = container_of(job, BlockdevAmendJob, common);
+
+    return bdrv_child_refresh_perms(s->bs, s->bs->file, errp);
+}
I miss some documentation for this function, why we do it and how itworks together with the bdrv_co_amend implementation.
I was trying to come up with an example text, but then I wondered –how does it actually work? bdrv_child_refresh_perms() eventually endsup in block_crypto_child_perms(). However, that will only returnexceptional permissions if crypto->updating_keys is true. But that’sset only in block_crypto_amend_options_generic_luks() – i.e. when thejob runs. That’s exactly why that function callsbdrv_child_refresh_perms() only after it has modifiedcrypto->updating_keys.
Reproducer (amend on a LUKS image with read-only=true, so it doesn’thave the WRITE permission continuously, but needs to take it as anexception in block_crypto_child_perms()):
$ qemu-img create \
     -f luks \
     --object secret,id=sec0,data=123456 \
     -o key-secret=sec0 \
     test.luks \
     64M
Formatting 'test.luks', fmt=luks size=67108864 key-secret=sec0

$ ./qemu-system-x86_64 \
     -object secret,id=sec0,data=123456 \
     -object iothread,id=iothr0 \
     -blockdev file,node-name=node0,filename=test.luks \
-blockdevluks,node-name=node1,key-secret=sec0,file=node0,read-only=true \
     -device virtio-blk,drive=node1,iothread=iothr0 -qmp stdio \
     <<EOF
{"execute": "qmp_capabilities"}
{
     "execute": "x-blockdev-amend",
     "arguments": {
         "job-id": "amend0",
         "node-name": "node1",
         "options": {
             "driver": "luks",
             "state": "active",
             "new-secret": "sec0"
         }
     }
}
EOF
{"QMP": {"version": {"qemu": {"micro": 93, "minor": 1, "major": 6},"package": "v6.2.0-rc3-50-gdb635fc4e7"}, "capabilities": ["oob"]}}
{"return": {}}
{"timestamp": {"seconds": 1639742600, "microseconds": 574641},"event": "JOB_STATUS_CHANGE", "data": {"status": "created", "id":"amend0"}}{"timestamp": {"seconds": 1639742600, "microseconds": 574919},"event": "JOB_STATUS_CHANGE", "data": {"status": "running", "id":"amend0"}}
{"return": {}}
qemu-system-x86_64: ../block/io.c:2041: bdrv_co_write_req_prepare:Assertion `child->perm & BLK_PERM_WRITE' failed.[1] 55880 IOT instruction (core dumped) ./qemu-system-x86_64-object secret,id=sec0,data=123456 -object -blockdev
I believe this means we need some new block driver function toprepare for an amendment operation. If so, another question comesup, which is whether this preparatory function should then also callbdrv_child_refresh_perms(), and then whether we should have aclean-up function for symmetry.
Yes, unfortunately it means that (see at the end of the mail for more).
I think it does not work because of crypto->updating_keys missing inblockdev_amend_pre_run(). That is why the permission is not correctlyset and the example fails.
+
+static int blockdev_amend_pre_run(Job *job, Error **errp)
+{
+    return blockdev_amend_refresh_perms(job, errp);
+}
+
+static void blockdev_amend_clean(Job *job)
+{
+    Error *errp;
+    blockdev_amend_refresh_perms(job, &errp);
Do we really want to ignore this error? If so, we shouldn’t pass apointer to an unused local variable, but NULL.
If we don’t want to ignore it, we have the option of doing what youdo here and then at least reporting a potential error witherror_report_err(), and then freeing it, and we also must initializeerrp to NULL in this case.
Going with this one above, thanks.
If we expect no error to happen (e.g. because we require the amendimplementation to only release/share permissions and notacquire/unshare them), then I’d expect passing &error_abort here.
+}
+
  static const JobDriver blockdev_amend_job_driver = {
      .instance_size = sizeof(BlockdevAmendJob),
      .job_type      = JOB_TYPE_AMEND,
      .run           = blockdev_amend_run,
+    .pre_run       = blockdev_amend_pre_run,
+    .clean         = blockdev_amend_clean,
  };
  void qmp_x_blockdev_amend(const char *job_id,
diff --git a/block/crypto.c b/block/crypto.c
index c8ba4681e2..82f154516c 100644
--- a/block/crypto.c
+++ b/block/crypto.c
@@ -780,6 +780,7 @@block_crypto_get_specific_info_luks(BlockDriverState *bs, Error **errp)
  static int
  block_crypto_amend_options_generic_luks(BlockDriverState *bs,
QCryptoBlockAmendOptions *amend_options,
+                                        bool under_bql,
This name makes sense in the context of this series, but not so muchoutside of it.
I’d rename it to e.g. “in_amend_job” (and invert its value), and thenexplain that we don’t need to refresh the child permissions whenrunning in an amend job, because that job has already taken care ofthat.
OTOH, given that I believe we need some separate preparatory functionanyway, perhaps we should just pull out thebdrv_child_refresh_perms() from this function altogether, so that wehave:
block_crypto_amend_options_luks():

/* sets updating_keys to true, and invokes bdrv_child_refresh_perms() */
block_crypto_amend_options_prepare();
block_crypto_amend_options_generic_luks();
/* sets updating_keys to false, and invokesbdrv_child_refresh_perms() */
block_crypto_amend_options_clean();


block_crypto_co_amend_luks():
/* No need to prepare or clean up, that is taken care of by the amendjob */
block_crypto_amend_options_generic_luks();
(If we decide not to put bdrv_child_refresh_perms() intoprepare()/clean(), then it would need to be called byblock_crypto_amend_options_luks(); and if we decide not to have ablock_crypto_amend_options_clean(), then we’d need to inline it fully.)
So a couple of things I will change (according with your feedbacks):
- Remove the assertion job->aio_context == qemu_in_main_thread() donein job_co_entry, as it is wrong. I don't know why I added that, but wecannot assume that job->run() always run in the main context, becausethe job aiocontext can be different. I don't think there is a testdoing that now, but it is possible. If run() was in the main context,then bdrv_co_amend (called only in blockdev_amend_run) would be GStoo, but it isn't, also according with your comment in v4:
"[...] .bdrv_co_amend very much strikes me like a GS function, but
it isn’t.  I’m afraid it must work on nodes that are not in the main
context, and it launches a job, so AFAIU we absolutely cannot run it
under the BQL."
- Introduce block_crypto_amend_options_prepare andblock_crypto_amend_options_clean, as you suggested above. These fixthe GS call stack of block_crypto_amend_options_generic_luks()
- Introduce .bdrv_pre_run() and .bdrv_cleanup(), respectively calledby .job_pre_run() and .job_cleanup(). The reason is that we need toset crypto->updating_keys, otherwise the job amend won't temporarygive the write permission so the example above would fail.
So for the I/O callstack of block_crypto_amend_options_generic_luks()we will have:
job->pre_run():
    .bdrv_pre_run();
        crypto->update_keys = true;
    blockdev_amend_refresh_perms()

job->run():
    block_crypto_amend_options_generic_luks()

job->cleanup():
    .bdrv_cleanup();
        crypto->update_keys = false;
    blockdev_amend_refresh_perms()

Sounds good! The only adjustment I’d make is to add “amend” somewherein the .bdrv functions (e.g. “.bdrv_amend_pre_run” and“.bdrv_amend_cleanup”), because AFAIU they’ll still be amend-specific,right?


(Happy holidays :))

Hanna

Re: [PATCH v5 30/31] crypto: delegate permission functions to JobDriver .pre_run

Reply via email to