Re: [PATCH v3 14/17] vfio/migration: Reset device if setting recover state fails

2022-11-20 Thread Avihai Horon



On 17/11/2022 20:18, Alex Williamson wrote:
> On Thu, 17 Nov 2022 19:11:47 +0200
> Avihai Horon  wrote:
>
> > On 16/11/2022 20:36, Alex Williamson wrote:
> > > On Thu, 3 Nov 2022 18:16:17 +0200
> > > Avihai Horon  wrote:
> > >
> > >> If vfio_migration_set_state() fails to set the device in the requested
> > >> state it tries to put it in a recover state. If setting the device in
> > >> the recover state fails as well, hw_error is triggered and the VM is
> > >> aborted.
> > >>
> > >> To improve user experience and avoid VM data loss, reset the device with
> > >> VFIO_DEVICE_RESET instead of aborting the VM.
> > >>
> > >> Signed-off-by: Avihai Horon 
> > >> ---
> > >>  hw/vfio/migration.c | 14 ++++++++++++--
> > >>  1 file changed, 12 insertions(+), 2 deletions(-)
> > >>
> > >> diff --git a/hw/vfio/migration.c b/hw/vfio/migration.c
> > >> index f8c3228314..e8068b9147 100644
> > >> --- a/hw/vfio/migration.c
> > >> +++ b/hw/vfio/migration.c
> > >> @@ -92,8 +92,18 @@ static int vfio_migration_set_state(VFIODevice *vbasedev,
> > >>
> > >>      mig_state->device_state = recover_state;
> > >>      if (ioctl(vbasedev->fd, VFIO_DEVICE_FEATURE, feature)) {
> > >> -        hw_error("%s: Failed setting device in recover state, err: %s",
> > >> -                 vbasedev->name, strerror(errno));
> > >> +        error_report(
> > >> +            "%s: Failed setting device in recover state, err: %s. Resetting device",
> > >> +            vbasedev->name, strerror(errno));
> > >> +
> > >> +        if (ioctl(vbasedev->fd, VFIO_DEVICE_RESET)) {
> > >> +            hw_error("%s: Failed resetting device, err: %s", vbasedev->name,
> > >> +                     strerror(errno));
> > >> +        }
> > >> +
> > >> +        migration->device_state = VFIO_DEVICE_STATE_RUNNING;
> > >> +
> > >> +        return -1;
> > >>      }
> > >>
> > >>      migration->device_state = recover_state;
> > >
> > > This addresses one of my comments on 12/ and should probably be rolled
> > > in there.
> >
> > Not sure which comment you are referring to. Could you elaborate?
>
> Hmm, I guess I thought this was in the section immediately following
> where I questioned going to recovery state.  I'm still not sure why
> this is a separate patch from the initial implementation of the
> function in 12/ though.


This adds new functionality compared to v1, so I thought it should be
in its own patch.

I can squash it into patch 12 if you want.




Re: [PATCH v3 14/17] vfio/migration: Reset device if setting recover state fails

2022-11-17 Thread Alex Williamson
On Thu, 17 Nov 2022 19:11:47 +0200
Avihai Horon  wrote:

> On 16/11/2022 20:36, Alex Williamson wrote:
> > On Thu, 3 Nov 2022 18:16:17 +0200
> > Avihai Horon  wrote:
> >  
> >> If vfio_migration_set_state() fails to set the device in the requested
> >> state it tries to put it in a recover state. If setting the device in
> >> the recover state fails as well, hw_error is triggered and the VM is
> >> aborted.
> >>
> >> To improve user experience and avoid VM data loss, reset the device with
> >> VFIO_DEVICE_RESET instead of aborting the VM.
> >>
> >> Signed-off-by: Avihai Horon 
> >> ---
> >>  hw/vfio/migration.c | 14 ++++++++++++--
> >>  1 file changed, 12 insertions(+), 2 deletions(-)
> >>
> >> diff --git a/hw/vfio/migration.c b/hw/vfio/migration.c
> >> index f8c3228314..e8068b9147 100644
> >> --- a/hw/vfio/migration.c
> >> +++ b/hw/vfio/migration.c
> >> @@ -92,8 +92,18 @@ static int vfio_migration_set_state(VFIODevice *vbasedev,
> >>
> >>      mig_state->device_state = recover_state;
> >>      if (ioctl(vbasedev->fd, VFIO_DEVICE_FEATURE, feature)) {
> >> -        hw_error("%s: Failed setting device in recover state, err: %s",
> >> -                 vbasedev->name, strerror(errno));
> >> +        error_report(
> >> +            "%s: Failed setting device in recover state, err: %s. Resetting device",
> >> +            vbasedev->name, strerror(errno));
> >> +
> >> +        if (ioctl(vbasedev->fd, VFIO_DEVICE_RESET)) {
> >> +            hw_error("%s: Failed resetting device, err: %s", vbasedev->name,
> >> +                     strerror(errno));
> >> +        }
> >> +
> >> +        migration->device_state = VFIO_DEVICE_STATE_RUNNING;
> >> +
> >> +        return -1;
> >>      }
> >>
> >>      migration->device_state = recover_state;
> > This addresses one of my comments on 12/ and should probably be rolled
> > in there.  
> 
> Not sure which comment you are referring to. Could you elaborate?

Hmm, I guess I thought this was in the section immediately following
where I questioned going to recovery state.  I'm still not sure why
this is a separate patch from the initial implementation of the
function in 12/ though.  Thanks,

Alex




Re: [PATCH v3 14/17] vfio/migration: Reset device if setting recover state fails

2022-11-17 Thread Avihai Horon



On 16/11/2022 20:36, Alex Williamson wrote:
> On Thu, 3 Nov 2022 18:16:17 +0200
> Avihai Horon  wrote:
>
> > If vfio_migration_set_state() fails to set the device in the requested
> > state it tries to put it in a recover state. If setting the device in
> > the recover state fails as well, hw_error is triggered and the VM is
> > aborted.
> >
> > To improve user experience and avoid VM data loss, reset the device with
> > VFIO_DEVICE_RESET instead of aborting the VM.
> >
> > Signed-off-by: Avihai Horon 
> > ---
> >  hw/vfio/migration.c | 14 ++++++++++++--
> >  1 file changed, 12 insertions(+), 2 deletions(-)
> >
> > diff --git a/hw/vfio/migration.c b/hw/vfio/migration.c
> > index f8c3228314..e8068b9147 100644
> > --- a/hw/vfio/migration.c
> > +++ b/hw/vfio/migration.c
> > @@ -92,8 +92,18 @@ static int vfio_migration_set_state(VFIODevice *vbasedev,
> >
> >      mig_state->device_state = recover_state;
> >      if (ioctl(vbasedev->fd, VFIO_DEVICE_FEATURE, feature)) {
> > -        hw_error("%s: Failed setting device in recover state, err: %s",
> > -                 vbasedev->name, strerror(errno));
> > +        error_report(
> > +            "%s: Failed setting device in recover state, err: %s. Resetting device",
> > +            vbasedev->name, strerror(errno));
> > +
> > +        if (ioctl(vbasedev->fd, VFIO_DEVICE_RESET)) {
> > +            hw_error("%s: Failed resetting device, err: %s", vbasedev->name,
> > +                     strerror(errno));
> > +        }
> > +
> > +        migration->device_state = VFIO_DEVICE_STATE_RUNNING;
> > +
> > +        return -1;
> >      }
> >
> >      migration->device_state = recover_state;
>
> This addresses one of my comments on 12/ and should probably be rolled
> in there.

Not sure which comment you are referring to. Could you elaborate?

Thanks!

> Thanks,
>
> Alex





Re: [PATCH v3 14/17] vfio/migration: Reset device if setting recover state fails

2022-11-16 Thread Alex Williamson
On Thu, 3 Nov 2022 18:16:17 +0200
Avihai Horon  wrote:

> If vfio_migration_set_state() fails to set the device in the requested
> state it tries to put it in a recover state. If setting the device in
> the recover state fails as well, hw_error is triggered and the VM is
> aborted.
> 
> To improve user experience and avoid VM data loss, reset the device with
> VFIO_DEVICE_RESET instead of aborting the VM.
> 
> Signed-off-by: Avihai Horon 
> ---
>  hw/vfio/migration.c | 14 ++++++++++++--
>  1 file changed, 12 insertions(+), 2 deletions(-)
> 
> diff --git a/hw/vfio/migration.c b/hw/vfio/migration.c
> index f8c3228314..e8068b9147 100644
> --- a/hw/vfio/migration.c
> +++ b/hw/vfio/migration.c
> @@ -92,8 +92,18 @@ static int vfio_migration_set_state(VFIODevice *vbasedev,
>  
>      mig_state->device_state = recover_state;
>      if (ioctl(vbasedev->fd, VFIO_DEVICE_FEATURE, feature)) {
> -        hw_error("%s: Failed setting device in recover state, err: %s",
> -                 vbasedev->name, strerror(errno));
> +        error_report(
> +            "%s: Failed setting device in recover state, err: %s. Resetting device",
> +            vbasedev->name, strerror(errno));
> +
> +        if (ioctl(vbasedev->fd, VFIO_DEVICE_RESET)) {
> +            hw_error("%s: Failed resetting device, err: %s", vbasedev->name,
> +                     strerror(errno));
> +        }
> +
> +        migration->device_state = VFIO_DEVICE_STATE_RUNNING;
> +
> +        return -1;
>      }
>  
>      migration->device_state = recover_state;

This addresses one of my comments on 12/ and should probably be rolled
in there.  Thanks,

Alex
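
For readers following the thread, the fallback order described in the commit
message (requested state, then recover state, then device reset, and abort
only if the reset fails too) can be sketched in isolation as below. This is
not the QEMU code: struct fake_dev, fake_set_state(), fake_reset(), the
enums and set_state_with_fallback() are all hypothetical stand-ins that
simulate ioctl failures with flags, and the outcome enum replaces the real
function's return convention.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical device states; not the kernel's VFIO_DEVICE_STATE_* values. */
enum dev_state { DEV_STOP, DEV_RUNNING, DEV_ERROR };

/* Hypothetical result codes, one per step of the fallback chain. */
enum outcome { OUTCOME_OK, OUTCOME_RECOVERED, OUTCOME_RESET, OUTCOME_ABORT };

/* Simulated device: each flag forces the corresponding step to fail. */
struct fake_dev {
    enum dev_state state;
    bool fail_requested;
    bool fail_recover;
    bool fail_reset;
};

/* Stand-in for the state-setting ioctl: 0 on success, -1 on failure. */
static int fake_set_state(struct fake_dev *dev, enum dev_state s, bool fail)
{
    if (fail) {
        return -1;
    }
    dev->state = s;
    return 0;
}

/* Stand-in for the reset ioctl; a successful reset leaves the device running. */
static int fake_reset(struct fake_dev *dev)
{
    if (dev->fail_reset) {
        return -1;
    }
    dev->state = DEV_RUNNING;
    return 0;
}

/* Try the requested state, then the recover state, then a reset. */
enum outcome set_state_with_fallback(struct fake_dev *dev,
                                     enum dev_state new_state,
                                     enum dev_state recover_state)
{
    assert(dev != NULL);

    if (fake_set_state(dev, new_state, dev->fail_requested) == 0) {
        return OUTCOME_OK;
    }
    if (fake_set_state(dev, recover_state, dev->fail_recover) == 0) {
        return OUTCOME_RECOVERED;
    }

    fprintf(stderr, "failed setting recover state, resetting device\n");
    if (fake_reset(dev) == 0) {
        return OUTCOME_RESET;
    }

    /* The patch calls hw_error() here; the sketch only reports it. */
    return OUTCOME_ABORT;
}
```

With fail_requested and fail_recover set but fail_reset clear, the call
returns OUTCOME_RESET and leaves the device in DEV_RUNNING, which mirrors the
patch setting migration->device_state to VFIO_DEVICE_STATE_RUNNING after a
successful reset. OUTCOME_ABORT is reached only when all three steps fail.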