[Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread Jason Wang
Commit a9f98bb5ebe6fb1869321dcc58e72041ae626ad8 ("vhost: multiqueue
support") changed the order of stopping the device. Previously,
vhost_dev_stop would disable the backend and only afterwards unset the
guest notifiers. We now unset guest notifiers while vhost is still
active. This can lose interrupts, causing guest networking to fail.

Additionally, remove the hdev->started assert in vhost.c, since we may
want to start the guest notifiers before vhost starts and stop the
guest notifiers after vhost is stopped.

In particular, this has been observed during migration.

Reported-by: Zhangjie (HZ) <zhangji...@huawei.com>
Signed-off-by: Michael S. Tsirkin <m...@redhat.com>
Signed-off-by: Jason Wang <jasow...@redhat.com>

--

Zhang Jie, please test this patch to see if it fixes the issue.
---
 hw/net/vhost_net.c | 20 ++++++++++----------
 hw/virtio/vhost.c  |  2 --
 2 files changed, 10 insertions(+), 12 deletions(-)

diff --git a/hw/net/vhost_net.c b/hw/net/vhost_net.c
index 006576d..72084ba 100644
--- a/hw/net/vhost_net.c
+++ b/hw/net/vhost_net.c
@@ -223,6 +223,12 @@ int vhost_net_start(VirtIODevice *dev, NetClientState *ncs,
         goto err;
     }
 
+    r = k->set_guest_notifiers(qbus->parent, total_queues * 2, true);
+    if (r < 0) {
+        error_report("Error binding guest notifier: %d", -r);
+        goto err;
+    }
+
     for (i = 0; i < total_queues; i++) {
         r = vhost_net_start_one(tap_get_vhost_net(ncs[i].peer), dev, i * 2);
 
@@ -231,12 +237,6 @@ int vhost_net_start(VirtIODevice *dev, NetClientState *ncs,
         }
     }
 
-    r = k->set_guest_notifiers(qbus->parent, total_queues * 2, true);
-    if (r < 0) {
-        error_report("Error binding guest notifier: %d", -r);
-        goto err;
-    }
-
     return 0;
 
 err:
@@ -254,16 +254,16 @@ void vhost_net_stop(VirtIODevice *dev, NetClientState *ncs,
     VirtioBusClass *k = VIRTIO_BUS_GET_CLASS(vbus);
     int i, r;
 
+    for (i = 0; i < total_queues; i++) {
+        vhost_net_stop_one(tap_get_vhost_net(ncs[i].peer), dev);
+    }
+
     r = k->set_guest_notifiers(qbus->parent, total_queues * 2, false);
     if (r < 0) {
         fprintf(stderr, "vhost guest notifier cleanup failed: %d\n", r);
         fflush(stderr);
     }
     assert(r >= 0);
-
-    for (i = 0; i < total_queues; i++) {
-        vhost_net_stop_one(tap_get_vhost_net(ncs[i].peer), dev);
-    }
 }
 
 void vhost_net_cleanup(struct vhost_net *net)
diff --git a/hw/virtio/vhost.c b/hw/virtio/vhost.c
index 9e336ad..d74514a 100644
--- a/hw/virtio/vhost.c
+++ b/hw/virtio/vhost.c
@@ -969,7 +969,6 @@ void vhost_dev_disable_notifiers(struct vhost_dev *hdev, VirtIODevice *vdev)
 bool vhost_virtqueue_pending(struct vhost_dev *hdev, int n)
 {
     struct vhost_virtqueue *vq = hdev->vqs + n - hdev->vq_index;
-    assert(hdev->started);
     assert(n >= hdev->vq_index && n < hdev->vq_index + hdev->nvqs);
     return event_notifier_test_and_clear(&vq->masked_notifier);
 }
@@ -981,7 +980,6 @@ void vhost_virtqueue_mask(struct vhost_dev *hdev, VirtIODevice *vdev, int n,
     struct VirtQueue *vvq = virtio_get_queue(vdev, n);
     int r, index = n - hdev->vq_index;
 
-    assert(hdev->started);
     assert(n >= hdev->vq_index && n < hdev->vq_index + hdev->nvqs);
 
     struct vhost_vring_file file = {
-- 
1.8.3.1




Re: [Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread Zhangjie (HZ)

On 2014/8/18 17:51, Jason Wang wrote:
> Commit a9f98bb5ebe6fb1869321dcc58e72041ae626ad8 ("vhost: multiqueue
> support") changed the order of stopping the device. Previously,
> vhost_dev_stop would disable the backend and only afterwards unset the
> guest notifiers. We now unset guest notifiers while vhost is still
> active. This can lose interrupts, causing guest networking to fail.
>
> Additionally, remove the hdev->started assert in vhost.c, since we may
> want to start the guest notifiers before vhost starts and stop the
> guest notifiers after vhost is stopped.
>
> In particular, this has been observed during migration.

 
Thanks! I will test your patch today! :-)
-- 
Best Wishes!
Zhang Jie




Re: [Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread William Dauchy
On Mon, Aug 18, 2014 at 11:51 AM, Jason Wang <jasow...@redhat.com> wrote:
>  err:
> @@ -254,16 +254,16 @@ void vhost_net_stop(VirtIODevice *dev, NetClientState *ncs,
>      VirtioBusClass *k = VIRTIO_BUS_GET_CLASS(vbus);
>      int i, r;
>
> +    for (i = 0; i < total_queues; i++) {
> +        vhost_net_stop_one(tap_get_vhost_net(ncs[i].peer), dev);
> +    }
> +
>      r = k->set_guest_notifiers(qbus->parent, total_queues * 2, false);
>      if (r < 0) {
>          fprintf(stderr, "vhost guest notifier cleanup failed: %d\n", r);
>          fflush(stderr);
>      }
>      assert(r >= 0);
> -
> -    for (i = 0; i < total_queues; i++) {
> -        vhost_net_stop_one(tap_get_vhost_net(ncs[i].peer), dev);
> -    }
>  }

Since commit ed8b4af ("Refactor virtio-net to use generic get_vhost_net"),
get_vhost_net is used instead of tap_get_vhost_net.

Could you rebase your patch to make testing easier, or is this intentional?

Thanks,
-- 
William



Re: [Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread Michael S. Tsirkin
On Mon, Aug 18, 2014 at 05:51:31PM +0800, Jason Wang wrote:
> Commit a9f98bb5ebe6fb1869321dcc58e72041ae626ad8 ("vhost: multiqueue
> support") changed the order of stopping the device. Previously,
> vhost_dev_stop would disable the backend and only afterwards unset the
> guest notifiers. We now unset guest notifiers while vhost is still
> active. This can lose interrupts, causing guest networking to fail.
>
> Additionally, remove the hdev->started assert in vhost.c, since we may
> want to start the guest notifiers before vhost starts and stop the
> guest notifiers after vhost is stopped.
>
> In particular, this has been observed during migration.
>
> Reported-by: Zhangjie (HZ) <zhangji...@huawei.com>
> Signed-off-by: Michael S. Tsirkin <m...@redhat.com>
> Signed-off-by: Jason Wang <jasow...@redhat.com>


This doesn't seem to apply to master.
Can you rebase please?



Re: [Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread Jason Wang
On 08/18/2014 08:11 PM, Zhangjie (HZ) wrote:
> On 2014/8/18 17:51, Jason Wang wrote:
>> Commit a9f98bb5ebe6fb1869321dcc58e72041ae626ad8 ("vhost: multiqueue
>> support") changed the order of stopping the device. Previously,
>> vhost_dev_stop would disable the backend and only afterwards unset the
>> guest notifiers. We now unset guest notifiers while vhost is still
>> active. This can lose interrupts, causing guest networking to fail.
>>
>> Additionally, remove the hdev->started assert in vhost.c, since we may
>> want to start the guest notifiers before vhost starts and stop the
>> guest notifiers after vhost is stopped.
>>
>> In particular, this has been observed during migration.
> Thanks! I will test your patch today! :-)

The patch was reported not to apply cleanly. I will rebase it and send a
new one.

Please test the new version instead.

Thanks.



Re: [Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread Jason Wang
On 08/18/2014 09:20 PM, William Dauchy wrote:
> On Mon, Aug 18, 2014 at 11:51 AM, Jason Wang <jasow...@redhat.com> wrote:
>>  err:
>> @@ -254,16 +254,16 @@ void vhost_net_stop(VirtIODevice *dev, NetClientState *ncs,
>>      VirtioBusClass *k = VIRTIO_BUS_GET_CLASS(vbus);
>>      int i, r;
>>
>> +    for (i = 0; i < total_queues; i++) {
>> +        vhost_net_stop_one(tap_get_vhost_net(ncs[i].peer), dev);
>> +    }
>> +
>>      r = k->set_guest_notifiers(qbus->parent, total_queues * 2, false);
>>      if (r < 0) {
>>          fprintf(stderr, "vhost guest notifier cleanup failed: %d\n", r);
>>          fflush(stderr);
>>      }
>>      assert(r >= 0);
>> -
>> -    for (i = 0; i < total_queues; i++) {
>> -        vhost_net_stop_one(tap_get_vhost_net(ncs[i].peer), dev);
>> -    }
>>  }
> Since commit ed8b4af ("Refactor virtio-net to use generic get_vhost_net"),
> get_vhost_net is used instead of tap_get_vhost_net.
>
> Could you rebase your patch to make testing easier, or is this intentional?

Not intentional; my tree is out of date. I will rebase the patch.

Thanks for pointing this out.

> Thanks,




Re: [Qemu-devel] [PATCH] vhost_net: start/stop guest notifiers properly

2014-08-18 Thread Jason Wang
On 08/19/2014 03:53 AM, Michael S. Tsirkin wrote:
> On Mon, Aug 18, 2014 at 05:51:31PM +0800, Jason Wang wrote:
>> Commit a9f98bb5ebe6fb1869321dcc58e72041ae626ad8 ("vhost: multiqueue
>> support") changed the order of stopping the device. Previously,
>> vhost_dev_stop would disable the backend and only afterwards unset the
>> guest notifiers. We now unset guest notifiers while vhost is still
>> active. This can lose interrupts, causing guest networking to fail.
>>
>> Additionally, remove the hdev->started assert in vhost.c, since we may
>> want to start the guest notifiers before vhost starts and stop the
>> guest notifiers after vhost is stopped.
>>
>> In particular, this has been observed during migration.
>>
>> Reported-by: Zhangjie (HZ) <zhangji...@huawei.com>
>> Signed-off-by: Michael S. Tsirkin <m...@redhat.com>
>> Signed-off-by: Jason Wang <jasow...@redhat.com>
> This doesn't seem to apply to master.
> Can you rebase please?

Yes, will send a new version.