date:20150408

Re: [Qemu-devel] [PATCH qemu v5 04/12] spapr_pci_vfio: Enable multiple groups per container

2015-04-08 Thread David Gibson

On Wed, Apr 08, 2015 at 01:45:19PM +1000, Alexey Kardashevskiy wrote:
> On 04/08/2015 12:01 PM, David Gibson wrote:
> >On Tue, Mar 31, 2015 at 04:28:39PM +1100, Alexey Kardashevskiy wrote:
> >>This enables multiple IOMMU groups in one VFIO container which means
> >>that multiple devices from different groups can share the same IOMMU
> >>table (or tables if DDW).
> >>
> >>This removes a group id from vfio_container_ioctl(). The kernel support
> >>is required for this; if the host kernel does not have the support,
> >>it will allow only one group per container. The PHB's "iommuid" property
> >>is ignored.
> >>
> >>This adds a sanity check that there is just one VFIO container per
> >>PHB address space.
> >>
> >>Signed-off-by: Alexey Kardashevskiy 
> >
> >[snip]
> >>diff --git a/hw/vfio/common.c b/hw/vfio/common.c
> >>index b012620..99e1900 100644
> >>--- a/hw/vfio/common.c
> >>+++ b/hw/vfio/common.c
> >>@@ -915,21 +915,23 @@ void vfio_put_base_device(VFIODevice *vbasedev)
> >>  close(vbasedev->fd);
> >>  }
> >>
> >>-static int vfio_container_do_ioctl(AddressSpace *as, int32_t groupid,
> >>+static int vfio_container_do_ioctl(AddressSpace *as,
> >> int req, void *param)
> >>  {
> >>-VFIOGroup *group;
> >>  VFIOContainer *container;
> >>-int ret = -1;
> >>+int ret;
> >>+VFIOAddressSpace *space;
> >>
> >>-group = vfio_get_group(groupid, as);
> >>-if (!group) {
> >>-error_report("vfio: group %d not registered", groupid);
> >>-return ret;
> >>-}
> >>+space = vfio_get_address_space(as);
> >>+container = QLIST_FIRST(&space->containers);
> >
> >So getting the container handle from the address space, rather than
> >the group id certainly makes more sense to me.
> >
> >>-container = group->container;
> >>-if (group->container) {
> >>+if (!container) {
> >>+error_report("vfio: container is not set");
> >>+return -1;
> >>+} else if (QLIST_NEXT(container, next)) {
> >>+error_report("vfio: multiple containers per PHB are not 
> >>supported");
> >>+return -1;
> >
> >But if only one PHB per address space is possible, why is the
> >containers field a list in the first place?
> 
> 
> Historically the list was added in 3df3e0a5872 (the patch of yours
> :) ).

Heh.

> In theory we could implement spapr-pci-bridge (derived from pci-bridge) with
> isolation capability (i.e. its own LIOBN/DMA window), in this case there
> could be multiple containers per PHB address space. Other archs could want
> multiple containers for some other reason. It would help me a lot if you
> remembered why you kept the list in the first place :)

Ok, I've looked over the patch and it has jogged my memory a bit.  So
the dumb answer is that it's because the per address-space list was
replacing a global list of containers.

The more useful answer is that I think it was because I was
anticipating the possibility of working around the
one-group-per-container limit by allowing a single VFIOAddressSpace in
qemu to be backed by several containers, whose mappings would be kept
in sync from the userspace side by duplicating all mappings.

Anyway, I think that means the right way to implement this is by
duplicating the ioctl() across all the attached containers, rather
than picking just one.
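
A rough sketch of what "duplicate the ioctl() across all attached
containers" could look like, outside of QEMU. Everything here is a
stand-in: Container is not VFIOContainer, and the ioctl is passed in as
a callback (fake_ioctl below) so the example runs without /dev/vfio.
Stopping at the first failure is an assumption on my side; it leaves the
earlier containers already updated, and whether to roll those back is an
open question.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a VFIO container; not QEMU's VFIOContainer. */
typedef struct Container {
    int fd;
    struct Container *next;
} Container;

/* Callback standing in for ioctl(fd, req, param) so the sketch runs anywhere. */
typedef int (*ioctl_fn)(int fd, int req, void *param);

/*
 * Issue the same request to every container in the list.
 * Returns 0 if all calls succeed, -1 if the list is empty, or the first
 * negative result (no further containers are touched after a failure).
 */
int container_list_ioctl(Container *head, ioctl_fn do_ioctl,
                         int req, void *param)
{
    Container *c;

    assert(do_ioctl != NULL);
    if (!head) {
        return -1;
    }
    for (c = head; c; c = c->next) {
        int ret = do_ioctl(c->fd, req, param);
        if (ret < 0) {
            return ret;
        }
    }
    return 0;
}

/* Fake backend for the demo: counts calls in *param, fails for fd 99. */
int fake_ioctl(int fd, int req, void *param)
{
    (void)req;
    if (fd == 99) {
        return -1;
    }
    ++*(int *)param;
    return 0;
}
```

With a single container in the list this degenerates to the behaviour of
the patch as posted, so it would not need the "multiple containers"
error path.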

> For now I guess I'll move the next patch ("vfio: spapr: Move SPAPR-related
> code to a separate file") before this one, do s/vfio_container_do_ioctl/
> vfio_spapr_container_do_ioctl/ and move it to hw/vfio/spapr.c. Makes
> sense?

That sounds fine, though I don't see that it really addresses the
question here.


> 
> 
> >>+} else {
> >>  ret = ioctl(container->fd, req, param);
> >>  if (ret < 0) {
> >>  error_report("vfio: failed to ioctl %d to container: ret=%d, 
> >> %s",
> >>@@ -937,12 +939,10 @@ static int vfio_container_do_ioctl(AddressSpace *as, 
> >>int32_t groupid,
> >>  }
> >>  }
> >>
> >>-vfio_put_group(group);
> >>-
> >>  return ret;
> >>  }
> >>
> >>-int vfio_container_ioctl(AddressSpace *as, int32_t groupid,
> >>+int vfio_container_ioctl(AddressSpace *as,
> >>   int req, void *param)
> >>  {
> >>  /* We allow only certain ioctls to the container */
> >>@@ -957,5 +957,5 @@ int vfio_container_ioctl(AddressSpace *as, int32_t 
> >>groupid,
> >>  return -1;
> >>  }
> >>
> >>-return vfio_container_do_ioctl(as, groupid, req, param);
> >>+return vfio_container_do_ioctl(as, req, param);
> >>  }
> >>diff --git a/include/hw/vfio/vfio.h b/include/hw/vfio/vfio.h
> >>index 0b26cd8..76b5744 100644
> >>--- a/include/hw/vfio/vfio.h
> >>+++ b/include/hw/vfio/vfio.h
> >>@@ -3,7 +3,7 @@
> >>
> >>  #include "qemu/typedefs.h"
> >>
> >>-extern int vfio_container_ioctl(AddressSpace *as, int32_t groupid,
> >>+extern int vfio_container_ioctl(AddressSpace *as,
> >>  int req, void *param);
> >>
> >>  #endif
> >
> 
>

Re: [Qemu-devel] [Qemu-block] Migration sometimes fails with IDE and Qemu 2.2.1

2015-04-08 Thread Peter Lieven


Am 07.04.2015 um 22:05 schrieb Paolo Bonzini:


On 07/04/2015 20:44, Peter Lieven wrote:

Has the cdrom the power of taking down the bus?

IDE can only issue one command per bus, so hda/hdb can take down each
other, and hdc/hdd can take down each other.  However, hda cannot take
down hdc and vice versa---so likely the CDROM cannot take down the hard
disk.


Right, I confirmed that the machines use BMDMA and that the CDROM is hdc while
the boot disk is hda. The IDE drivers report as E-IDE Revision 7.0.0alpha2.

Peter

Re: [Qemu-devel] [Qemu-block] Migration sometimes fails with IDE and Qemu 2.2.1

2015-04-08 Thread Peter Lieven

Am 07.04.2015 um 21:13 schrieb John Snow:

On 04/07/2015 03:02 PM, Peter Lieven wrote:

Am 07.04.2015 um 20:56 schrieb John Snow:

On 04/07/2015 02:44 PM, Peter Lieven wrote:

Am 07.04.2015 um 17:29 schrieb Dr. David Alan Gilbert:

* Peter Lieven (p...@kamp.de) wrote:

Hi David,

Am 07.04.2015 um 10:43 schrieb Dr. David Alan Gilbert:

Any particular workload or reproducer?

Workload is almost zero. I try to figure out if there is a way to trigger it.

Maybe playing a role: Machine type is -M pc-1.2 and we set -kvmclock as
CPU flag since kvmclock seemed to be quite buggy in 2.6.16...

Exact cmdline is:
/usr/bin/qemu-2.2.1 -enable-kvm -M pc-1.2 -nodefaults -netdev type=tap,id=guest2,script=no,downscript=no,ifname=tap2 -device e1000,netdev=guest2,mac=52:54:00:ff:00:65 -drive
format=raw,file=iscsi://172.21.200.53/iqn.2001-05.com.equallogic:4-52aed6-88a7e99a4-d9e00040fdc509a3-XXX-hd0/0,if=ide,cache=writeback,aio=native -serial null -parallel null -m 1024 -smp 2,sockets=1,cores=2,threads=1 -monitor
tcp:0:4003,server,nowait -vnc :3 -qmp tcp:0:3003,server,nowait -name 'XXX' -boot order=c,once=dc,menu=off -drive index=2,media=cdrom,if=ide,cache=unsafe,aio=native,readonly=on -k de -incoming tcp:0:5003 -pidfile /var/run/qemu/vm-146.pid
-mem-path /hugepages -mem-prealloc -rtc base=utc -usb -usbdevice tablet -no-hpet -vga cirrus -cpu qemu64,-kvmclock

Exact kernel is:
2.6.16.46-0.12-smp (I think this is SLES10 or similar)

The machine does not hang. It seems just I/O is hanging. So you can type at the
console or ping the system, but no longer login.

Thank you,
Peter

Interesting observation: Migrating the vServer again seems to fix the problem
(at least in one case I could test just now).

2.6.8-24-smp is also affected.

How often does it fail - you say 'sometimes' - is it a 1/10 or a 1/1000 ?

It's more often than 1/10 I would say.

OK, that's not too bad - it's the 1/1000 that are really nasty to find.
In your setup, how easy would it be for you to try :
with either 2.1 or current head?
with a newer machine-type?
without the cdrom?

It's all possible. I can clone the system and try everything on my test systems.
I hope it reproduces there.

Has the cdrom the power of taking down the bus?

Peter

I don't know if the CDROM could stall the entire bus, but I suspect the reason for asking is this: dgilbert and I had previously tracked down a problem where outstanding requests being handled by the ATAPI code can get lost during
migration if, for instance, the user has only prepared the command (via bmdma) but has not yet written to the register to activate it.

That sounds like it could be related.

So if something like this happens:

- User writes to the ATA registers to program a command
- Migration occurs
- User writes to the BMDMA register to initiate the command

We can lose some of the state and data of the request. David had checked in a
workaround for at least ATAPI that simply coaxes the guest OS into trying the
command again to unstick it.

Do you have the commit for me?

http://lists.gnu.org/archive/html/qemu-devel/2014-12/msg01109.html

I think we determined last time that we couldn't fix this problem without changing the migration format, so we opted not to do it for 2.3. We had also only noticed it with ATAPI drives, not HDDs, so a proper fix got kicked down the road since we
thought the workaround was sufficient.

Maybe normally we use virtio nowadays and maybe the new kernel implementation
(libata /dev/sdX) can't get locked? What I do not understand is how a second
migration can unlock from this state?

IIRC our success rate with reproducing it was something on the order of 1/50,
too.

If you can reproduce it without a CDROM but using the BMDMA interface, that's a
good data point. If you can't reproduce it using the ISA interface, that's a
phenomenal data point and implicates BMDMA pretty heavily.

To be 100% sure we are talking about the same thing: how would I use the ISA
interface and how would I use the BMDMA interface?

Thanks,
Peter

BMDMA is the PCI HBA for IDE, I think it's the default for most machines that
aren't using the AHCI HBA.

To get ISA, try launching with the machine "isapc" which will force it, or add the device
manually, it's named "isa-ide".
The BMDMA PCI device is just named "ide".

I will start more debugging today. I found that other SuSE servers which use the
newer interface (presenting as /dev/sdX) do not suffer from the problem.

Peter

Re: [Qemu-devel] [PATCH v4 05/20] hw/acpi/aml-build: Add aml_interrupt() term

2015-04-08 Thread Shannon Zhao

On 2015/4/8 22:57, Alex Bennée wrote:
> 
> Shannon Zhao  writes:
> 
>> From: Shannon Zhao 
>>
>> Add aml_interrupt() for describing device interrupt in resource template.
>> This can be used to generate the DSDT table for ACPI on ARM.
>>
>> Signed-off-by: Shannon Zhao 
>> Signed-off-by: Shannon Zhao 
>> ---
>>  hw/acpi/aml-build.c | 18 ++
>>  include/hw/acpi/aml-build.h |  1 +
>>  2 files changed, 19 insertions(+)
>>
>> diff --git a/hw/acpi/aml-build.c b/hw/acpi/aml-build.c
>> index fefe7c7..bd1713c 100644
>> --- a/hw/acpi/aml-build.c
>> +++ b/hw/acpi/aml-build.c
>> @@ -527,6 +527,24 @@ Aml *aml_memory32_fixed(uint64_t addr, uint64_t size, 
>> uint8_t rw_flag)
>>  return var;
>>  }
>>  
>> +/*
>> + * ACPI 1.0: 6.4.3.6 Interrupt (Interrupt Resource Descriptor Macro)
>> + */
>> +Aml *aml_interrupt(uint8_t irq_flags, int irq)
>> +{
>> +Aml *var = aml_alloc();
>> +build_append_byte(var->buf, 0x89); /* Extended irq descriptor */
>> +build_append_byte(var->buf, 6); /* Length, bits[7:0] minimum value = 6 
>> */
>> +build_append_byte(var->buf, 0); /* Length, bits[15:8] minimum value = 0 
>> */
>> +build_append_byte(var->buf, irq_flags); /* Interrupt Vector
>> Information. */
> 
> As the spec says [7:4] is RES0 we might want to assert this is the case.
> 

Yes, we should check, although the probability is very small.
But the reserved bits are different in ACPI 5.1:

Bit[7:5] Reserved (must be 0)
Bit[4] Wake Capability, _WKC
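
For illustration, the check could be as small as the sketch below. It
assumes the ACPI 5.1 layout quoted just above (bits [7:5] reserved, bit
4 used for wake capability); the mask name and both helpers are made up
for this example and are not part of the patch.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Bits [7:5] of the interrupt flags byte, reserved in the layout quoted above. */
#define IRQ_FLAGS_RESERVED_MASK 0xE0u

/* True when no reserved bit is set in the flags byte. */
bool irq_flags_valid(uint8_t irq_flags)
{
    return (irq_flags & IRQ_FLAGS_RESERVED_MASK) == 0;
}

/* Abort early on a bad flags byte, the way an assert in the builder would. */
void check_irq_flags(uint8_t irq_flags)
{
    assert(irq_flags_valid(irq_flags));
}
```

If the ACPI 1.0 layout were the reference instead, the mask would have to
cover bits [7:4], which is the difference noted above.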

>> +build_append_byte(var->buf, 0x01); /* Interrupt table length = 1 */
>> +build_append_byte(var->buf, irq & 0xff); /* Interrupt Number bits[7:0] 
>> */
>> +build_append_byte(var->buf, (irq >> 8) & 0xff); /* Interrupt Number 
>> bits[15:8] */
>> +build_append_byte(var->buf, (irq >> 16) & 0xff); /* Interrupt Number 
>> bits[23:16] */
>> +build_append_byte(var->buf, (irq >> 24) & 0xff); /* Interrupt
>> Number bits[31:24] */
> 
> Again extractNN bitops?
> 
>> +return var;
>> +}
>> +
>>  /* ACPI 1.0b: 6.4.2.5 I/O Port Descriptor */
>>  Aml *aml_io(AmlIODecode dec, uint16_t min_base, uint16_t max_base,
>>  uint8_t aln, uint8_t len)
>> diff --git a/include/hw/acpi/aml-build.h b/include/hw/acpi/aml-build.h
>> index baa0652..315c729 100644
>> --- a/include/hw/acpi/aml-build.h
>> +++ b/include/hw/acpi/aml-build.h
>> @@ -163,6 +163,7 @@ Aml *aml_call2(const char *method, Aml *arg1, Aml *arg2);
>>  Aml *aml_call3(const char *method, Aml *arg1, Aml *arg2, Aml *arg3);
>>  Aml *aml_call4(const char *method, Aml *arg1, Aml *arg2, Aml *arg3, Aml 
>> *arg4);
>>  Aml *aml_memory32_fixed(uint64_t addr, uint64_t size, uint8_t rw_flag);
>> +Aml *aml_interrupt(uint8_t irq_flags, int irq);
>>  Aml *aml_io(AmlIODecode dec, uint16_t min_base, uint16_t max_base,
>>  uint8_t aln, uint8_t len);
>>  Aml *aml_operation_region(const char *name, AmlRegionSpace rs,
>

Re: [Qemu-devel] [PATCH v4 04/20] hw/acpi/aml-build: Add aml_memory32_fixed() term

2015-04-08 Thread Shannon Zhao

On 2015/4/8 22:54, Alex Bennée wrote:
> 
> Shannon Zhao  writes:
> 
>> From: Shannon Zhao 
>>
>> Add aml_memory32_fixed() for describing device mmio region in resource 
>> template.
>> This can be used to generate the DSDT table for ACPI on ARM.
>>
>> Signed-off-by: Shannon Zhao 
>> Signed-off-by: Shannon Zhao 
>> ---
>>  hw/acpi/aml-build.c | 22 ++
>>  include/hw/acpi/aml-build.h |  1 +
>>  2 files changed, 23 insertions(+)
>>
>> diff --git a/hw/acpi/aml-build.c b/hw/acpi/aml-build.c
>> index 8d01959..fefe7c7 100644
>> --- a/hw/acpi/aml-build.c
>> +++ b/hw/acpi/aml-build.c
>> @@ -505,6 +505,28 @@ Aml *aml_call4(const char *method, Aml *arg1, Aml 
>> *arg2, Aml *arg3, Aml *arg4)
>>  return var;
>>  }
>>  
>> +/*
>> + * ACPI 1.0: 6.4.3.4 Memory32Fixed (Memory Resource Descriptor Macro)
>> + */
>> +Aml *aml_memory32_fixed(uint64_t addr, uint64_t size, uint8_t rw_flag)
>> +{
>> +Aml *var = aml_alloc();
> 
> This is more aimed at the ACPI maintainers but I wonder if there should
> be an aml_alloc_sized that pre-allocates the GArray? Otherwise we spend
> a lot of time realloc'ing while building these entries up. Or even a
> variadic build_append_bytes?
> 
>> +build_append_byte(var->buf, 0x86); /* Memory32Fixed Resource Descriptor 
>> */
>> +build_append_byte(var->buf, 9); /* Length, bits[7:0] value = 9 */
>> +build_append_byte(var->buf, 0); /* Length, bits[15:8] value = 0 */
>> +build_append_byte(var->buf, rw_flag); /* Write status, 1 rw 0 ro */
>> +build_append_byte(var->buf, addr & 0xff); /* Range base address 
>> bits[7:0] */
>> +build_append_byte(var->buf, (addr >> 8) & 0xff); /* Range base address 
>> bits[15:8] */
>> +build_append_byte(var->buf, (addr >> 16) & 0xff); /* Range base address 
>> bits[23:16] */
>> +build_append_byte(var->buf, (addr >> 24) & 0xff); /* Range base
>> address bits[31:24] */
> 
> I should point out we have handy utility functions for bit fiddling:
> 
> build_append_byte(var->buf, extract64(addr, 8, 8)); /* Range base address 
> bits[15:8] */
> 

Great, will use these utility functions. Same with the other patch.
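
To make the idea concrete, here is a standalone sketch of appending a
32-bit field in little-endian order with an extract-style helper.
demo_extract64() is a local stand-in modelled on
extract64(value, start, length), and append_le32() with its plain byte
buffer is hypothetical; neither is the aml-build code itself.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Local stand-in: return 'length' bits of 'value' starting at bit 'start'. */
uint64_t demo_extract64(uint64_t value, int start, int length)
{
    assert(start >= 0 && length > 0 && length <= 64 - start);
    return (value >> start) & (~0ULL >> (64 - length));
}

/*
 * Append the low 32 bits of 'value' to 'buf' in little-endian order.
 * The caller must provide room for 4 more bytes. Returns the new length.
 */
size_t append_le32(uint8_t *buf, size_t len, uint64_t value)
{
    int i;

    for (i = 0; i < 4; i++) {
        buf[len++] = (uint8_t)demo_extract64(value, i * 8, 8);
    }
    return len;
}
```

The loop replaces the four hand-written shift-and-mask lines per field,
which is where the copy-and-paste mistakes tend to creep in.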

>> +
>> +build_append_byte(var->buf, size & 0xff); /* Range length bits[7:0] */
>> +build_append_byte(var->buf, (size >> 8) & 0xff); /* Range length 
>> bits[15:8] */
>> +build_append_byte(var->buf, (size >> 16) & 0xff); /* Range length 
>> bits[23:16] */
>> +build_append_byte(var->buf, (size >> 24) & 0xff); /* Range length
>> bits[31:24] */
> 
> Hmm we seem to have two 64 bit inputs which we only use 32 bits worth
> of. Maybe the prototype should be fixed to avoid accidentally
> passing in 64 bit values.
> 

Thanks, will fix this.
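
One way a caller that still holds 64-bit values could guard the narrowed
prototype is sketched below. Both helper names are invented for this
example, and the policy of asserting rather than reporting an error is
an assumption.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* True when base and size can both be stored in 32-bit descriptor fields. */
bool fits_memory32(uint64_t addr, uint64_t size)
{
    return addr <= UINT32_MAX && size <= UINT32_MAX;
}

/* Narrow a 64-bit value to 32 bits, aborting instead of silently truncating. */
uint32_t narrow_to_u32(uint64_t value)
{
    assert(value <= UINT32_MAX);
    return (uint32_t)value;
}
```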

> 
>> +return var;
>> +}
>> +
>>  /* ACPI 1.0b: 6.4.2.5 I/O Port Descriptor */
>>  Aml *aml_io(AmlIODecode dec, uint16_t min_base, uint16_t max_base,
>>  uint8_t aln, uint8_t len)
>> diff --git a/include/hw/acpi/aml-build.h b/include/hw/acpi/aml-build.h
>> index 1705001..baa0652 100644
>> --- a/include/hw/acpi/aml-build.h
>> +++ b/include/hw/acpi/aml-build.h
>> @@ -162,6 +162,7 @@ Aml *aml_call1(const char *method, Aml *arg1);
>>  Aml *aml_call2(const char *method, Aml *arg1, Aml *arg2);
>>  Aml *aml_call3(const char *method, Aml *arg1, Aml *arg2, Aml *arg3);
>>  Aml *aml_call4(const char *method, Aml *arg1, Aml *arg2, Aml *arg3, Aml 
>> *arg4);
>> +Aml *aml_memory32_fixed(uint64_t addr, uint64_t size, uint8_t rw_flag);
>>  Aml *aml_io(AmlIODecode dec, uint16_t min_base, uint16_t max_base,
>>  uint8_t aln, uint8_t len);
>>  Aml *aml_operation_region(const char *name, AmlRegionSpace rs,
>

Re: [Qemu-devel] [PATCH v4 03/20] hw/arm/virt-acpi-build: Basic framework for building ACPI tables on ARM

2015-04-08 Thread Shannon Zhao

On 2015/4/8 22:37, Alex Bennée wrote:
> 
> Shannon Zhao  writes:
> 
>> From: Shannon Zhao 
>>
>> Introduce a preliminary framework in virt-acpi-build.c with the main
>> ACPI build functions. It exposes the generated ACPI contents to
>> guest over fw_cfg.
>>
>> The required ACPI v5.1 tables for ARM are:
>> - RSDP: Initial table that points to XSDT
>> - RSDT: Points to FADT GTDT MADT tables
>> - FADT: Generic information about the machine
>> - GTDT: Generic timer description table
>> - MADT: Multiple APIC description table
>> - DSDT: Holds all information about system devices/peripherals, pointed by 
>> FADT
>>
>> Signed-off-by: Shannon Zhao 
>> Signed-off-by: Shannon Zhao 
>> ---
>>  hw/arm/Makefile.objs |   1 +
>>  hw/arm/virt-acpi-build.c | 198 
>> +++
>>  include/hw/arm/virt-acpi-build.h |  65 +
>>  3 files changed, 264 insertions(+)
>>  create mode 100644 hw/arm/virt-acpi-build.c
>>  create mode 100644 include/hw/arm/virt-acpi-build.h
>>
>> diff --git a/hw/arm/Makefile.objs b/hw/arm/Makefile.objs
>> index 2577f68..a1bfb19 100644
>> --- a/hw/arm/Makefile.objs
>> +++ b/hw/arm/Makefile.objs
>> @@ -3,6 +3,7 @@ obj-$(CONFIG_DIGIC) += digic_boards.o
>>  obj-y += integratorcp.o kzm.o mainstone.o musicpal.o nseries.o
>>  obj-y += omap_sx1.o palm.o realview.o spitz.o stellaris.o
>>  obj-y += tosa.o versatilepb.o vexpress.o virt.o xilinx_zynq.o z2.o
>> +obj-$(CONFIG_ACPI) += virt-acpi-build.o
>>  obj-y += netduino2.o
>>  
>>  obj-y += armv7m.o exynos4210.o pxa2xx.o pxa2xx_gpio.o pxa2xx_pic.o
>> diff --git a/hw/arm/virt-acpi-build.c b/hw/arm/virt-acpi-build.c
>> new file mode 100644
>> index 000..388838a
>> --- /dev/null
>> +++ b/hw/arm/virt-acpi-build.c
>> @@ -0,0 +1,198 @@
>> +/* Support for generating ACPI tables and passing them to Guests
>> + *
>> + * ARM virt ACPI generation
>> + *
>> + * Copyright (C) 2008-2010  Kevin O'Connor 
>> + * Copyright (C) 2006 Fabrice Bellard
>> + * Copyright (C) 2013 Red Hat Inc
>> + *
>> + * Author: Michael S. Tsirkin 
>> + *
>> + * Copyright (c) 2015 HUAWEI TECHNOLOGIES CO.,LTD.
>> + *
>> + * Author: Shannon Zhao 
>> + *
>> + * This program is free software; you can redistribute it and/or modify
>> + * it under the terms of the GNU General Public License as published by
>> + * the Free Software Foundation; either version 2 of the License, or
>> + * (at your option) any later version.
>> +
>> + * This program is distributed in the hope that it will be useful,
>> + * but WITHOUT ANY WARRANTY; without even the implied warranty of
>> + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
>> + * GNU General Public License for more details.
>> +
>> + * You should have received a copy of the GNU General Public License along
>> + * with this program; if not, see .
>> + */
>> +
>> +#include "hw/arm/virt-acpi-build.h"
>> +#include 
>> +#include 
>> +#include "qemu-common.h"
>> +#include "qemu/bitmap.h"
>> +#include "qemu/osdep.h"
>> +#include "qemu/range.h"
>> +#include "qemu/error-report.h"
>> +#include "qom/cpu.h"
>> +#include "target-arm/cpu.h"
>> +#include "hw/acpi/acpi-defs.h"
>> +#include "hw/acpi/acpi.h"
>> +#include "hw/nvram/fw_cfg.h"
>> +#include "hw/acpi/bios-linker-loader.h"
>> +#include "hw/loader.h"
>> +#include "hw/hw.h"
>> +
>> +#include "hw/acpi/aml-build.h"
>> +
>> +#include "qapi/qmp/qint.h"
>> +#include "qom/qom-qobject.h"
>> +#include "exec/ram_addr.h"
>> +
>> +/* #define DEBUG_ACPI_BUILD */
>> +#ifdef DEBUG_ACPI_BUILD
>> +#define ACPI_BUILD_DPRINTF(fmt, ...)\
>> +do {printf("ACPI_BUILD: " fmt, ## __VA_ARGS__); } while (0)
>> +#else
>> +#define ACPI_BUILD_DPRINTF(fmt, ...)
>> +#endif
> 
> I'd be tempted to rename this to D or at a push VIRT_ACPI_DEBUG just to
> make it distinct from where it was copied from.
> 
> You could also consider something like:
> 
> printf("%s: " fmt, __func__, ##__VA_ARGS__);
> 
> So log statements are prepended with their source functions.
> 

Thanks, will fix it.
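
A possible shape for the renamed macro, as a standalone sketch. The
names DEBUG_VIRT_ACPI and VIRT_ACPI_DPRINTF are placeholders, output
goes to stderr here rather than stdout, and ##__VA_ARGS__ is a GNU
extension (accepted by gcc and clang). The debug switch is left enabled
so the demo prints something.

```c
#include <assert.h>
#include <stdio.h>

/* Comment this out to compile the debug output away. */
#define DEBUG_VIRT_ACPI

#ifdef DEBUG_VIRT_ACPI
/* Prefix every message with the name of the calling function. */
#define VIRT_ACPI_DPRINTF(fmt, ...) \
    do { fprintf(stderr, "%s: " fmt, __func__, ##__VA_ARGS__); } while (0)
#else
#define VIRT_ACPI_DPRINTF(fmt, ...) do { } while (0)
#endif

/* Demo caller: logs once and returns its argument unchanged. */
int demo_build_tables(int count)
{
    VIRT_ACPI_DPRINTF("building %d tables\n", count);
    return count;
}
```

Calling demo_build_tables(3) writes "demo_build_tables: building 3 tables"
to stderr.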

>> +
>> +typedef
>> +struct AcpiBuildState {
>> +/* Copy of table in RAM (for patching). */
>> +ram_addr_t table_ram;
>> +ram_addr_t rsdp_ram;
>> +ram_addr_t linker_ram;
>> +/* Is table patched? */
>> +uint8_t patched;
>> +VirtGuestInfo *guest_info;
>> +} AcpiBuildState;
>> +
>> +static
>> +void virt_acpi_build(VirtGuestInfo *guest_info, AcpiBuildTables *tables)
>> +{
>> +GArray *table_offsets;
>> +
>> +table_offsets = g_array_new(false, true /* clear */,
>> +sizeof(uint32_t));
>> +
>> +bios_linker_loader_alloc(tables->linker, ACPI_BUILD_TABLE_FILE,
>> + 64, false /* high memory */);
>> +
>> +/*
>> + * The ACPI v5.1 tables for Hardware-reduced ACPI platform are:
>> + * RSDP
>> + * RSDT
>> + * FADT
>> + * GTDT
>> + * MADT
>> + * DSDT
>> + */
>> +
>> +/* Cleanup memory that's no longer used. */
>> +g_arra

Re: [Qemu-devel] 64-bit build of qemu-system-arm with mingw-w64 not functional

2015-04-08 Thread Stefan Weil


Am 08.04.2015 um 22:27 schrieb Liviu Ionescu:

On 08 Apr 2015, at 09:20, Stefan Weil  wrote:

Here is my package list (from Debian Jessie):

ii  binutils-mingw-w64-i686    2.22-8+deb7u2+2+deb7u1  amd64  Cross-binutils for Win32 (x86) using MinGW-w64
ii  binutils-mingw-w64-x86-64  2.22-8+deb7u2+2+deb7u1  amd64  Cross-binutils for Win64 (x64) using MinGW-w64
ii  g++-mingw-w64              4.6.3-14+8              all    GNU C++ compiler for MinGW-w64
ii  g++-mingw-w64-i686         4.6.3-14+8              amd64  GNU C++ compiler for MinGW-w64 targeting Win32
ii  g++-mingw-w64-x86-64       4.6.3-14+8              amd64  GNU C++ compiler for MinGW-w64 targeting Win64
ii  gcc-mingw-w64              4.6.3-14+8              all    GNU C compiler for MinGW-w64
ii  gcc-mingw-w64-base         4.6.3-14+8              amd64  GNU Compiler Collection for MinGW-w64 (base package)
ii  gcc-mingw-w64-i686         4.6.3-14+8              amd64  GNU C compiler for MinGW-w64 targeting Win32
ii  gcc-mingw-w64-x86-64       4.6.3-14+8              amd64  GNU C compiler for MinGW-w64 targeting Win64
ii  gdb-mingw-w64              7.4.1-1.1+5             amd64  Cross-debugger for Win32 and Win64 using MinGW-w64
ii  gdb-mingw-w64-target       7.4.1-1.1+5             all    Cross-debugger server for Win32 and Win64 using MinGW-w64
ii  gtk-mingw-w64-x86-64       3.6.4-20131201-2        all    Converted tgz package
ii  gtk2.0-mingw-w64-i686      2.24.10-20120208-2      all    Converted tgz package
ii  libfdt-mingw-w64-i686      1.4.0-2                 all    Converted tgz package
ii  libfdt-mingw-w64-x86-64    1.4.0-2                 all    Converted tgz package
ii  libpthreads-mingw-w64      2.9.1+dfsg-1            all    POSIX threads library for 32- and 64-bit Windows
ii  mingw-w64                  2.0.3-1                 all    Development environment targetting 32- and 64-bit Windows
ii  mingw-w64-i686-dev         2.0.3-1                 all    Development files for MinGW-w64 targeting Win32
ii  mingw-w64-tools            2.0.3-1                 amd64  Development tools for 32- and 64-bit Windows
ii  mingw-w64-x86-64-dev       2.0.3-1                 all    Development files for MinGW-w64 targeting Win64


Stefan,

I'm afraid there is a small misunderstanding here. I checked, and even without
upgrading the packages, Debian 8 (Jessie) does not include the packages you
are referring to above. The actual versions I identified are:


You are right, I was wrong: my production server qemu.weilnetz.de uses
Wheezy, not Jessie, so my list was for Wheezy.

Here is the list for my Jessie system:

ii  binutils-mingw-w64-i686    2.25-5+5.2          amd64  Cross-binutils for Win32 (x86) using MinGW-w64
ii  binutils-mingw-w64-x86-64  2.25-5+5.2          amd64  Cross-binutils for Win64 (x64) using MinGW-w64
ii  g++-mingw-w64              4.9.1-19+14.3       all    GNU C++ compiler for MinGW-w64
ii  g++-mingw-w64-i686         4.9.1-19+14.3       amd64  GNU C++ compiler for MinGW-w64 targeting Win32
ii  g++-mingw-w64-x86-64       4.9.1-19+14.3       amd64  GNU C++ compiler for MinGW-w64 targeting Win64
ii  gcc-mingw-w64              4.9.1-19+14.3       all    GNU C compiler for MinGW-w64
ii  gcc-mingw-w64-base         4.9.1-19+14.3       amd64  GNU Compiler Collection for MinGW-w64 (base package)
ii  gcc-mingw-w64-i686         4.9.1-19+14.3       amd64  GNU C compiler for MinGW-w64 targeting Win32
ii  gcc-mingw-w64-x86-64       4.9.1-19+14.3       amd64  GNU C compiler for MinGW-w64 targeting Win64
ii  gtk-mingw-w64-x86-64       3.6.4-20131201-2    all    Converted tgz package
ii  gtk2.0-mingw-w64-i686      2.24.10-20120208-2  all    Converted tgz package
ii  libfdt-mingw-w64-i686      1.4.0-2             all    Converted tgz package
ii  libfdt-mingw-w64-x86-64    1.4.0-2             all    Converted tgz package
ii  mingw-w64                  4.0~rc3-1           all    Development environment targeting 32- and 64-bit Windows
ii  mingw-w64-common           4.0~rc3-1           all    Common files for Mingw-w64
ii  mingw-w64-i686-dev         4.0~rc3-1           all    Development files for MinGW-w64 targeting Win32
ii  mingw-w64-x86-64-dev       4.0~rc3-1           all    Development files for MinGW-w64 targeting Win64


Both the Wheezy and the Jessie environment work.


and the question from the previous message is still open, how did you install the 
following packages? the comment "Converted tgz package" is a good sign these 
are custom packages.

ii  gtk-mingw-w64-x86-64 3.6.4-20131201-2  all  Converted 
tgz package
ii  gtk2.0-mingw-w64-i686 2.24.10-20120208

Re: [Qemu-devel] [Qemu-ppc] [RFC PATCH v2 19/23] spapr: CPU hot unplug support

2015-04-08 Thread Bharata B Rao

On Tue, Apr 07, 2015 at 04:45:17PM +1000, Alexey Kardashevskiy wrote:
> On 03/24/2015 12:36 AM, Bharata B Rao wrote:
> >Support hot removal of CPU for sPAPR guests by sending the hot
> >unplug notification to the guest via EPOW interrupt.
> >
> >Signed-off-by: Bharata B Rao 
> >---
> >  hw/ppc/spapr.c| 78 
> > ++-
> >  linux-headers/linux/kvm.h |  1 +
> >  target-ppc/kvm.c  |  7 +
> >  target-ppc/kvm_ppc.h  |  6 
> >  4 files changed, 91 insertions(+), 1 deletion(-)
> >
> >diff --git a/hw/ppc/spapr.c b/hw/ppc/spapr.c
> >index b48994b..7b8784d 100644
> >--- a/hw/ppc/spapr.c
> >+++ b/hw/ppc/spapr.c
> >@@ -1468,6 +1468,12 @@ static void spapr_cpu_init(PowerPCCPU *cpu)
> >  qemu_register_reset(spapr_cpu_reset, cpu);
> >  }
> >
> >+static void spapr_cpu_destroy(PowerPCCPU *cpu)
> >+{
> >+xics_cpu_destroy(spapr->icp, cpu);
> >+qemu_unregister_reset(spapr_cpu_reset, cpu);
> >+}
> >+
> >  /* pSeries LPAR / sPAPR hardware init */
> >  static void ppc_spapr_init(MachineState *machine)
> >  {
> >@@ -1880,6 +1886,18 @@ static void spapr_cpu_hotplug_add(DeviceState *dev, 
> >CPUState *cs, Error **errp)
> >  }
> >  }
> >
> >+static void spapr_cpu_hotplug_remove(DeviceState *dev, CPUState *cs,
> >+ Error **errp)
> >+{
> >+PowerPCCPU *cpu = POWERPC_CPU(cs);
> >+int id = ppc_get_vcpu_dt_id(cpu);
> >+sPAPRDRConnector *drc =
> >+spapr_dr_connector_by_id(SPAPR_DR_CONNECTOR_TYPE_CPU, id);
> >+sPAPRDRConnectorClass *drck = SPAPR_DR_CONNECTOR_GET_CLASS(drc);
> >+
> >+drck->detach(drc, dev, NULL, NULL, errp);
> >+}
> >+
> >  static void spapr_cpu_plug(HotplugHandler *hotplug_dev, DeviceState *dev,
> >  Error **errp)
> >  {
> >@@ -1911,6 +1929,51 @@ static void spapr_cpu_plug(HotplugHandler 
> >*hotplug_dev, DeviceState *dev,
> >  return;
> >  }
> >
> >+static int spapr_cpu_unplug(Object *obj, void *opaque)
> >+{
> >+Error **errp = opaque;
> >+DeviceState *dev = DEVICE(obj);
> >+CPUState *cs = CPU(dev);
> >+PowerPCCPU *cpu = POWERPC_CPU(cs);
> >+int id = ppc_get_vcpu_dt_id(cpu);
> >+int smt = kvmppc_smt_threads();
> >+sPAPRDRConnector *drc =
> >+spapr_dr_connector_by_id(SPAPR_DR_CONNECTOR_TYPE_CPU, id);
> >+
> >+spapr_cpu_destroy(cpu);
> >+
> >+/*
> >+ * SMT threads return from here, only main thread (core) will
> >+ * continue and signal hot unplug event to the guest.
> >+ */
> >+if ((id % smt) != 0) {
> >+return 0;
> >+}
> >+g_assert(drc);
> >+
> >+spapr_cpu_hotplug_remove(dev, cs, errp);
> >+if (*errp) {
> >+return -1;
> >+}
> >+spapr_hotplug_req_remove_event(drc);
> >+
> >+return 0;
> >+}
> >+
> >+static int spapr_cpu_core_unplug(Object *obj, void *opaque)
> >+{
> >+Error **errp = opaque;
> >+
> >+object_child_foreach(obj, spapr_cpu_unplug, errp);
> >+return 0;
> >+}
> >+
> >+static void spapr_cpu_socket_unplug(HotplugHandler *hotplug_dev,
> >+DeviceState *dev, Error **errp)
> >+{
> >+object_child_foreach(OBJECT(dev), spapr_cpu_core_unplug, errp);
> >+}
> >+
> >  static void spapr_machine_device_plug(HotplugHandler *hotplug_dev,
> >DeviceState *dev, Error **errp)
> >  {
> >@@ -1926,10 +1989,21 @@ static void spapr_machine_device_plug(HotplugHandler 
> >*hotplug_dev,
> >  }
> >  }
> >
> >+static void spapr_machine_device_unplug(HotplugHandler *hotplug_dev,
> >+  DeviceState *dev, Error **errp)
> >+{
> >+if (object_dynamic_cast(OBJECT(dev), TYPE_CPU_SOCKET)) {
> >+if (dev->hotplugged && spapr->dr_cpu_enabled) {
> >+spapr_cpu_socket_unplug(hotplug_dev, dev, errp);
> >+}
> >+}
> >+}
> >+
> >  static HotplugHandler *spapr_get_hotpug_handler(MachineState *machine,
> >   DeviceState *dev)
> >  {
> >-if (object_dynamic_cast(OBJECT(dev), TYPE_CPU)) {
> >+if (object_dynamic_cast(OBJECT(dev), TYPE_CPU) ||
> >+object_dynamic_cast(OBJECT(dev), TYPE_CPU_SOCKET)) {
> 
> 
> What is this change for? I mean why is not it always socket-only? Commit log
> would not hurt here...

In the hot add case (do_device_add), the CPU socket device is realized first
which will realize the CPU core devices. Core devices will realize the CPU
thread devices. So the ->plug() operation happens as part of CPU thread devices
and hence hotplug_handler is returned only for TYPE_CPU.

However in case of hot remove, qdev_unplug() directly does ->unplug() and
hence I need to return the hotplug_handler for TYPE_CPU_SOCKET also.
This ensures that ->unplug() gets called for socket object where I take
care of recursively walking down the core and thread objects and unplugging
the CPU thread object eventually.
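
The walk described above can be modelled outside of QEMU roughly as
follows. This is only a toy: Node is not a QOM object, the counters
stand in for spapr_cpu_destroy() and the hot unplug event, and the
smt-based "first thread of the core" test mirrors the (id % smt) check
in the patch under the assumption that thread ids are contiguous.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_CHILDREN 8

/* Hypothetical node in a socket -> core -> thread tree. */
typedef struct Node {
    int thread_id;                 /* meaningful for thread (leaf) nodes only */
    size_t nr_children;
    struct Node *children[MAX_CHILDREN];
} Node;

typedef struct UnplugStats {
    int threads_destroyed;
    int events_signalled;
} UnplugStats;

/* Tear down one thread; only the first thread of a core signals the guest. */
void thread_unplug(const Node *thread, int smt, UnplugStats *stats)
{
    assert(smt > 0);
    stats->threads_destroyed++;
    if (thread->thread_id % smt == 0) {
        stats->events_signalled++;
    }
}

/* Entry point for the socket: visit each core, then each of its threads. */
void socket_unplug(const Node *socket, int smt, UnplugStats *stats)
{
    size_t c, t;

    for (c = 0; c < socket->nr_children; c++) {
        const Node *core = socket->children[c];

        for (t = 0; t < core->nr_children; t++) {
            thread_unplug(core->children[t], smt, stats);
        }
    }
}
```

For a socket with two cores of two threads each (ids 0..3, smt = 2), all
four threads are torn down but only two events are raised, one per core.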

Regards,
Bharata.

Re: [Qemu-devel] [PATCH for-2.3] cris: memory: Replace memory_region_init_ram with memory_region_allocate_system_memory

2015-04-08 Thread Edgar E. Iglesias

On Sat, Apr 04, 2015 at 02:15:10PM +0200, Dirk Müller wrote:
> Commit 0b183fc871:"memory: move mem_path handling to
> memory_region_allocate_system_memory" split memory_region_init_ram and
> memory_region_init_ram_from_file. Also it moved mem-path handling a step
> up from memory_region_init_ram to memory_region_allocate_system_memory.
> 
> Therefore for any board that uses memory_region_init_ram directly,
> -mem-path is not supported.
> 
> Fix this by replacing memory_region_init_ram with
> memory_region_allocate_system_memory.
> 
> Cc: Edgar E. Iglesias 
> Signed-off-by: Dirk Mueller 
> ---
>  hw/cris/axis_dev88.c | 5 ++---
>  1 file changed, 2 insertions(+), 3 deletions(-)


Hi,

A question, should this only be done for one of the memories?
BTW, I'm having problems git am:ing this patch, not sure why...

Cheers,
Edgar


> 
> diff --git a/hw/cris/axis_dev88.c b/hw/cris/axis_dev88.c
> index 0479196..3cae480 100644
> --- a/hw/cris/axis_dev88.c
> +++ b/hw/cris/axis_dev88.c
> @@ -270,9 +270,8 @@ void axisdev88_init(MachineState *machine)
>  env = &cpu->env;
> 
>  /* allocate RAM */
> -memory_region_init_ram(phys_ram, NULL, "axisdev88.ram", ram_size,
> -   &error_abort);
> -vmstate_register_ram_global(phys_ram);
> +memory_region_allocate_system_memory(phys_ram, NULL, "axisdev88.ram",
> + ram_size);
>  memory_region_add_subregion(address_space_mem, 0x4000, phys_ram);
> 
>  /* The ETRAX-FS has 128Kb on chip ram, the docs refer to it as the
> -- 
> 2.0.4

Re: [Qemu-devel] [PULL 22/62] block: Support Archipelago as a QEMU block backend

2015-04-08 Thread Andreas Färber

Am 08.08.2014 um 19:39 schrieb Kevin Wolf:
> From: Chrysostomos Nanakos 
> 
> VM Image on Archipelago volume is specified like this:
> 
> file.driver=archipelago,file.volume=[,file.mport=[,
> file.vport=][,file.segment=]]
> 
> 'archipelago' is the protocol.
> 
> 'mport' is the port number on which mapperd is listening. This is optional
> and if not specified, QEMU will make Archipelago use the default port.
> 
> 'vport' is the port number on which vlmcd is listening. This is optional
> and if not specified, QEMU will make Archipelago use the default port.
> 
> 'segment' is the name of the shared memory segment the Archipelago stack is
> using. This is optional and if not specified, QEMU will make Archipelago use
> the default value, 'archipelago'.
> 
> Examples:
> 
> file.driver=archipelago,file.volume=my_vm_volume
> file.driver=archipelago,file.volume=my_vm_volume,file.mport=123
> file.driver=archipelago,file.volume=my_vm_volume,file.mport=123,
> file.vport=1234
> file.driver=archipelago,file.volume=my_vm_volume,file.mport=123,
> file.vport=1234,file.segment=my_segment
> 
> Signed-off-by: Chrysostomos Nanakos 
> Reviewed-by: Stefan Hajnoczi 
> Signed-off-by: Kevin Wolf 
> ---
>  MAINTAINERS |   6 +
>  block/Makefile.objs |   2 +
>  block/archipelago.c | 787 
> 
>  configure   |  40 +++
>  4 files changed, 835 insertions(+)
>  create mode 100644 block/archipelago.c

Judging by configure output in v2.3.0-rc2, QEMU seems to rely on
libxseg, which is GPL-3.0+: https://github.com/grnet/libxseg

How can anyone legally build this backend then? o.O

Any chance libxseg can be relicensed to GPL-2.0+?

Regards,
Andreas

-- 
SUSE Linux GmbH, Maxfeldstr. 5, 90409 Nürnberg, Germany
GF: Felix Imendörffer, Jane Smithard, Jennifer Guild, Dilip Upmanyu,
Graham Norton; HRB 21284 (AG Nürnberg)

[Qemu-devel] seccomp breakage on arm

2015-04-08 Thread Andreas Färber

Hello,

I am seeing the following build failure on openSUSE Tumbleweed armv7l
with --enable-seccomp in v2.3.0-rc2:

[  551s] In file included from qemu-seccomp.c:16:0:
[  551s] /usr/include/libseccomp/seccomp.h:177:23: error: '__NR_mmap'
undeclared here (not in a function)
[  551s]  #define SCMP_SYS(x)  (__NR_##x)
[  551s]^
[  551s] qemu-seccomp.c:36:7: note: in expansion of macro 'SCMP_SYS'
[  551s]  { SCMP_SYS(mmap), 247 },
[  551s]^
[  551s] /usr/include/libseccomp/seccomp.h:177:23: error:
'__NR_getrlimit' undeclared here (not in a function)
[  551s]  #define SCMP_SYS(x)  (__NR_##x)
[  551s]^
[  551s] qemu-seccomp.c:57:7: note: in expansion of macro 'SCMP_SYS'
[  551s]  { SCMP_SYS(getrlimit), 245 },
[  551s]^
[  551s] /home/abuild/rpmbuild/BUILD/qemu-2.3.0-rc2/rules.mak:57: recipe
for target 'qemu-seccomp.o' failed
[  551s] make: *** [qemu-seccomp.o] Error 1

Is this a problem with libseccomp 2.2.0 / master and needs to be fixed
in the library? Or do we need to #ifdef some syscalls in qemu-seccomp.c?

aarch64 builds fine. For ppc and ppc64 we're carrying a libseccomp patch
in openSUSE, those build okay then; ppc64le is still missing in libseccomp.
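
One way to read the "#ifdef some syscalls" option is sketched below. It assumes a Linux host whose <sys/syscall.h> provides the __NR_* constants. The struct name, the priority given to read, and the mmap2/ugetrlimit fallbacks are illustrative assumptions and are not taken from qemu-seccomp.c; only the mmap (247) and getrlimit (245) entries appear in the build log above.

```c
#include <stddef.h>
#include <sys/syscall.h>   /* Linux: defines the __NR_* constants per arch */

/* Hypothetical stand-in for the whitelist entry type in qemu-seccomp.c. */
struct syscall_entry {
    int num;
    unsigned char priority;
};

/* Each arch-dependent syscall is listed only if the headers define it. */
static const struct syscall_entry whitelist[] = {
    { __NR_read, 255 },          /* priority chosen arbitrarily for the sketch */
#ifdef __NR_mmap                 /* not defined on armv7l in the log above */
    { __NR_mmap, 247 },
#endif
#ifdef __NR_mmap2                /* assumed replacement where mmap is absent */
    { __NR_mmap2, 247 },
#endif
#ifdef __NR_getrlimit            /* not defined on armv7l in the log above */
    { __NR_getrlimit, 245 },
#endif
#ifdef __NR_ugetrlimit           /* assumed replacement where getrlimit is absent */
    { __NR_ugetrlimit, 245 },
#endif
};

/* Number of entries that survived the preprocessor guards. */
static size_t whitelist_len(void)
{
    return sizeof(whitelist) / sizeof(whitelist[0]);
}
```

Whether the guards belong in QEMU or the missing constants should instead be supplied by libseccomp is exactly the open question above; the sketch only shows what the QEMU-side variant could look like.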

Regards,
Andreas

-- 
SUSE Linux GmbH, Maxfeldstr. 5, 90409 Nürnberg, Germany
GF: Felix Imendörffer, Jane Smithard, Jennifer Guild, Dilip Upmanyu,
Graham Norton; HRB 21284 (AG Nürnberg)

Re: [Qemu-devel] [PATCH v4 07/20] hw/arm/virt-acpi-build: Generate FADT table and update ACPI headers

2015-04-08 Thread Shannon Zhao

On 2015/4/9 2:53, Michael S. Tsirkin wrote:
> On Fri, Apr 03, 2015 at 06:03:39PM +0800, Shannon Zhao wrote:
>> @@ -135,6 +138,43 @@ struct AcpiFadtDescriptorRev1
>>  } QEMU_PACKED;
>>  typedef struct AcpiFadtDescriptorRev1 AcpiFadtDescriptorRev1;
>>  
>> +struct acpi_generic_address {
>> +uint8_t space_id;/* Address space where struct or register 
>> exists */
>> +uint8_t bit_width;   /* Size in bits of given register */
>> +uint8_t bit_offset;  /* Bit offset within the register */
>> +uint8_t access_width;/* Minimum Access size (ACPI 3.0) */
>> +uint64_t address;/* 64-bit address of struct or register */
>> +} QEMU_PACKED;
> 
> Pls use standard QEMU style for structs.
> There are more like this in the patchset, pls find and fix them.
> 

Ok, thanks.
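
For reference, a sketch of what the requested style might look like, following the struct-plus-typedef pattern already used for AcpiFadtDescriptorRev1 in the hunk context. The name AcpiGenericAddress and the PACKED_SKETCH macro (a stand-in for QEMU_PACKED, assuming GCC or Clang) are assumptions made for this illustration.

```c
#include <stdint.h>

/* Stand-in for QEMU_PACKED; assumes a GCC/Clang-compatible compiler. */
#define PACKED_SKETCH __attribute__((packed))

/* CamelCase struct tag plus matching typedef, as for AcpiFadtDescriptorRev1. */
struct AcpiGenericAddress {
    uint8_t space_id;        /* Address space where struct or register exists */
    uint8_t bit_width;       /* Size in bits of given register */
    uint8_t bit_offset;      /* Bit offset within the register */
    uint8_t access_width;    /* Minimum Access size (ACPI 3.0) */
    uint64_t address;        /* 64-bit address of struct or register */
} PACKED_SKETCH;
typedef struct AcpiGenericAddress AcpiGenericAddress;
```

With packing in effect the structure occupies 12 bytes (four 1-byte fields plus one 8-byte field), which is what the table layout relies on.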

> 
>> +
>> +struct AcpiFadtDescriptorRev5_1 {
>> +ACPI_FADT_COMMON_DEF
>> +uint16_t boot_flags; /* IA-PC Boot Architecture Flags (see below 
>> for individual flags) */
>> +uint8_t reserved;/* Reserved, must be zero */
>> +uint32_t flags;  /* Miscellaneous flag bits (see below for 
>> individual flags) */
>> +struct acpi_generic_address reset_register; /* 64-bit address of the 
>> Reset register */
>> +uint8_t reset_value; /* Value to write to the reset_register port 
>> to reset the system */
>> +uint16_t arm_boot_flags; /* ARM-Specific Boot Flags (see below for 
>> individual flags) (ACPI 5.1) */
>> +uint8_t minor_revision;  /* FADT Minor Revision (ACPI 5.1) */
>> +uint64_t Xfacs;  /* 64-bit physical address of FACS */
>> +uint64_t Xdsdt;  /* 64-bit physical address of DSDT */
>> +struct acpi_generic_address xpm1a_event_block;  /* 64-bit Extended 
>> Power Mgt 1a Event Reg Blk address */
>> +struct acpi_generic_address xpm1b_event_block;  /* 64-bit Extended 
>> Power Mgt 1b Event Reg Blk address */
>> +struct acpi_generic_address xpm1a_control_block;/* 64-bit Extended 
>> Power Mgt 1a Control Reg Blk address */
>> +struct acpi_generic_address xpm1b_control_block;/* 64-bit Extended 
>> Power Mgt 1b Control Reg Blk address */
>> +struct acpi_generic_address xpm2_control_block; /* 64-bit Extended 
>> Power Mgt 2 Control Reg Blk address */
>> +struct acpi_generic_address xpm_timer_block;/* 64-bit Extended 
>> Power Mgt Timer Ctrl Reg Blk address */
>> +struct acpi_generic_address xgpe0_block;/* 64-bit Extended General 
>> Purpose Event 0 Reg Blk address */
>> +struct acpi_generic_address xgpe1_block;/* 64-bit Extended General 
>> Purpose Event 1 Reg Blk address */
>> +struct acpi_generic_address sleep_control;  /* 64-bit Sleep Control 
>> register (ACPI 5.0) */
>> +struct acpi_generic_address sleep_status;   /* 64-bit Sleep Status 
>> register (ACPI 5.0) */
>> +} QEMU_PACKED;
> 
> empty line missing.
> 

ok.

>> +typedef struct AcpiFadtDescriptorRev5_1 AcpiFadtDescriptorRev5_1;
>> +
>> +enum {
>> +ACPI_FADT_ARM_USE_PSCI_G_0_2,
>> +ACPI_FADT_ARM_PSCI_USE_HVC,
>> +};
> 
> These are part of tables, are they not?

They are the values of arm_boot_flags in AcpiFadtDescriptorRev5_1.

> Pls add = 0, = 1, so we don't change them by mistake.

Ok, thanks.
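
A minimal sketch of the enum with explicit values, as requested. It assumes the two enumerators are bit positions within arm_boot_flags; the helper arm_boot_flags() is hypothetical and only shows why the numeric values must not shift if someone reorders or inserts entries.

```c
#include <stdbool.h>
#include <stdint.h>

/* Explicit values: these end up in a guest-visible table. */
enum {
    ACPI_FADT_ARM_USE_PSCI_G_0_2 = 0,
    ACPI_FADT_ARM_PSCI_USE_HVC = 1,
};

/* Hypothetical helper: build arm_boot_flags, treating the enum as bit positions. */
static uint16_t arm_boot_flags(bool use_psci, bool use_hvc)
{
    uint16_t flags = 0;

    if (use_psci) {
        flags |= (uint16_t)(1u << ACPI_FADT_ARM_USE_PSCI_G_0_2);
    }
    if (use_hvc) {
        flags |= (uint16_t)(1u << ACPI_FADT_ARM_PSCI_USE_HVC);
    }
    return flags;
}
```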

> 
>> +
>>  /*
>>   * ACPI 1.0 Root System Description Table (RSDT)
>>   */
>> -- 
>> 2.0.4
>>
> 

Re: [Qemu-devel] [snabb-devel] Re: [PATCH v2] vhost-user: add multi queue support

2015-04-08 Thread Ouyang, Changchun

Hi guys,

> -Original Message-
> From: snabb-de...@googlegroups.com [mailto:snabb-
> de...@googlegroups.com] On Behalf Of Michael S. Tsirkin
> Sent: Monday, April 6, 2015 11:07 PM
> To: Nikolay Nikolaev
> Cc: Long, Thomas; snabb-de...@googlegroups.com; ebl...@redhat.com;
> qemu-devel@nongnu.org; t...@virtualopensystems.com
> Subject: [snabb-devel] Re: [PATCH v2] vhost-user: add multi queue support
> 
> On Sat, Jan 24, 2015 at 02:22:29PM +0200, Nikolay Nikolaev wrote:
> > Vhost-user will implement the multiqueueu support in a similar way to
> > what
> 
> multiqueue
> 
> > vhost already has - a separate thread for each queue.
> >
> > To enable the multiqueue functionality - a new command line parameter
> > "queues" is introduced for the vhost-user netdev.
> >
> > Changes since v1:
> >  - use s->nc.info_str when bringing up/down the backend
> >
> > Signed-off-by: Nikolay Nikolaev 
> > ---
> >  docs/specs/vhost-user.txt |5 +
> >  hw/virtio/vhost-user.c|6 +-
> >  net/vhost-user.c  |   39 +--
> >  qapi-schema.json  |6 +-
> >  qemu-options.hx   |5 +++--
> >  5 files changed, 43 insertions(+), 18 deletions(-)
> >
> > diff --git a/docs/specs/vhost-user.txt b/docs/specs/vhost-user.txt
> > index 650bb18..d7b208c 100644
> > --- a/docs/specs/vhost-user.txt
> > +++ b/docs/specs/vhost-user.txt
> 
> I've been thinking that the protocol might be a useful addition to the virtio
> spec. For this, as a minimum you would have to submit this document as a
> comment to virtio TC with a proposal to include it in the virtio spec.
> See
> https://www.oasis-
> open.org/committees/comments/index.php?wg_abbrev=virtio
> 
> Can you do this?
> 
> We can take it from there, though I would encourage your company to join
> as a contributor.
> 
> 
> > @@ -127,6 +127,11 @@ in the ancillary data:
> >  If Master is unable to send the full message or receives a wrong
> > reply it will  close the connection. An optional reconnection mechanism can
> be implemented.
> >
> > +Multi queue suport
> > +-
> > +The protocol supports multiple queues by setting all index fields in
> > +the sent messages to a properly calculated value.
> > +
> 
> Something that's not clear from this document is what happens with control
> VQ.
> Can you clarify please?
> 
> 
> >  Message types
> >  -
> >
> > diff --git a/hw/virtio/vhost-user.c b/hw/virtio/vhost-user.c index
> > aefe0bb..83ebcaa 100644
> > --- a/hw/virtio/vhost-user.c
> > +++ b/hw/virtio/vhost-user.c
> > @@ -253,17 +253,20 @@ static int vhost_user_call(struct vhost_dev *dev,
> unsigned long int request,
> >  case VHOST_SET_VRING_NUM:
> >  case VHOST_SET_VRING_BASE:
> >  memcpy(&msg.state, arg, sizeof(struct vhost_vring_state));
> > +msg.state.index += dev->vq_index;
> >  msg.size = sizeof(m.state);
> >  break;
> >
> >  case VHOST_GET_VRING_BASE:
> >  memcpy(&msg.state, arg, sizeof(struct vhost_vring_state));
> > +msg.state.index += dev->vq_index;
> >  msg.size = sizeof(m.state);
> >  need_reply = 1;
> >  break;
> >
> >  case VHOST_SET_VRING_ADDR:
> >  memcpy(&msg.addr, arg, sizeof(struct vhost_vring_addr));
> > +msg.addr.index += dev->vq_index;
> >  msg.size = sizeof(m.addr);
> >  break;
> >
> > @@ -271,7 +274,7 @@ static int vhost_user_call(struct vhost_dev *dev,
> unsigned long int request,
> >  case VHOST_SET_VRING_CALL:
> >  case VHOST_SET_VRING_ERR:
> >  file = arg;
> > -msg.u64 = file->index & VHOST_USER_VRING_IDX_MASK;
> > +msg.u64 = (file->index + dev->vq_index) &
> > + VHOST_USER_VRING_IDX_MASK;

I see one vq_index issue here in the VHOST_SET_VRING_CALL case:
vq_index is not initialized before it is used here, so it could hold a random
value.
That leads to an error in vhost when this random value is passed to vhost and
vhost uses it to set the vring call.
 
I have a quick fix for this; the code changes are as follows:
diff --git a/hw/net/vhost_net.c b/hw/net/vhost_net.c
index 4e3a061..2fbdb93 100644
--- a/hw/net/vhost_net.c
+++ b/hw/net/vhost_net.c
@@ -157,6 +157,7 @@ struct vhost_net *vhost_net_init(VhostNetOptions *options)

 net->dev.nvqs = 2;
 net->dev.vqs = net->vqs;
+net->dev.vq_index = net->nc->queue_index;

 r = vhost_dev_init(&net->dev, options->opaque,
options->backend_type, options->force);
diff --git a/net/vhost-user.c b/net/vhost-user.c
index a0b4af2..b27190f 100644
--- a/net/vhost-user.c
+++ b/net/vhost-user.c
@@ -152,6 +152,7 @@ static int net_vhost_user_init(NetClientState *peer, const 
char *device,
 s->nc.receive_disabled = 1;
 s->chr = chr;
 s->vhostforce = vhostforce;
+s->nc.queue_index = i;

 qemu_chr_add_handlers(s->chr, NULL, NULL, net_vhost_user_event, s);
 }
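
To illustrate why the uninitialized field matters, here is a reduced sketch of the index calculation from the quoted hunk. The struct and function names are hypothetical, and the mask value 0xff is an assumption standing in for VHOST_USER_VRING_IDX_MASK.

```c
#include <stdint.h>

/* Stand-in for VHOST_USER_VRING_IDX_MASK; the value is assumed. */
#define VRING_IDX_MASK 0xffu

/* Hypothetical reduced device: only the field the patch adds to the index. */
struct dev_sketch {
    unsigned int vq_index;   /* base ring index of this device */
};

/* Mirrors "(file->index + dev->vq_index) & VHOST_USER_VRING_IDX_MASK". */
static uint64_t global_ring_index(const struct dev_sketch *dev,
                                  unsigned int local_index)
{
    return (uint64_t)((local_index + dev->vq_index) & VRING_IDX_MASK);
}
```

Whatever dev->vq_index holds is added directly to the ring index sent to the backend, so a value that was never set produces an arbitrary ring number in the message.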

Would you guys have a look at

Re: [Qemu-devel] [PATCH] kvm: fix slot flags sync between Qemu and KVM

2015-04-08 Thread Xiao Guangrong




On 04/08/2015 06:46 PM, Paolo Bonzini wrote:



On 08/04/2015 08:34, Xiao Guangrong wrote:

We noticed that KVM keeps tracking dirty pages for the memslots after
a live migration has failed, which causes bad performance because huge
page mapping is disallowed for this kind of memslot.

This is caused by the slot flags not being properly synced between Qemu
and KVM. The current slot update code depends on slot->flags in order
to omit unnecessary ioctls. However, slot->flags only reflects the
status of the corresponding memory region; vmsave and live migration
do dirty tracking, which sets KVM_MEM_LOG_DIRTY_PAGES for the slot on
top of that. As a result, the slot status recorded in the flags does
not exactly match the status in the kernel.

We fixed it by introducing slot->is_dirty_logging, which indicates
the dirty logging status in the kernel and so helps us sync the status
between userspace and the kernel.

Wanpeng Li 
Signed-off-by: Xiao Guangrong 


Hi Xiao,

the patch looks good.

However, I am planning to remove s->migration_log completely from QEMU
2.4 and have slot->flags also track the migration state.  This has the
side effect of fixing this bug.  I'll Cc you on the patches when I post
them (next week probably).


Good to know; I look forward to your patches. Thank you, Paolo!

Re: [Qemu-devel] [PATCH] tcg/tcg-op.c: Fix ld/st of 64 bit values on 32-bit bigendian hosts

2015-04-08 Thread Richard Henderson

On 04/08/2015 12:57 PM, Peter Maydell wrote:
> Switch the ifdef back to HOST_WORDS_BIGENDIAN.
> 
> Signed-off-by: Peter Maydell 

Doh.

Reviewed-by: Richard Henderson 


r~

Re: [Qemu-devel] [PATCH v6 0/8] QEMU memory hot unplug support

2015-04-08 Thread Zhu Guihua



On 04/08/2015 06:47 PM, Paulo Ricardo Paz Vital wrote:

On Wed, 2015-04-08 at 11:52 +0200, Michael S. Tsirkin wrote:

On Wed, Apr 08, 2015 at 05:49:42PM +0800, Zhu Guihua wrote:

Ping...

It's only been 4 days.  We are finalizing 2.3 so pls sit tight.

I agree with Michael, it's time to close 2.3.
But I have a question. Is the patch counter correct? I didn't find
patch 1/8 in my mailbox or in the qemu-devel archive.


My partners have received patch 1/8, but it is indeed not in the qemu-devel
archive. I don't know what happened.

I will resend the series later.

Thanks,
Zhu




On 04/02/2015 05:50 PM, Zhu Guihua wrote:

This patchset adds support to hot unplug memory.

Memory hot unplug is a complicated multi-stage process. The unplug request
callback sends a remove request. After the guest OS processes the ejection
request, OSPM will execute _EJ0 to signal qemu that a device eject is about to
occur. Then qemu will call the unplug callback to eject the device.

v6:
  -improve documentation of memory hot unplug
  -add trace event for device deletion
  -put fix about "Memory device control fields" register in a separate patch

v5:
  -reorganize the patchset
  -add documentation to understand patch easily
  -add MEMORY_SLOT_EJECT for initiating device eject
  -add support to send qmp event to notify mgmt about memory unplug error

v4:
  -reorganize the patchset
  -drop the new API acpi_send_gpe_event()
  -update ssdt-mem

v3:
  -commit message changes
  -reorganize the patchset, squash and separate some patches
  -update specs about acpi_mem_hotplug
  -first cleanup external state, then un-map and un-register memory device

v2:
  -do a generic for acpi to send gpe event
  -unparent object by PC_MACHINE
  -update description in acpi_mem_hotplug.txt
  -combine the last two patches in the last version
  -cleanup external state in acpi_memory_unplug_cb

Tang Chen (3):
   acpi, mem-hotplug: add acpi_memory_slot_status() to get MemStatus
   acpi, mem-hotplug: add unplug request cb for memory device
   acpi, mem-hotplug: add unplug cb for memory device

Zhu Guihua (5):
   docs: update documentation for memory hot unplug
   acpi: extend aml_field() to support UpdateRule
   acpi: fix "Memory device control fields" register
   acpi: add hardware implementation for memory hot unplug
   qmp-event: add event notification for memory hot unplug error

  docs/memory-hotplug.txt   | 23 --
  docs/qmp/qmp-events.txt   | 17 +++
  docs/specs/acpi_mem_hotplug.txt   | 58 +--
  hw/acpi/aml-build.c   |  4 +-
  hw/acpi/ich9.c| 19 ++--
  hw/acpi/memory_hotplug.c  | 96 ---
  hw/acpi/piix4.c   | 17 +--
  hw/core/qdev.c|  2 +-
  hw/i386/acpi-build.c  | 25 --
  hw/i386/acpi-dsdt-mem-hotplug.dsl | 13 +-
  hw/i386/pc.c  | 62 +++--
  include/hw/acpi/aml-build.h   | 10 +++-
  include/hw/acpi/memory_hotplug.h  | 12 +
  include/hw/acpi/pc-hotplug.h  |  3 ++
  include/hw/qdev-core.h|  1 +
  monitor.c |  1 +
  qapi/event.json   | 14 ++
  trace-events  |  4 ++
  18 files changed, 346 insertions(+), 35 deletions(-)

[Qemu-devel] [Bug 1441775] Re: possible null pointer dereference in qemuDomainPinEmulator()

2015-04-08 Thread Eric Blake

** Project changed: qemu => libvirt

-- 
You received this bug notification because you are a member of qemu-
devel-ml, which is subscribed to QEMU.
https://bugs.launchpad.net/bugs/1441775

Title:
  possible null pointer dereference in qemuDomainPinEmulator()

Status in libvirt virtualization API:
  New

Bug description:
  In src/qemu/qemu_driver.c the qemuDomainPinEmulator() routine
  basically does this

   virDomainObjPtr vm;

   if (!(vm = qemuDomObjFromDomain(dom)))
   goto cleanup;

  cleanup:
   qemuDomObjEndAPI(&vm);

  
  If "vm" is null, then this will crash.

  The bug seems to have been added in commit 540c339a, which removed a
  null pointer check:
  -if (vm)
  -virObjectUnlock(vm);
  +qemuDomObjEndAPI(&vm);
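
A minimal sketch of a cleanup helper that tolerates a NULL object, so a shared "cleanup:" label stays safe on the early-exit path. The DomObj type and dom_obj_end_api() are hypothetical stand-ins and are not libvirt's actual definitions.

```c
#include <stddef.h>

/* Hypothetical stand-in for a lockable, reference-counted domain object. */
typedef struct DomObj {
    int locked;
    int refs;
} DomObj;

/* Unlock, drop one reference and clear the caller's pointer.
 * Does nothing when vm or *vm is NULL. */
static void dom_obj_end_api(DomObj **vm)
{
    if (!vm || !*vm) {
        return;
    }
    (*vm)->locked = 0;
    (*vm)->refs--;
    *vm = NULL;
}
```

The alternative is to keep the check at the call site, as the removed "if (vm)" did; putting it inside the helper covers every caller at once.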

To manage notifications about this bug go to:
https://bugs.launchpad.net/libvirt/+bug/1441775/+subscriptions

Re: [Qemu-devel] [PATCH 3/3] block: add 'node-name' field to BLOCK_IMAGE_CORRUPTED

2015-04-08 Thread Eric Blake

On 04/08/2015 03:29 AM, Alberto Garcia wrote:
> Since this event can occur in nodes that cannot have a device name
> associated, include also a field with the node name.
> 
> Signed-off-by: Alberto Garcia 
> ---
>  block/qcow2.c   |  8 ++--
>  docs/qmp/qmp-events.txt | 21 +
>  qapi/block-core.json| 17 +++--
>  3 files changed, 30 insertions(+), 16 deletions(-)
> 
>  
> -- "device": Device name (json-string)
> -- "msg":Informative message (e.g., reason for the corruption) 
> (json-string)
> -- "offset": If the corruption resulted from an image access, this is the 
> access
> -offset into the image (json-int)
> -- "size":   If the corruption resulted from an image access, this is the 
> access
> -size (json-int)
> +- "device":Device name (json-string)
> +- "node-name": Node name (json-string, optional)
> +- "msg":   Informative message (e.g., reason for the corruption)
> +   (json-string)
> +- "offset":If the corruption resulted from an image access, this
> +   is the access offset into the image (json-int)
> +- "size":  If the corruption resulted from an image access, this
> +   is the access size (json-int)

Not your fault (so don't worry about fixing it here), but I still find
this definition of 'offset' confusing - is it the guest's offset, or the
host's offset?  I'm going to assume the host's offset (remember, on
qcow2, the guest offset 0 is never at host offset 0, because that is
reserved for the qcow2 header - but we CAN encounter a read error while
reading the qcow2 header).

Reviewed-by: Eric Blake 

-- 
Eric Blake   eblake redhat com+1-919-301-3266
Libvirt virtualization library http://libvirt.org

Re: [Qemu-devel] [PATCH] qemu-config: Accept empty option values

2015-04-08 Thread Eric Blake

On 04/08/2015 12:16 PM, Eduardo Habkost wrote:
> Currently it is impossible to set an option in a config file to an empty
> string, because the parser matches only lines containing non-empty
> strings between double-quotes.
> 
> As sscanf() "[" conversion specifier only matches non-empty strings, add
> a special case for empty strings.

I avoid sscanf() as a rule (as its behavior on %d is undefined in the
face of malicious input), so I had to read the man page; but you are
right.  Libvirt is trying to completely ban use of *scanf for that
reason; but obviously qemu is not quite so opposed to it.

> 
> Signed-off-by: Eduardo Habkost 
> ---
>  util/qemu-config.c | 3 ++-
>  1 file changed, 2 insertions(+), 1 deletion(-)
> 
> diff --git a/util/qemu-config.c b/util/qemu-config.c
> index 2d32ce7..9f9577d 100644
> --- a/util/qemu-config.c
> +++ b/util/qemu-config.c
> @@ -413,7 +413,8 @@ int qemu_config_parse(FILE *fp, QemuOptsList **lists, 
> const char *fname)
>  opts = qemu_opts_create(list, NULL, 0, &error_abort);
>  continue;
>  }
> -if (sscanf(line, " %63s = \"%1023[^\"]\"", arg, value) == 2) {
> +if (sscanf(line, " %63s = \"%1023[^\"]\"", arg, value) == 2 ||
> +(value[0] = '\0', sscanf(line, " %63s = \"\"", arg) == 1)) {

This is one of the few times I've seen sscanf used in a well-defined
manner (albeit still arbitrarily limiting, in that we have fixed-size
buffers) - but having to rely on a comma operator to get there makes
this look quite arcane.  I still wonder if hand-rolling a real scanner
would beat the compactness of sscanf by making the code intentions a
little more discernible, and have the benefits of avoiding my sscanf
red-flag checker.  But my wonder is not enough to stop me from accepting
this hack as-is.
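
For illustration, the same two format strings can be wrapped in a helper without the comma operator. This is only a sketch: parse_option_line() is a hypothetical name, and the trailing %n is an addition that is not in the patch. It is there because sscanf() returns 1 as soon as the name is converted, even if the literal `""` that follows did not match; %n is stored only when scanning actually reaches it.

```c
#include <stdio.h>

/* Parse `name = "value"` or `name = ""` from line.
 * arg must hold at least 64 bytes and value at least 1024 bytes.
 * Returns 1 on a match, 0 otherwise. As in the patch, the closing
 * quote of a non-empty value is not verified. */
static int parse_option_line(const char *line, char *arg, char *value)
{
    int end = -1;

    /* Non-empty value: "%[" needs at least one character to succeed. */
    value[0] = '\0';
    if (sscanf(line, " %63s = \"%1023[^\"]\"", arg, value) == 2) {
        return 1;
    }

    /* Empty value: end stays -1 unless both quotes were matched. */
    value[0] = '\0';
    if (sscanf(line, " %63s = \"\"%n", arg, &end) == 1 && end >= 0) {
        return 1;
    }
    return 0;
}
```

The fixed-size buffers are still as limiting as noted above; the helper only makes the order of the two attempts explicit.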

Reviewed-by: Eric Blake 

-- 
Eric Blake   eblake redhat com+1-919-301-3266
Libvirt virtualization library http://libvirt.org
