Michael S. Tsirkin <m...@redhat.com> wrote on Mon, Jan 30, 2023 at 23:07:
>
> On Mon, Jan 30, 2023 at 10:47:25PM +0800, Feng Sun wrote:
> > Igor Mammedov <imamm...@redhat.com> wrote on Tue, Jan 24, 2023 at 18:30:
> > >
> > > On Tue, 17 Jan 2023 19:15:21 +0800
> > > Sun Feng <loyo...@gmail.com> wrote:
> > >
> > > > Migrate from aarch64 host with PAGE_SIZE 64k to 4k failed with
> > > > following errors:
> > > >
> > > > qmp_cmd_name: migrate-incoming, arguments: {"uri": "tcp:[::]:49152"}
> > > > {"timestamp": {"seconds": 1673922775, "microseconds": 534702}, "event":
> > > > "MIGRATION", "data": {"status": "setup"}}
> > > > {"timestamp": {"seconds": 1673922776, "microseconds": 53003}, "event":
> > > > "MIGRATION", "data": {"status": "active"}}
> > > > 2023-01-17T02:32:56.058827Z qemu-system-aarch64: Length too large:
> > > > /rom@etc/acpi/rsdp: 0x10000 > 0x1000: Invalid argument
> > >
> > > this should mention/explain why it's happening.
> > >
> > > i.e we now have 4k limit for RSDP, but then source somehow managed to
> > > start with 64k
> > > allocated to for RSDP. It looks like limit isn't working as expected to
> > > me.
> >
> > 4k limit should be romsize limit. I can see Rom '/rom@etc/acpi/rsdp'
> > with romsize:4096, datasize:36.
> > RAMBlock's used_length is set with datasize aligned to PAGE_SIZE, so
> > it become 64k when PAGE_SIZE is 64k.
> > ```
> > static
> > RAMBlock *qemu_ram_alloc_internal(ram_addr_t size, ram_addr_t max_size,
> >                                   void (*resized)(const char*,
> >                                                   uint64_t length,
> >                                                   void *host),
> >                                   void *host, uint32_t ram_flags,
> >                                   MemoryRegion *mr, Error **errp)
> > {
> >     RAMBlock *new_block;
> >     Error *local_err = NULL;
> >
> >     assert((ram_flags & ~(RAM_SHARED | RAM_RESIZEABLE | RAM_PREALLOC |
> >                           RAM_NORESERVE)) == 0);
> >     assert(!host ^ (ram_flags & RAM_PREALLOC));
> >
> >     size = HOST_PAGE_ALIGN(size);
> >     max_size = HOST_PAGE_ALIGN(max_size);
> >     new_block = g_malloc0(sizeof(*new_block));
> >     new_block->mr = mr;
> >     new_block->resized = resized;
> >     new_block->used_length = size;
> > ```
> > So when migrate to 4k PAGE_SIZE, it will report the errors.
> >
> > ramblock information for PAGE_SIZE 64k and 4k.
> > ```
> > # getconf PAGE_SIZE
> > 65536
> > # virsh qemu-monitor-command testvm --hmp 'info ramblock'
> > Block Name                            PSize   Offset              Used                Total
> > mach-virt.ram                         64 KiB  0x0000000000000000  0x0000000040000000  0x0000000040000000
> > virt.flash0                           64 KiB  0x0000000040000000  0x0000000004000000  0x0000000004000000
> > virt.flash1                           64 KiB  0x0000000044000000  0x0000000004000000  0x0000000004000000
> > /rom@etc/acpi/tables                  64 KiB  0x0000000048040000  0x0000000000020000  0x0000000000200000
> > 0000:00:01.2:00.0/virtio-net-pci.rom  64 KiB  0x0000000048000000  0x0000000000040000  0x0000000000040000
> > /rom@etc/table-loader                 64 KiB  0x0000000048240000  0x0000000000010000  0x0000000000010000
> > /rom@etc/acpi/rsdp                    64 KiB  0x0000000048280000  0x0000000000010000  0x0000000000010000
> >
> > # getconf PAGE_SIZE
> > 4096
> > # virsh qemu-monitor-command testvm --hmp 'info ramblock'
> > Block Name             PSize   Offset              Used                Total
> > mach-virt.ram          4 KiB   0x0000000000000000  0x0000000800000000  0x0000000800000000
> > virt.flash0            4 KiB   0x0000000800000000  0x0000000004000000  0x0000000004000000
> > virt.flash1            4 KiB   0x0000000804000000  0x0000000004000000  0x0000000004000000
> > /rom@etc/acpi/tables   4 KiB   0x0000000808000000  0x0000000000020000  0x0000000000200000
> > /rom@etc/table-loader  4 KiB   0x0000000808200000  0x0000000000001000  0x0000000000010000
> > /rom@etc/acpi/rsdp     4 KiB   0x0000000808240000  0x0000000000001000  0x0000000000001000
> > ```
>
> Oh interesting. I don't remember why I decided to align in.
> What does the following do (warning: completely untested):
>
>
> diff --git a/softmmu/physmem.c b/softmmu/physmem.c
> index cb998cdf23..5c732101b9 100644
> --- a/softmmu/physmem.c
> +++ b/softmmu/physmem.c
> @@ -2154,7 +2154,7 @@ RAMBlock *qemu_ram_alloc_internal(ram_addr_t size, ram_addr_t max_size,
>                            RAM_NORESERVE)) == 0);
>      assert(!host ^ (ram_flags & RAM_PREALLOC));
>
> -    size = HOST_PAGE_ALIGN(size);
> +    // size = HOST_PAGE_ALIGN(size);
>      max_size = HOST_PAGE_ALIGN(max_size);
>      new_block = g_malloc0(sizeof(*new_block));
>      new_block->mr = mr;
>
With one additional change, 'info ramblock' shows the actually used size:

--- a/softmmu/physmem.c
+++ b/softmmu/physmem.c
@@ -1837,7 +1837,7 @@ int qemu_ram_resize(RAMBlock *block, ram_addr_t newsize, Error **errp)

     assert(block);

-    newsize = HOST_PAGE_ALIGN(newsize);
+    //newsize = HOST_PAGE_ALIGN(newsize);

     if (block->used_length == newsize) {
         /*

# virsh qemu-monitor-command testvm --hmp 'info ramblock'
Block Name             PSize   Offset              Used                Total
mach-virt.ram          64 KiB  0x0000000000000000  0x0000000800000000  0x0000000800000000
virt.flash0            64 KiB  0x0000000800000000  0x0000000004000000  0x0000000004000000
virt.flash1            64 KiB  0x0000000804000000  0x0000000004000000  0x0000000004000000
/rom@etc/acpi/tables   64 KiB  0x0000000808000000  0x0000000000020000  0x0000000000200000
/rom@etc/table-loader  64 KiB  0x0000000808200000  0x0000000000000b00  0x0000000000010000
/rom@etc/acpi/rsdp     64 KiB  0x0000000808240000  0x0000000000000024  0x0000000000010000

But migration needs more changes. With both alignment calls removed, migration
first hit the following assertion:

qemu-system-aarch64: ../softmmu/physmem.c:1059: cpu_physical_memory_test_and_clear_dirty: Assertion `start >= ramblock->offset && start + length <= ramblock->offset + ramblock->used_length' failed.
2023-01-31 04:09:40.934+0000: shutting down, reason=crashed

I got past it with this change:

--- a/softmmu/physmem.c
+++ b/softmmu/physmem.c
@@ -1055,7 +1055,7 @@ bool cpu_physical_memory_test_and_clear_dirty(ram_addr_t start,
     ramblock = qemu_get_ram_block(start);
     /* Range sanity check on the ramblock */
     assert(start >= ramblock->offset &&
-           start + length <= ramblock->offset + ramblock->used_length);
+           start + length <= ramblock->offset + ramblock->max_length);

     while (page < end) {
         unsigned long idx = page / DIRTY_MEMORY_BLOCK_SIZE;

But more issues still exist.

source:
2023-01-31T05:23:28.051615Z qemu-system-aarch64: failed to save SaveStateEntry with id(name): 3(ram): -5
2023-01-31T05:23:28.053256Z qemu-system-aarch64: Unable to write to socket: Bad file descriptor

target:
2023-01-31T05:23:28.049659Z qemu-system-aarch64: Received an unexpected compressed page
2023-01-31T05:23:28.049709Z qemu-system-aarch64: error while loading state for instance 0x0 of device 'ram'
2023-01-31T05:23:28.050095Z qemu-system-aarch64: load of migration failed: Invalid argument

In my opinion, setting the size to 64k is a hacky workaround, but it would not
cause migration compatibility problems. Of course, the best and most
appropriate way is to migrate with the actual data size. I am not very familiar
with the migration code; if needed, I can help test further migration patches.
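
To make the size mismatch in this thread concrete, here is a minimal standalone sketch of the arithmetic. It is not QEMU code: `page_align_up` is a hypothetical stand-in for HOST_PAGE_ALIGN, and `incoming_length_fits` is a simplified model I am assuming for the destination-side check behind the "Length too large" message. The numbers (romsize 4096, datasize 36) are the ones reported above.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/*
 * Round size up to the next multiple of page_size.
 * Hypothetical stand-in for HOST_PAGE_ALIGN; page_size must be a
 * non-zero power of two.
 */
static uint64_t page_align_up(uint64_t size, uint64_t page_size)
{
    assert(page_size != 0 && (page_size & (page_size - 1)) == 0);
    return (size + page_size - 1) & ~(page_size - 1);
}

/*
 * Simplified model (an assumption, not the real migration code) of the
 * destination check: the used length announced by the source has to fit
 * into the maximum length of the destination's RAMBlock.
 */
static bool incoming_length_fits(uint64_t src_used_length,
                                 uint64_t dst_max_length)
{
    return src_used_length <= dst_max_length;
}
```

With datasize 36, `page_align_up(36, 65536)` gives 0x10000 on the 64k source, while the destination's limit is `page_align_up(4096, 4096)`, i.e. 0x1000, so `incoming_length_fits(0x10000, 0x1000)` is false. That matches the `0x10000 > 0x1000` in the error message. If the unaligned datasize (36) were sent instead, the same check would pass.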