Re: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...
Xu, Jiajun wrote: On Wednesday, November 26, 2008 8:43 PM [EMAIL PROTECTED] wrote: Xu, Jiajun wrote: The call trace messages in guest: ### Kernel BUG at block/elevator.c:560 invalid opcode: [1] SMP last sysfs file: /block/hda/removable This suggests something happened to /dev/hda. Could be a timeout or something. Are there any messages in the log before the BUG? There are two parts call traces: First, when I do live migration, on the original guest there are some call trace: BUG: soft lockup detected on CPU#0! Call Trace: [] softlockup_tick+0xd5/0xe7 [] update_process_times+0x42/0x68 [] smp_local_timer_interrupt+0x23/0x47 [] smp_apic_timer_interrupt+0x41/0x47 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] ide_outb+0x8/0x9 [] ide_do_request+0x408/0x787 [] elv_insert+0xd6/0x1f7 [] ide_do_drive_cmd+0xc8/0x11e [] ide_outsw+0x0/0x9 [] :ide_cd:cdrom_queue_packet_command+0x46/0xe2 [] ide_outsw+0x8/0x9 [] ide_init_drive_cmd+0x10/0x24 [] :ide_cd:cdrom_check_status+0x62/0x71 [] blk_end_sync_rq+0x0/0x2e [] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37 [] :cdrom:media_changed+0x44/0x74 [] check_disk_change+0x1f/0x76 [] :cdrom:cdrom_open+0x92c/0x979 [] do_lookup+0x65/0x1c3 [] dput+0x2c/0x114 [] __link_path_walk+0xdf8/0xf42 [] mntput_no_expire+0x19/0x89 [] link_path_walk+0xd3/0xe5 [] do_nanosleep+0x47/0x70 [] kobject_get+0x12/0x17 [] get_disk+0x3f/0x81 [] exact_lock+0xc/0x14 [] kobj_lookup+0x132/0x19b [] :ide_cd:idecd_open+0x9f/0xd1 [] do_open+0xa2/0x30f [] may_open+0x58/0x21e [] blkdev_open+0x0/0x4f [] blkdev_open+0x23/0x4f [] __dentry_open+0xd9/0x1dc [] do_filp_open+0x2a/0x38 [] do_nanosleep+0x47/0x70 [] do_sys_open+0x44/0xbe [] tracesys+0xd5/0xe0 Anthony, could migration be starving qemu_mutex and thus the guest? -- error compiling committee.c: too many arguments to function -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
RE: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...
On Wednesday, November 26, 2008 8:43 PM [EMAIL PROTECTED] wrote: > Xu, Jiajun wrote: >> The call trace messages in guest: >> ### >> Kernel BUG at block/elevator.c:560 >> invalid opcode: [1] SMP >> last sysfs file: /block/hda/removable >> > > This suggests something happened to /dev/hda. Could be a timeout or > something. Are there any messages in the log before the BUG? There are two parts call traces: First, when I do live migration, on the original guest there are some call trace: BUG: soft lockup detected on CPU#0! Call Trace: [] softlockup_tick+0xd5/0xe7 [] update_process_times+0x42/0x68 [] smp_local_timer_interrupt+0x23/0x47 [] smp_apic_timer_interrupt+0x41/0x47 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] ide_outb+0x8/0x9 [] ide_do_request+0x408/0x787 [] elv_insert+0xd6/0x1f7 [] ide_do_drive_cmd+0xc8/0x11e [] ide_outsw+0x0/0x9 [] :ide_cd:cdrom_queue_packet_command+0x46/0xe2 [] ide_outsw+0x8/0x9 [] ide_init_drive_cmd+0x10/0x24 [] :ide_cd:cdrom_check_status+0x62/0x71 [] blk_end_sync_rq+0x0/0x2e [] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37 [] :cdrom:media_changed+0x44/0x74 [] check_disk_change+0x1f/0x76 [] :cdrom:cdrom_open+0x92c/0x979 [] do_lookup+0x65/0x1c3 [] dput+0x2c/0x114 [] __link_path_walk+0xdf8/0xf42 [] mntput_no_expire+0x19/0x89 [] link_path_walk+0xd3/0xe5 [] do_nanosleep+0x47/0x70 [] kobject_get+0x12/0x17 [] get_disk+0x3f/0x81 [] exact_lock+0xc/0x14 [] kobj_lookup+0x132/0x19b [] :ide_cd:idecd_open+0x9f/0xd1 [] do_open+0xa2/0x30f [] may_open+0x58/0x21e [] blkdev_open+0x0/0x4f [] blkdev_open+0x23/0x4f [] __dentry_open+0xd9/0x1dc [] do_filp_open+0x2a/0x38 [] do_nanosleep+0x47/0x70 [] do_sys_open+0x44/0xbe [] tracesys+0xd5/0xe0 BUG: soft lockup detected on CPU#0! Call Trace: [] softlockup_tick+0xd5/0xe7 [] update_process_times+0x42/0x68 [] smp_local_timer_interrupt+0x23/0x47 [] smp_apic_timer_interrupt+0x41/0x47 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] handle_IRQ_event+0x1b/0x58 [] __do_IRQ+0xa4/0x105 [] do_IRQ+0xe7/0xf5 [] ret_from_intr+0x0/0xa [] ide_outb+0x0/0x9 [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] ide_outb+0x8/0x9 [] ide_do_request+0x408/0x787 [] elv_insert+0xd6/0x1f7 [] ide_do_drive_cmd+0xc8/0x11e [] ide_outsw+0x0/0x9 [] :ide_cd:cdrom_queue_packet_command+0x46/0xe2 [] ide_outsw+0x8/0x9 [] ide_init_drive_cmd+0x10/0x24 [] :ide_cd:cdrom_check_status+0x62/0x71 [] blk_end_sync_rq+0x0/0x2e [] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37 [] :cdrom:media_changed+0x44/0x74 [] check_disk_change+0x1f/0x76 [] :cdrom:cdrom_open+0x92c/0x979 [] do_lookup+0x65/0x1c3 [] dput+0x2c/0x114 [] __link_path_walk+0xdf8/0xf42 [] mntput_no_expire+0x19/0x89 [] link_path_walk+0xd3/0xe5 [] do_nanosleep+0x47/0x70 [] kobject_get+0x12/0x17 [] get_disk+0x3f/0x81 [] exact_lock+0xc/0x14 [] kobj_lookup+0x132/0x19b [] :ide_cd:idecd_open+0x9f/0xd1 [] do_open+0xa2/0x30f [] may_open+0x58/0x21e [] blkdev_open+0x0/0x4f [] blkdev_open+0x23/0x4f [] __dentry_open+0xd9/0x1dc [] do_filp_open+0x2a/0x38 [] do_nanosleep+0x47/0x70 [] do_sys_open+0x44/0xbe [] tracesys+0xd5/0xe0 BUG: soft lockup detected on CPU#0! Call Trace: [] softlockup_tick+0xd5/0xe7 [] update_process_times+0x42/0x68 [] smp_local_timer_interrupt+0x23/0x47 [] smp_apic_timer_interrupt+0x41/0x47 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] handle_IRQ_event+0x1b/0x58 [] __do_IRQ+0xa4/0x105 [] do_IRQ+0xe7/0xf5 [] ret_from_intr+0x0/0xa [] ide_outb+0x0/0x9 [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] apic_timer_interrupt+0x66/0x6c [] ide_outb+0x0/0x9 [] ide_outb+0x8/0x9 [] ide_do_request+0x408/0x787 [] elv_insert+0xd6/0x1f7 [] ide_do_drive_cmd+0xc8/0x11e [] ide_outsw+0x0/0x9 [] :ide_cd:cdrom_queue_packet_command+0x46/0xe2 [] ide_outsw+0x8/0x9 [] ide_init_drive_cmd+0x10/0x24 [] :ide_cd:cdrom_check_status+0x62/0x71 [] blk_end_sync_rq+0x0/0x2e [] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37 [] :cdrom:media_changed+0x44/0x74 [] check_disk_change+0x1f/0x76 [] :cdrom:cdrom_open+0x92c/0x979 [] do_lookup+0x65/0x1c3 [] dput+0x2c/0x114 [] __link_path_walk+0xdf8/0xf42 [] mntput_no_expire+0x19/0x89 [] link_path_walk+0xd3/0xe5 [] do_nanosleep+0x47/0x70 [] kobject_get+0x12/0x17 [] get_disk+0x3f/0x81 [] exact_lock+0xc/0x14 [] kobj_lookup+0x132/0x19b [] :ide_cd:idecd_open+0x9f/0xd1 [] do_open+0xa2/0x30f [] may_open+0x58/0x21e [] blkdev_open+0x0/0x4f [] blkdev_open+0x23/0x4f [] __dentry_open+0xd9/0x1dc [] do_filp_open+0x2a/0x38 [] do_nanosleep+0x47/0x70 [] do_sys_open+0x44/0xbe [] tracesys+0xd5/0xe0 # Then, the destination guest shows call trace same as I sent in
Re: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...
Xu, Jiajun wrote: The call trace messages in guest: ### Kernel BUG at block/elevator.c:560 invalid opcode: [1] SMP last sysfs file: /block/hda/removable This suggests something happened to /dev/hda. Could be a timeout or something. Are there any messages in the log before the BUG? -- error compiling committee.c: too many arguments to function -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
RE: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...
On Monday, November 24, 2008 12:57 AM Avi Kivity wrote: > Xu, Jiajun wrote: >> 2. failure to migrate guests with more than 4GB of RAM >> > https://sourceforge.net/tracker/index.php?func=detail&aid=19715 > 12&group_id=180599&atid=893831 >> >> > > Can you retest this? I successfully migrated a 5G guest (from a 4G > host to itself; slo...)/ I tried latest commit, userspace.git 6e63ba19476753595e508713eb9daf559dc50bf6 with a 64-bit RHEL5.1 Guest. My host kernel is 2.6.26.2. And My host has 8GB memory and 4GB swap. Guest can be live migrated, but after that, guest will call trace. Maybe we can have a check with each other's environment. My steps as following: 1. qemu-system-x86_64 -incoming tcp:localhost: -m 4096 -net nic,macaddr=00:16:3e:44:1a:a6,model=rtl8139 -net tap,script=/etc/kvm/qemu-ifup -hda /share/xvs/var/rhel5u1.img 2. qemu-system-x86_64 -m 4096 -net nic,macaddr=00:16:3e:44:1a:a6,model=rtl8139 -net tap,script=/etc/kvm/qemu-ifup -hda /share/xvs/var/rhel5u1.img 3. In qemu console, type "migrate tcp:localhost:" The call trace messages in guest: ### Kernel BUG at block/elevator.c:560 invalid opcode: [1] SMP last sysfs file: /block/hda/removable CPU 0 Modules linked in: ipv6 autofs4 hidp rfcomm l2cap bluetooth sunrpc iscsi_tcp ib_iser libiscsi scsi_transport_iscsi rdma_ucm ib_ucm ib_srp ib_sdp rdma_cm ib_cm iw_cm ib_addr ib_local_sa ib_ipoib ib_sa ib_uverbs ib_umad ib_mad ib_core dm_mirror dm_multipath dm_mod video sbs backlight i2c_ec i2c_core button battery asus_acpi acpi_memhotplug ac lp floppy pcspkr serio_raw 8139cp 8139too parport_pc parport mii ide_cd cdrom ata_piix libata sd_mod scsi_mod ext3 jbd ehci_hcd ohci_hcd uhci_hcd Pid: 0, comm: swapper Not tainted 2.6.18-53.el5 #1 RIP: 0010:[] [] elv_dequeue_request+0x8/0x3c RSP: 0018:8040ddc0 EFLAGS: 00010046 RAX: 0001 RBX: 81011381b398 RCX: RDX: 81011381b398 RSI: 81011381b398 RDI: 81011fb912c0 RBP: 804abe18 R08: 80304108 R09: 0012 R10: 0022 R11: R12: R13: 0001 R14: 0086 R15: 8040deb8 FS: () GS:80396000() knlGS: CS: 0010 DS: 0018 ES: 0018 CR0: 8005003b CR2: 2ad6f4d0 CR3: 0001126cc000 CR4: 06e0 Process swapper (pid: 0, threadinfo 803c6000, task 802dcae0) Stack: 8000ae3c 804abe18 804abe50 804abd00 0246 8003ba73 8003ba0c 804abe18 81011fbe5800 8000d2a5 81011fb8c5c0 Call Trace: [] ide_end_request+0xc6/0xfc [] ide_dma_intr+0x67/0xab [] ide_dma_intr+0x0/0xab [] ide_intr+0x16f/0x1df [] handle_IRQ_event+0x29/0x58 [] __do_IRQ+0xa4/0x105 [] do_IRQ+0xe7/0xf5 [] ret_from_intr+0x0/0xa [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] default_idle+0x0/0x50 [] apic_timer_interrupt+0x66/0x6c [] default_idle+0x29/0x50 [] cpu_idle+0x95/0xb8 [] start_kernel+0x220/0x225 [] _sinittext+0x237/0x23e Code: 0f 0b 68 25 50 29 80 c2 30 02 48 8b 46 08 48 89 42 08 48 89 RIP [] elv_dequeue_request+0x8/0x3c RSP <0>Kernel panic - not syncing: Fatal exception BUG: warning at kernel/panic.c:137/panic() (Not tainted) Call Trace: [] panic+0x1e3/0x1f4 [] do_unblank_screen+0x1b/0x132 [] oops_end+0x51/0x53 [] die+0x3a/0x44 [] do_invalid_op+0xad/0xb7 [] elv_dequeue_request+0x8/0x3c [] do_timer+0x2e8/0x53c [] main_timer_handler+0x23d/0x3f4 [] error_exit+0x0/0x84 [] elv_dequeue_request+0x8/0x3c [] ide_end_request+0xc6/0xfc [] ide_dma_intr+0x67/0xab [] ide_dma_intr+0x0/0xab [] ide_intr+0x16f/0x1df [] handle_IRQ_event+0x29/0x58 [] __do_IRQ+0xa4/0x105 [] do_IRQ+0xe7/0xf5 [] ret_from_intr+0x0/0xa [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] default_idle+0x0/0x50 [] apic_timer_interrupt+0x66/0x6c [] default_idle+0x29/0x50 [] cpu_idle+0x95/0xb8 [] start_kernel+0x220/0x225 [] _sinittext+0x237/0x23e BUG: warning at drivers/input/serio/i8042.c:846/i8042_panic_blink() (Not tainted) Call Trace: [] i8042_panic_blink+0x112/0x2a5 [] panic+0x189/0x1f4 [] do_unblank_screen+0x1b/0x132 [] oops_end+0x51/0x53 [] die+0x3a/0x44 [] do_invalid_op+0xad/0xb7 [] elv_dequeue_request+0x8/0x3c [] do_timer+0x2e8/0x53c [] main_timer_handler+0x23d/0x3f4 [] error_exit+0x0/0x84 [] elv_dequeue_request+0x8/0x3c [] ide_end_request+0xc6/0xfc [] ide_dma_intr+0x67/0xab [] ide_dma_intr+0x0/0xab [] ide_intr+0x16f/0x1df [] handle_IRQ_event+0x29/0x58 [] __do_IRQ+0xa4/0x105 [] do_IRQ+0xe7/0xf5 [] ret_from_intr+0x0/0xa [] __do_softirq+0x53/0xd5 [] call_softirq+0x1c/0x28 [] do_softirq+0x2c/0x85 [] default_idle+0x0/0x50 [] apic_timer_interrupt+0x66/0x6c [] default_idle+0x29/0x50 [] cpu_idle+0x95/0xb8 [] start_kernel+0x220/0x225 [] _sinittext+0x237/0x23e BUG: warning at drivers/input
Re: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...
Xu, Jiajun wrote: 2. failure to migrate guests with more than 4GB of RAM https://sourceforge.net/tracker/index.php?func=detail&aid=1971512&group_id=180599&atid=893831 Can you retest this? I successfully migrated a 5G guest (from a 4G host to itself; slo...)/ -- error compiling committee.c: too many arguments to function -- To unsubscribe from this list: send the line "unsubscribe kvm" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html