RE: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...

2008-11-27 Thread Xu, Jiajun
On Wednesday, November 26, 2008 8:43 PM [EMAIL PROTECTED] wrote:

 Xu, Jiajun wrote:
 The call trace messages in guest:
 ###
 Kernel BUG at block/elevator.c:560
 invalid opcode:  [1] SMP
 last sysfs file: /block/hda/removable
 
 
 This suggests something happened to /dev/hda.  Could be a timeout or
 something.  Are there any messages in the log before the BUG?

There are two sets of call traces.
First, when I do live migration, the original guest shows some call 
traces:

BUG: soft lockup detected on CPU#0!

Call Trace:
 IRQ  [800b50fa] softlockup_tick+0xd5/0xe7
 [800930e2] update_process_times+0x42/0x68
 [800746e3] smp_local_timer_interrupt+0x23/0x47
 [80074da5] smp_apic_timer_interrupt+0x41/0x47
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 [800076b2] ide_outb+0x0/0x9
 [80011ca9] __do_softirq+0x53/0xd5
 [8005c2fc] call_softirq+0x1c/0x28
 [8006a53a] do_softirq+0x2c/0x85
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 EOI  [800076b2] ide_outb+0x0/0x9
 [800076ba] ide_outb+0x8/0x9
 [8000eac7] ide_do_request+0x408/0x787
 [80134f37] elv_insert+0xd6/0x1f7
 [8003f5af] ide_do_drive_cmd+0xc8/0x11e
 [8005857f] ide_outsw+0x0/0x9
 [8811f198] :ide_cd:cdrom_queue_packet_command+0x46/0xe2
 [80058587] ide_outsw+0x8/0x9
 [801c127a] ide_init_drive_cmd+0x10/0x24
 [8811f462] :ide_cd:cdrom_check_status+0x62/0x71
 [801360b5] blk_end_sync_rq+0x0/0x2e
 [8811f48d] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37
 [8810d076] :cdrom:media_changed+0x44/0x74
 [800d9b83] check_disk_change+0x1f/0x76
 [88111399] :cdrom:cdrom_open+0x92c/0x979
 [8000c9c9] do_lookup+0x65/0x1c3
 [8000cec4] dput+0x2c/0x114
 [8000a2e0] __link_path_walk+0xdf8/0xf42
 [8002c86b] mntput_no_expire+0x19/0x89
 [8000e630] link_path_walk+0xd3/0xe5
 [80061c7e] do_nanosleep+0x47/0x70
 [800560a3] kobject_get+0x12/0x17
 [80138bbd] get_disk+0x3f/0x81
 [80058ef8] exact_lock+0xc/0x14
 [801ad98f] kobj_lookup+0x132/0x19b
 [8811fe4b] :ide_cd:idecd_open+0x9f/0xd1
 [800da1bc] do_open+0xa2/0x30f
 [80011d83] may_open+0x58/0x21e
 [800da67d] blkdev_open+0x0/0x4f
 [800da6a0] blkdev_open+0x23/0x4f
 [8001e0f4] __dentry_open+0xd9/0x1dc
 [800270dc] do_filp_open+0x2a/0x38
 [80061c7e] do_nanosleep+0x47/0x70
 [80019523] do_sys_open+0x44/0xbe
 [8005b28d] tracesys+0xd5/0xe0

BUG: soft lockup detected on CPU#0!

Call Trace:
 IRQ  [800b50fa] softlockup_tick+0xd5/0xe7
 [800930e2] update_process_times+0x42/0x68
 [800746e3] smp_local_timer_interrupt+0x23/0x47
 [80074da5] smp_apic_timer_interrupt+0x41/0x47
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 [800076b2] ide_outb+0x0/0x9
 [80010792] handle_IRQ_event+0x1b/0x58
 [800b5482] __do_IRQ+0xa4/0x105
 [8006a3bd] do_IRQ+0xe7/0xf5
 [8005b615] ret_from_intr+0x0/0xa
 [800076b2] ide_outb+0x0/0x9
 [80011ca9] __do_softirq+0x53/0xd5
 [8005c2fc] call_softirq+0x1c/0x28
 [8006a53a] do_softirq+0x2c/0x85
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 EOI  [800076b2] ide_outb+0x0/0x9
 [800076ba] ide_outb+0x8/0x9
 [8000eac7] ide_do_request+0x408/0x787
 [80134f37] elv_insert+0xd6/0x1f7
 [8003f5af] ide_do_drive_cmd+0xc8/0x11e
 [8005857f] ide_outsw+0x0/0x9
 [8811f198] :ide_cd:cdrom_queue_packet_command+0x46/0xe2
 [80058587] ide_outsw+0x8/0x9
 [801c127a] ide_init_drive_cmd+0x10/0x24
 [8811f462] :ide_cd:cdrom_check_status+0x62/0x71
 [801360b5] blk_end_sync_rq+0x0/0x2e
 [8811f48d] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37
 [8810d076] :cdrom:media_changed+0x44/0x74
 [800d9b83] check_disk_change+0x1f/0x76
 [88111399] :cdrom:cdrom_open+0x92c/0x979
 [8000c9c9] do_lookup+0x65/0x1c3
 [8000cec4] dput+0x2c/0x114
 [8000a2e0] __link_path_walk+0xdf8/0xf42
 [8002c86b] mntput_no_expire+0x19/0x89
 [8000e630] link_path_walk+0xd3/0xe5
 [80061c7e] do_nanosleep+0x47/0x70
 [800560a3] kobject_get+0x12/0x17
 [80138bbd] get_disk+0x3f/0x81
 [80058ef8] exact_lock+0xc/0x14
 [801ad98f] kobj_lookup+0x132/0x19b
 [8811fe4b] :ide_cd:idecd_open+0x9f/0xd1
 [800da1bc] do_open+0xa2/0x30f
 [80011d83] may_open+0x58/0x21e
 [800da67d] blkdev_open+0x0/0x4f
 [800da6a0] blkdev_open+0x23/0x4f
 [8001e0f4] __dentry_open+0xd9/0x1dc
 [800270dc] do_filp_open+0x2a/0x38
 [80061c7e] do_nanosleep+0x47/0x70
 [80019523] do_sys_open+0x44/0xbe
 [8005b28d] tracesys+0xd5/0xe0
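
The two soft lockup traces above appear to differ only in a few interrupt frames. A minimal sketch for listing which symbols show up in one trace but not the other, assuming the trace text is available as plain strings in the format shown here (the helper names `parse_frames` and `extra_symbols` are hypothetical, and the sample strings are short excerpts of the traces above):

```python
import re

# Matches frames such as "[800b50fa] softlockup_tick+0xd5/0xe7" and
# "[8811f198] :ide_cd:cdrom_queue_packet_command+0x46/0xe2".
FRAME_RE = re.compile(
    r"\[([0-9a-f]+)\]\s+(?::(\w+):)?(\w+)\+0x[0-9a-f]+/0x[0-9a-f]+"
)

def parse_frames(trace_text):
    """Return a list of (module, symbol) pairs; module is None for core kernel symbols."""
    return [(module or None, symbol)
            for _addr, module, symbol in FRAME_RE.findall(trace_text)]

def extra_symbols(first_text, second_text):
    """Return symbols present in the second trace but absent from the first, in order."""
    seen = {symbol for _module, symbol in parse_frames(first_text)}
    return [symbol for _module, symbol in parse_frames(second_text)
            if symbol not in seen]

FIRST = """
 IRQ  [800b50fa] softlockup_tick+0xd5/0xe7
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 [800076b2] ide_outb+0x0/0x9
 [80011ca9] __do_softirq+0x53/0xd5
 [8811f198] :ide_cd:cdrom_queue_packet_command+0x46/0xe2
"""

SECOND = """
 IRQ  [800b50fa] softlockup_tick+0xd5/0xe7
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 [800076b2] ide_outb+0x0/0x9
 [80010792] handle_IRQ_event+0x1b/0x58
 [800b5482] __do_IRQ+0xa4/0x105
 [8006a3bd] do_IRQ+0xe7/0xf5
 [8005b615] ret_from_intr+0x0/0xa
 [80011ca9] __do_softirq+0x53/0xd5
"""

if __name__ == "__main__":
    print(extra_symbols(FIRST, SECOND))
```

On these excerpts the difference is the hardware interrupt path (`handle_IRQ_event`, `__do_IRQ`, `do_IRQ`, `ret_from_intr`) nested inside the second trace.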

BUG: soft lockup detected on CPU#0!

Call Trace:
 IRQ  

Re: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...

2008-11-27 Thread Avi Kivity

Xu, Jiajun wrote:

On Wednesday, November 26, 2008 8:43 PM [EMAIL PROTECTED] wrote:

  

Xu, Jiajun wrote:


The call trace messages in guest:
###
Kernel BUG at block/elevator.c:560
invalid opcode:  [1] SMP
last sysfs file: /block/hda/removable

  

This suggests something happened to /dev/hda.  Could be a timeout or
something.  Are there any messages in the log before the BUG?



There are two sets of call traces.
First, when I do live migration, the original guest shows some call 
traces:

BUG: soft lockup detected on CPU#0!

Call Trace:
 IRQ  [800b50fa] softlockup_tick+0xd5/0xe7
 [800930e2] update_process_times+0x42/0x68
 [800746e3] smp_local_timer_interrupt+0x23/0x47
 [80074da5] smp_apic_timer_interrupt+0x41/0x47
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 [800076b2] ide_outb+0x0/0x9
 [80011ca9] __do_softirq+0x53/0xd5
 [8005c2fc] call_softirq+0x1c/0x28
 [8006a53a] do_softirq+0x2c/0x85
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 EOI  [800076b2] ide_outb+0x0/0x9
 [800076ba] ide_outb+0x8/0x9
 [8000eac7] ide_do_request+0x408/0x787
 [80134f37] elv_insert+0xd6/0x1f7
 [8003f5af] ide_do_drive_cmd+0xc8/0x11e
 [8005857f] ide_outsw+0x0/0x9
 [8811f198] :ide_cd:cdrom_queue_packet_command+0x46/0xe2
 [80058587] ide_outsw+0x8/0x9
 [801c127a] ide_init_drive_cmd+0x10/0x24
 [8811f462] :ide_cd:cdrom_check_status+0x62/0x71
 [801360b5] blk_end_sync_rq+0x0/0x2e
 [8811f48d] :ide_cd:ide_cdrom_check_media_change_real+0x1c/0x37
 [8810d076] :cdrom:media_changed+0x44/0x74
 [800d9b83] check_disk_change+0x1f/0x76
 [88111399] :cdrom:cdrom_open+0x92c/0x979
 [8000c9c9] do_lookup+0x65/0x1c3
 [8000cec4] dput+0x2c/0x114
 [8000a2e0] __link_path_walk+0xdf8/0xf42
 [8002c86b] mntput_no_expire+0x19/0x89
 [8000e630] link_path_walk+0xd3/0xe5
 [80061c7e] do_nanosleep+0x47/0x70
 [800560a3] kobject_get+0x12/0x17
 [80138bbd] get_disk+0x3f/0x81
 [80058ef8] exact_lock+0xc/0x14
 [801ad98f] kobj_lookup+0x132/0x19b
 [8811fe4b] :ide_cd:idecd_open+0x9f/0xd1
 [800da1bc] do_open+0xa2/0x30f
 [80011d83] may_open+0x58/0x21e
 [800da67d] blkdev_open+0x0/0x4f
 [800da6a0] blkdev_open+0x23/0x4f
 [8001e0f4] __dentry_open+0xd9/0x1dc
 [800270dc] do_filp_open+0x2a/0x38
 [80061c7e] do_nanosleep+0x47/0x70
 [80019523] do_sys_open+0x44/0xbe
 [8005b28d] tracesys+0xd5/0xe0
  


Anthony, could migration be starving qemu_mutex and thus the guest?



--
error compiling committee.c: too many arguments to function

--
To unsubscribe from this list: send the line unsubscribe kvm in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...

2008-11-26 Thread Avi Kivity

Xu, Jiajun wrote:

The call trace messages in guest:
###
Kernel BUG at block/elevator.c:560
invalid opcode:  [1] SMP 
last sysfs file: /block/hda/removable
  


This suggests something happened to /dev/hda.  Could be a timeout or 
something.  Are there any messages in the log before the BUG?
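
One way to answer that question is to pull the lines that precede the BUG marker out of the guest log. A minimal sketch, assuming the log has already been read into a list of lines; the function name `lines_before_bug`, the default context size, and the placeholder sample lines are illustrative and not taken from the report:

```python
def lines_before_bug(log_lines, marker="Kernel BUG at", context=20):
    """Return up to `context` lines that precede the first line containing `marker`.

    Returns an empty list when the marker does not occur.
    """
    for index, line in enumerate(log_lines):
        if marker in line:
            return log_lines[max(0, index - context):index]
    return []

if __name__ == "__main__":
    # Placeholder log: only the last two lines come from the report.
    sample = [
        "earlier message 1 (placeholder)",
        "earlier message 2 (placeholder)",
        "earlier message 3 (placeholder)",
        "Kernel BUG at block/elevator.c:560",
        "invalid opcode:  [1] SMP",
    ]
    for line in lines_before_bug(sample, context=2):
        print(line)
```

With a real log, the list could come from reading the guest's kernel log file line by line; where that file lives depends on the guest's syslog setup.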



--
error compiling committee.c: too many arguments to function



RE: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...

2008-11-24 Thread Xu, Jiajun
On Monday, November 24, 2008 12:57 AM Avi Kivity wrote:

 Xu, Jiajun wrote:
 2. failure to migrate guests with more than 4GB of RAM
 
 https://sourceforge.net/tracker/index.php?func=detail&aid=1971512&group_id=180599&atid=893831
 
 
 
 Can you retest this?  I successfully migrated a 5G guest (from a 4G
 host to itself; slo...)/

I tried the latest commit, userspace.git 6e63ba19476753595e508713eb9daf559dc50bf6,
with a 64-bit RHEL5.1 guest. My host kernel is 2.6.26.2, and the host has 8GB of
memory and 4GB of swap.
The guest can be live migrated, but afterwards it prints a call trace.

Maybe we can compare our environments.

My steps were as follows:
1. qemu-system-x86_64 -incoming tcp:localhost: -m 4096  -net 
nic,macaddr=00:16:3e:44:1a:a6,model=rtl8139 -net tap,script=/etc/kvm/qemu-ifup 
-hda /share/xvs/var/rhel5u1.img
2. qemu-system-x86_64  -m 4096 -net nic,macaddr=00:16:3e:44:1a:a6,model=rtl8139 
-net tap,script=/etc/kvm/qemu-ifup -hda /share/xvs/var/rhel5u1.img
3. In the qemu console, type: migrate tcp:localhost:
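
The steps above can be sketched as a small dry-run helper that only prints the command lines, which makes it easier to compare setups. The options are copied from the steps; the port is not given in this message, so `port` is a placeholder argument, and the function name `migration_commands` is hypothetical:

```python
import shlex

# Options shared by the destination and source guests, copied from the steps above.
COMMON_ARGS = [
    "-m", "4096",
    "-net", "nic,macaddr=00:16:3e:44:1a:a6,model=rtl8139",
    "-net", "tap,script=/etc/kvm/qemu-ifup",
    "-hda", "/share/xvs/var/rhel5u1.img",
]

def migration_commands(port):
    """Return (destination_cmd, source_cmd, monitor_cmd) as strings.

    `port` is a placeholder: the report does not state the port number.
    Nothing is executed; the strings are only built for inspection.
    """
    uri = f"tcp:localhost:{port}"
    destination = ["qemu-system-x86_64", "-incoming", uri] + COMMON_ARGS
    source = ["qemu-system-x86_64"] + COMMON_ARGS
    return shlex.join(destination), shlex.join(source), f"migrate {uri}"

if __name__ == "__main__":
    # "PORT" stands in for the unspecified port number.
    for command in migration_commands("PORT"):
        print(command)
```

The destination guest (started with `-incoming`) has to be running before `migrate` is typed in the source guest's monitor, which matches the order of steps 1 to 3.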

The call trace messages in guest:
###
Kernel BUG at block/elevator.c:560
invalid opcode:  [1] SMP 
last sysfs file: /block/hda/removable
CPU 0 
Modules linked in: ipv6 autofs4 hidp rfcomm l2cap bluetooth sunrpc iscsi_tcp
ib_iser libiscsi scsi_transport_iscsi rdma_ucm ib_ucm ib_srp ib_sdp rdma_cm
ib_cm iw_cm ib_addr ib_local_sa ib_ipoib ib_sa ib_uverbs ib_umad ib_mad ib_core
dm_mirror dm_multipath dm_mod video sbs backlight i2c_ec i2c_core button
battery asus_acpi acpi_memhotplug ac lp floppy pcspkr serio_raw 8139cp 8139too
parport_pc parport mii ide_cd cdrom ata_piix libata sd_mod scsi_mod ext3 jbd
ehci_hcd ohci_hcd uhci_hcd
Pid: 0, comm: swapper Not tainted 2.6.18-53.el5 #1
RIP: 0010:[80134673]  [80134673] elv_dequeue_request+0x8/0x3c
RSP: 0018:8040ddc0  EFLAGS: 00010046
RAX: 0001 RBX: 81011381b398 RCX: 
RDX: 81011381b398 RSI: 81011381b398 RDI: 81011fb912c0
RBP: 804abe18 R08: 80304108 R09: 0012
R10: 0022 R11:  R12: 
R13: 0001 R14: 0086 R15: 8040deb8
FS:  () GS:80396000() knlGS:
CS:  0010 DS: 0018 ES: 0018 CR0: 8005003b
CR2: 2ad6f4d0 CR3: 0001126cc000 CR4: 06e0
Process swapper (pid: 0, threadinfo 803c6000, task 802dcae0)
Stack:  8000ae3c 804abe18 804abe50 
 804abd00 0246 8003ba73 8003ba0c
 804abe18 81011fbe5800 8000d2a5 81011fb8c5c0
Call Trace:
 IRQ  [8000ae3c] ide_end_request+0xc6/0xfc
 [8003ba73] ide_dma_intr+0x67/0xab
 [8003ba0c] ide_dma_intr+0x0/0xab
 [8000d2a5] ide_intr+0x16f/0x1df
 [800107a0] handle_IRQ_event+0x29/0x58
 [800b5482] __do_IRQ+0xa4/0x105
 [8006a3bd] do_IRQ+0xe7/0xf5
 [8005b615] ret_from_intr+0x0/0xa
 [80011ca9] __do_softirq+0x53/0xd5
 [8005c2fc] call_softirq+0x1c/0x28
 [8006a53a] do_softirq+0x2c/0x85
 [80068d0e] default_idle+0x0/0x50
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 EOI  [80068d37] default_idle+0x29/0x50
 [80046f8d] cpu_idle+0x95/0xb8
 [803d1806] start_kernel+0x220/0x225
 [803d1237] _sinittext+0x237/0x23e


Code: 0f 0b 68 25 50 29 80 c2 30 02 48 8b 46 08 48 89 42 08 48 89 
RIP  [80134673] elv_dequeue_request+0x8/0x3c
 RSP 8040ddc0
 Kernel panic - not syncing: Fatal exception
 BUG: warning at kernel/panic.c:137/panic() (Not tainted)

Call Trace:
 IRQ  [8008ccca] panic+0x1e3/0x1f4
 [80196ae8] do_unblank_screen+0x1b/0x132
 [800631aa] oops_end+0x51/0x53
 [80069689] die+0x3a/0x44
 [80069c37] do_invalid_op+0xad/0xb7
 [80134673] elv_dequeue_request+0x8/0x3c
 [80092dd4] do_timer+0x2e8/0x53c
 [8006c0cc] main_timer_handler+0x23d/0x3f4
 [8005bde9] error_exit+0x0/0x84
 [80134673] elv_dequeue_request+0x8/0x3c
 [8000ae3c] ide_end_request+0xc6/0xfc
 [8003ba73] ide_dma_intr+0x67/0xab
 [8003ba0c] ide_dma_intr+0x0/0xab
 [8000d2a5] ide_intr+0x16f/0x1df
 [800107a0] handle_IRQ_event+0x29/0x58
 [800b5482] __do_IRQ+0xa4/0x105
 [8006a3bd] do_IRQ+0xe7/0xf5
 [8005b615] ret_from_intr+0x0/0xa
 [80011ca9] __do_softirq+0x53/0xd5
 [8005c2fc] call_softirq+0x1c/0x28
 [8006a53a] do_softirq+0x2c/0x85
 [80068d0e] default_idle+0x0/0x50
 [8005bc8e] apic_timer_interrupt+0x66/0x6c
 EOI  [80068d37] default_idle+0x29/0x50
 [80046f8d] cpu_idle+0x95/0xb8
 [803d1806] start_kernel+0x220/0x225
 [803d1237] _sinittext+0x237/0x23e

BUG: warning at drivers/input/serio/i8042.c:846/i8042_panic_blink() (Not tainted)

Call Trace:
 IRQ  [801ee9b8] 

Re: Weekly KVM Test report, kernel 30d95f ... userspace fc94d1 ...

2008-11-23 Thread Avi Kivity

Xu, Jiajun wrote:

2. failure to migrate guests with more than 4GB of RAM
https://sourceforge.net/tracker/index.php?func=detail&aid=1971512&group_id=180599&atid=893831

  


Can you retest this?  I successfully migrated a 5G guest (from a 4G host 
to itself; slo...)/


--
error compiling committee.c: too many arguments to function
