Hello, 

I have been running a series of kexec tests using LKDTT on the 
aacraid driver on this card (ASR-4805SAS (Marauder-E)) on x86_64
using the latest top of scsi-misc git-tree(as of yesterday), and 
I have found that it is not coming up consistantly when booted 
through kexec.

I have included 4 different types of failures I found here because 
I assume they might be related, and thought maybe there could 
be an issue with the card's state on reboot (through kexec).

The most common problem is this oops/panic, which has happened 
with various types of crash points (6 times out of 40):

Loading aacraid.Adaptec aacraid driver (1.1-5[2437]-mh4)^M
ko module^M
ACPI: PCI Interrupt 0000:03:0e.0[A] -> Link [LNKC] -> GSI 3 (level, low) -> IRQ 
3^M
general protection fault: 0000 [1] ^M
CPU 0 ^M
Modules linked in: aacraid^M
Pid: 0, comm: swapper Not tainted 2.6.21-rc3-kdump #1^M
RIP: 0010:[<ffffffff88008a99>]  [<ffffffff88008a99>] 
:aacraid:aac_intr_normal+0x17a/0x1b1^M
RSP: 0000:ffffffff81523ed8  EFLAGS: 00010006^M
RAX: ffff810004102000 RBX: ffff8100014f01e0 RCX: 0000000000000086^M
RDX: ffff810004041540 RSI: ffff8100014f01e0 RDI: cccccccccccccccc^M
RBP: ffff810004702cd8 R08: 00000000a6037e6c R09: 00000016001562d7^M
R10: 0000000000000023 R11: 0000000000000000 R12: 0000000000000011^M
R13: ffff810004702cd8 R14: ffff810004001400 R15: 0000000000000000^M
FS:  0000000000000000(0000) GS:ffffffff814d5000(0000) knlGS:0000000000000000^M
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b^M
CR2: 00000000006ba5a0 CR3: 000000000474d000 CR4: 00000000000006e0^M
Process swapper (pid: 0, threadinfo ffffffff814e4000, task ffffffff81470360)^M
Stack:  0000000000000011 ffff810004702cd8 0000000000000100 0000000000000003^M
 0000000000000001 ffffffff88009470 0000000000000000 ffff810004041540^M
 ffffffff814d5080 ffffffff810428f4 0000000000000000 ffffffff814d5080^M
Call Trace:^M
 <IRQ>  [<ffffffff88009470>] :aacraid:aac_rx_intr_message+0x2c/0x60^M
 [<ffffffff810428f4>] note_interrupt+0xd3/0x1db^M
 [<ffffffff8104319b>] handle_level_irq+0x7e/0xab^M
 [<ffffffff8100b0b1>] do_IRQ+0xd7/0x132^M
 [<ffffffff810085a1>] mwait_idle+0x0/0x43^M
 [<ffffffff81009651>] ret_from_intr+0x0/0xa^M
 <EOI>  [<ffffffff810085e0>] mwait_idle+0x3f/0x43^M
 [<ffffffff81008540>] cpu_idle+0x3d/0x5c^M
 [<ffffffff814e78d2>] start_kernel+0x28f/0x29b^M
 [<ffffffff814e7140>] _sinittext+0x140/0x144^M
^M
^M
Code: ff 53 38 eb 20 9c 58 fa 83 7b 30 00 75 07 c7 43 30 01 00 00 ^M
RIP  [<ffffffff88008a99>] :aacraid:aac_intr_normal+0x17a/0x1b1^M
Kernel panic - not syncing: Aiee, killing interrupt handler!^M
 

Another failure:   for crash point 'TIMERADD-bug' I got this error 
loading insmod:

Loading aacraid.Adaptec aacraid driver (1.1-5[2437]-mh4)^M
ko module^M
ACPI: PCI Interrupt 0000:03:0e.0[A] -> Link [LNKC] -> GSI 3 (level, low) -> IRQ 
3^M
input: ImExPS/2 Generic Explorer Mouse as /class/input/input3^M
aacraid: aac_fib_send: adapter blinkLED 0xc2.^M
Usually a result of a serious unrecoverable hardware problem^M
aac_fib_free, XferState != 0, fibptr = 0xffff8100014f0000, XferState = 0x810ad^M
aacraid: probe of 0000:03:0e.0 failed with error -14^M


Yet another Failure: for crash point 'TIMERADD-panic' I got this error 
during insmod:

Loading aacraid.Adaptec aacraid driver (1.1-5[2437]-mh4)^M
ko module^M
ACPI: PCI Interrupt 0000:03:0e.0[A] -> Link [LNKC] -> GSI 3 (level, low) -> IRQ 
3^M
input: ImExPS/2 Generic Explorer Mouse as /class/input/input3^M
Ecr^H ^H^H ^H^H ^HBUG: soft lockup detected on CPU#0!^M
^M
Call Trace:^M
 <IRQ>  [<ffffffff8102bcbb>] update_process_times+0x3b/0x5f^M
 [<ffffffff8100bebf>] main_timer_handler+0x2f/0x1ae^M
 [<ffffffff8102b504>] run_timer_softirq+0x14/0x161^M
 [<ffffffff8100c050>] timer_interrupt+0x12/0x27^M
 [<ffffffff81041f9c>] handle_IRQ_event+0x25/0x53^M
 [<ffffffff81028c1b>] __do_softirq+0x46/0x90^M
 [<ffffffff81043186>] handle_level_irq+0x69/0xab^M
 [<ffffffff8100b0b1>] do_IRQ+0xd7/0x132^M
 [<ffffffff81009651>] ret_from_intr+0x0/0xa^M
 <EOI>  [<ffffffff811229ed>] __delay+0x8/0x10^M
 [<ffffffff88007c68>] :aacraid:aac_fib_send+0x1ba/0x234^M
 [<ffffffff880048aa>] :aacraid:aac_get_adapter_info+0x76/0x536^M
 [<ffffffff88002bb3>] :aacraid:aac_probe_one+0x236/0x457^M
 [<ffffffff8112bd6d>] pci_device_probe+0x4c/0x75^M
 [<ffffffff8117d0da>] really_probe+0xc4/0x148^M
 [<ffffffff8117d30b>] __driver_attach+0x6d/0xab^M
 [<ffffffff8117d29e>] __driver_attach+0x0/0xab^M
 [<ffffffff8117d29e>] __driver_attach+0x0/0xab^M
 [<ffffffff8117c5b2>] bus_for_each_dev+0x43/0x6e^M
 [<ffffffff8117c8f4>] bus_add_driver+0x6b/0x18d^M
 [<ffffffff8112bf0b>] __pci_register_driver+0x72/0xa7^M
 [<ffffffff8801203a>] :aacraid:aac_init+0x3a/0x75^M
 [<ffffffff8103bafc>] sys_init_module+0x1195/0x12e6^M
 [<ffffffff8100913e>] system_call+0x7e/0x83^M
^M
BUG: soft lockup detected on CPU#0!^M

One last error I got for INT_TASKLET_ENTRY-exception was this
after the filesystem is mounted and I am copying the vmcore 
file to it:

Copying the dump
aacraid: Host adapter abort request (4,0,0,0)
aacraid: Host adapter abort request (4,0,0,0)
aacraid: Host adapter reset request. SCSI hang ?
[-- MARK -- Tue Mar 27 15:30:00 2007]
sd 4:0:0:0: [sdc] 143132672 512-byte hardware sectors (73284 MB)
sd 4:0:0:0: [sdc] Assuming Write Enabled
sd 4:0:0:0: [sdc] Assuming drive cache: write through
EXT3-fs error (device sdc1): ext3_new_block: Allocating block in system
zone - blocks from 1802240, length 1
EXT3-fs error (device sdc1): ext3_new_block: Allocating block in system
zone - blocks from 1802241, length 1
journal_bmap: journal block not found at offset 2184 on sdc1
Aborting journal on device sdc1.
ext3_abort called.
EXT3-fs error (device sdc1): ext3_journal_start_sb: Detected aborted journal
Remounting filesystem read-only
EXT3-fs error (device sdc1): ext3_free_blocks: Freeing blocks in system
zones - Block = 1802241, count = 1
EXT3-fs error (device sdc1) in ext3_free_blocks_sb: Journal has aborted
/bin/dd: writing to `/dump/dumpfile': Read-only file system
13190265+0 r__journal_remove_journal_head: freeing b_committed_data
ecords in
13190__journal_remove_journal_head: freeing b_frozen_data
264+0 records out
6753415168 bytes (6.8 GB) copied, 745.436 s, 9.1 MB/s
__journal_remove_journal_head: freeing b_frozen_data
__journal_remove_journal_head: freeing b_committed_data
__journal_remove_journal_head: freeing b_frozen_data
__journal_remove_journal_head: freeing b_frozen_data
__journal_remove_journal_head: freeing b_frozen_data


That file size should be 8.5G.

Thanks,
Judith
-
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to