Bug#410817: linux-image-2.6.18-3-686: new megaraid driver fails to detect logical drive

2007-02-13 Thread Mark Weaver
Package: linux-image-2.6.18-3-686
Version: 2.6.18-7
Severity: critical
Justification: breaks the whole system

This is the relevant portion of dmesg from the sarge kernel:

Feb 11 06:52:03 clap kernel: Linux version 2.6.8-3-686-smp ([EMAIL PROTECTED]) (gcc version 3.3.5 (Debian 1:3.3.5-13)) #1 SMP Tue Dec 5 23:17:50 UTC 2006
Feb 11 06:52:03 clap kernel: SCSI subsystem initialized
Feb 11 06:52:03 clap kernel: ACPI: PCI interrupt :01:02.0[A] - GSI 17 (level, low) - IRQ 193
Feb 11 06:52:03 clap kernel: megaraid: found 0x101e:0x1960:bus 1:slot 2:func 0
Feb 11 06:52:03 clap kernel: scsi0:Found MegaRAID controller at 0xf8829000, IRQ:193
Feb 11 06:52:03 clap kernel: megaraid: [6.62:1.03] detected 1 logical drives.
Feb 11 06:52:03 clap kernel: megaraid: supports extended CDBs.
Feb 11 06:52:03 clap kernel: megaraid: channel[0] is raid.
Feb 11 06:52:03 clap kernel: megaraid: channel[1] is raid.
Feb 11 06:52:03 clap kernel: megaraid: channel[2] is raid.
Feb 11 06:52:03 clap kernel: megaraid: channel[3] is raid.
Feb 11 06:52:03 clap kernel: scsi0 : LSI Logic MegaRAID 6.62 254 commands 16 targs 7 chans 7 luns
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 0 for logical drives.
Feb 11 06:52:03 clap kernel:   Vendor: MegaRAID  Model: LD 0 RAID1  228G  Rev: 6.62
Feb 11 06:52:03 clap kernel:   Type:   Direct-Access  ANSI SCSI revision: 02
Feb 11 06:52:03 clap kernel: SCSI device sda: 468697088 512-byte hdwr sectors (239973 MB)
Feb 11 06:52:03 clap kernel: sda: asking for cache data failed
Feb 11 06:52:03 clap kernel: sda: assuming drive cache: write through
Feb 11 06:52:03 clap kernel:  /dev/scsi/host0/bus0/target0/lun0: p1
Feb 11 06:52:03 clap kernel: Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 1 for logical drives.
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 2 for logical drives.
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 4 [P0] for physical devices.
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 5 [P1] for physical devices.
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 6 [P2] for physical devices.
Feb 11 06:52:03 clap kernel: scsi0: scanning scsi channel 7 [P3] for physical devices.

And this is from the etch kernel:
Feb 11 00:35:24 clap kernel: megaraid: 2.20.4.9 (Release Date: Sun Jul 16 12:27:22 EST 2006)
Feb 11 00:35:24 clap kernel: megaraid: probe new device 0x101e:0x1960:0x1028:0x0511: bus 1:slot 2:func 0
Feb 11 00:35:24 clap kernel: ACPI: PCI Interrupt :01:02.0[A] - GSI 17 (level, low) - IRQ 177
Feb 11 00:35:24 clap kernel: megaraid: fw version:[6.62] bios version:[1.03]
Feb 11 00:35:24 clap kernel: scsi0 : LSI Logic MegaRAID driver
Feb 11 00:35:24 clap kernel: scsi[0]: scanning scsi channel 0 [Phy 0] for non-raid devices
Feb 11 00:35:24 clap kernel: scsi[0]: scanning scsi channel 1 [Phy 1] for non-raid devices
Feb 11 00:35:24 clap kernel: e1000: :00:02.0: e1000_probe: (PCI:33MHz:32-bit) 00:c0:9f:29:68:31
Feb 11 00:35:24 clap kernel: scsi[0]: scanning scsi channel 2 [Phy 2] for non-raid devices
Feb 11 00:35:24 clap kernel: e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network Connection
Feb 11 00:35:24 clap kernel: ACPI: PCI Interrupt :00:04.0[A] - GSI 25 (level, low) - IRQ 193
Feb 11 00:35:24 clap kernel: scsi[0]: scanning scsi channel 3 [Phy 3] for non-raid devices
Feb 11 00:35:24 clap kernel: scsi[0]: scanning scsi channel 4 [virtual] for logical drives
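
For anyone comparing similar logs, here is a rough sketch of how the difference between the two excerpts could be checked mechanically. It is only an illustration: the function name `attached_disks` is made up, and the message patterns are assumed from the sarge excerpt above, so other kernels or drivers may word them differently.

```python
import re

# Assumption: these patterns mirror the wording in the sarge excerpt;
# they are not an exhaustive list of kernel disk-attach messages.
DISK_PATTERNS = [
    re.compile(r"Attached scsi disk (\w+)"),
    re.compile(r"SCSI device (\w+):"),
]


def attached_disks(log_text):
    """Return the sorted disk names that the log reports as attached."""
    found = set()
    for line in log_text.splitlines():
        for pattern in DISK_PATTERNS:
            match = pattern.search(line)
            if match:
                found.add(match.group(1))
    return sorted(found)


# Shortened excerpts from the two boots quoted above.
sarge_log = """\
kernel: megaraid: [6.62:1.03] detected 1 logical drives.
kernel: SCSI device sda: 468697088 512-byte hdwr sectors (239973 MB)
kernel: Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
"""
etch_log = """\
kernel: megaraid: fw version:[6.62] bios version:[1.03]
kernel: scsi[0]: scanning scsi channel 4 [virtual] for logical drives
"""

print(attached_disks(sarge_log))  # ['sda']
print(attached_disks(etch_log))   # []
```

On the excerpts above, the sarge boot reports sda and the etch boot reports no disk at all, which is the symptom described in this report.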

I have put an IDE boot drive in the machine (it is in a hosting center
and I don't have any other similarly broken machines), so I can now reboot
it remotely for experiments if required, and I am happy to collect
any further information needed.

-- System Information:
Debian Release: 4.0
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.8-3-686-smp
Locale: LANG=en_GB, LC_CTYPE=en_GB (charmap=ISO-8859-1)

Versions of packages linux-image-2.6.18-3-686 depends on:
ii  coreutils 5.97-5 The GNU core utilities
ii  debconf [debconf-2.0] 1.5.11 Debian configuration management sy
ii  initramfs-tools [linux-initra 0.85e  tools for generating an initramfs
ii  module-init-tools 3.3-pre3-1 tools for managing Linux kernel mo

Versions of packages linux-image-2.6.18-3-686 recommends:
pn  libc6-i686  <none>  (no description available)

-- debconf information:
  shared/kernel-image/really-run-bootloader: true
  linux-image-2.6.18-3-686/postinst/bootloader-error-2.6.18-3-686:
  linux-image-2.6.18-3-686/postinst/old-dir-initrd-link-2.6.18-3-686: true
  linux-image-2.6.18-3-686/postinst/kimage-is-a-directory:
  linux-image-2.6.18-3-686/preinst/elilo-initrd-2.6.18-3-686: true
  linux-image-2.6.18-3-686/postinst/old-system-map-link-2.6.18-3-686: true
  linux-image-2.6.18-3-686/preinst/lilo-initrd-2.6.18-3-686: true
  


2007-02-13 Thread Slávek Banko
Try to use module megaraid_mbox, instead of module megaraid.

Slavek






2007-02-13 Thread Mark Weaver

Slávek Banko wrote:

Try to use module megaraid_mbox, instead of module megaraid.


OK, I rebooted into 2.6.18-3 and ran:

[EMAIL PROTECTED]:~# rmmod megaraid
[EMAIL PROTECTED]:~# rmmod megaraid_mbox
[EMAIL PROTECTED]:~# rmmod megaraid_mm
[EMAIL PROTECTED]:~# modprobe megaraid_mbox

[EMAIL PROTECTED]:~# lsmod |grep megaraid
megaraid_mbox  29168  0
megaraid_mm  10560  1 megaraid_mbox
scsi_mod  124168  2 megaraid_mbox,sd_mod

dmesg says:

megaraid: detaching device 0x101e:0x1960:0x1028:0x0511
megaraid cmm: Unregistered one adapter:0x110
ACPI: PCI interrupt for device :01:02.0 disabled
megaraid cmm: 2.20.2.7 (Release Date: Sun Jul 16 00:01:03 EST 2006)
megaraid: 2.20.4.9 (Release Date: Sun Jul 16 12:27:22 EST 2006)
megaraid: probe new device 0x101e:0x1960:0x1028:0x0511: bus 1:slot 2:func 0
PCI: Enabling device :01:02.0 (0110 - 0112)
ACPI: PCI Interrupt :01:02.0[A] - GSI 17 (level, low) - IRQ 193
megaraid: fw version:[6.62] bios version:[1.03]
scsi1 : LSI Logic MegaRAID driver
scsi[1]: scanning scsi channel 0 [Phy 0] for non-raid devices
scsi[1]: scanning scsi channel 1 [Phy 1] for non-raid devices
scsi[1]: scanning scsi channel 2 [Phy 2] for non-raid devices
scsi[1]: scanning scsi channel 3 [Phy 3] for non-raid devices
scsi[1]: scanning scsi channel 4 [virtual] for logical drives

i.e. the logical drive is still missing.

Then:

rmmod megaraid_mbox
rmmod megaraid_mm
[EMAIL PROTECTED]:/usr/src/linux-2.6.18.2/drivers/scsi# lsmod|grep megaraid
megaraid   35880  0
scsi_mod  124168  2 megaraid,sd_mod
megaraid: detaching device 0x101e:0x1960:0x1028:0x0511
megaraid cmm: Unregistered one adapter:0x110
ACPI: PCI interrupt for device :01:02.0 disabled

i.e. the legacy megaraid driver doesn't appear to find the hardware. The
messages in dmesg appear to come from megaraid_mbox (I have verified this
by looking at the source).


Indeed, this makes sense: my card has PCI ID 101e:1960, and the legacy
megaraid driver lists:


alias:  pci:v101Ed9010sv*sd*bc*sc*i*
alias:  pci:v101Ed9060sv*sd*bc*sc*i*
alias:  pci:v8086d1960sv*sd*bc*sc*i*

but megaraid_mbox lists:

alias:  pci:v1028d000Esv1028sd0123bc*sc*i*
alias:  pci:v1000d1960sv1028sd0520bc*sc*i*
alias:  pci:v1000d1960sv1028sd0518bc*sc*i*
alias:  pci:v1000d0407sv*sd*bc*sc*i*
alias:  pci:v1028d000Fsv1028sd014Abc*sc*i*
alias:  pci:v1028d0013sv1028sd016Cbc*sc*i*
alias:  pci:v1028d0013sv1028sd016Dbc*sc*i*
alias:  pci:v1028d0013sv1028sd016Ebc*sc*i*
alias:  pci:v1028d0013sv1028sd016Fbc*sc*i*
alias:  pci:v1028d0013sv1028sd0170bc*sc*i*
alias:  pci:v1000d0408sv*sd*bc*sc*i*
alias:  pci:v101Ed1960sv*sd*bc*sc*i*
alias:  pci:v1000d1960sv*sd*bc*sc*i*
alias:  pci:v1000d0409sv*sd*bc*sc*i*

so my card will not be picked up by the legacy driver.
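
As a sketch of the reasoning above: module aliases are shell-style glob patterns, so the comparison can be reproduced with Python's `fnmatch`. The alias strings are copied as they appear in this message. The modalias string for my card is an assumption built from the IDs in dmesg (101e:1960, subsystem 1028:0511); the class fields `bc01sc04i00` are a guess for a RAID controller and do not affect the result, since every alias here ends in `bc*sc*i*`. The helper name `matching_aliases` is made up for this example.

```python
from fnmatch import fnmatchcase

# Aliases as listed above for the legacy megaraid module.
LEGACY_ALIASES = [
    "pci:v101Ed9010sv*sd*bc*sc*i*",
    "pci:v101Ed9060sv*sd*bc*sc*i*",
    "pci:v8086d1960sv*sd*bc*sc*i*",
]

# A subset of the aliases listed above for megaraid_mbox.
MBOX_ALIASES = [
    "pci:v1000d1960sv1028sd0520bc*sc*i*",
    "pci:v101Ed1960sv*sd*bc*sc*i*",
    "pci:v1000d1960sv*sd*bc*sc*i*",
]

# Assumption: modalias for the card, written in the same abbreviated
# form as the aliases above; the class/subclass/interface part is a guess.
CARD_MODALIAS = "pci:v101Ed1960sv1028sd0511bc01sc04i00"


def matching_aliases(modalias, aliases):
    """Return the alias patterns that match the given modalias string."""
    return [alias for alias in aliases if fnmatchcase(modalias, alias)]


print(matching_aliases(CARD_MODALIAS, LEGACY_ALIASES))  # []
print(matching_aliases(CARD_MODALIAS, MBOX_ALIASES))
# ['pci:v101Ed1960sv*sd*bc*sc*i*']
```

With these inputs the legacy list yields no match, and only the wildcard 101E:1960 entry of megaraid_mbox matches, which agrees with what the kernel does on this machine.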

The legacy megaraid driver in the sarge kernel includes the 101e:1960 PCI ID,
but this has been removed in the etch kernel.  I will try hacking the PCI ID
into the legacy megaraid driver.








2007-02-13 Thread Mark Weaver

Mark Weaver wrote:

The legacy megaraid driver in the sarge kernel includes the 101e:1960 PCI ID,
but this has been removed in the etch kernel.  I will try hacking the PCI ID
into the legacy megaraid driver.



I can confirm that this works.  I am not sure whether this is the 
correct general solution or whether it's just a bug in megaraid_mbox 
that needs to be fixed.  What's the best way to proceed?


