Public bug reported:

Binary package hint: udev

My system was correctly booting from a mirrored PATA (via libata) RAID 1
until one of the disks was removed.  I then hit the "degraded mode
doesn't boot" problem for a while.  This all happened under Gutsy (kept
up to date daily.)

After that I replaced the missing disk, while the system was powered off, and 
then:
 * booted with 'break=mount' on the kernel command line.
 * waited until detection of hardware had completed.
 * ran 'exec mdadm -As' to detect the degraded RAID and continue to boot
 * ran 'sfdisk -d /dev/sda | sfdisk /dev/sdb' to partition the new disk
 * ran 'mdadm -a /dev/md0 /dev/sdb1'
 * ran 'mdadm -a /dev/md1 /dev/sdb2'
 * waited for the system to resync the data
 * checked that /proc/mdstat showed all healthy
 * rebooted

At this point I expected, naturally, to have the system boot cleanly
without any problems or delay.  Instead the system simply ground to a
halt after the three minute boot timeout without the RAID detected.

After some investigation this looks, to me, like a problem with
identification of the use of the device components.

The udev rules for mdadm depend on ENV{ID_FS_TYPE}=="linux_raid*" to run
mdadm at all; for my RAID components I get the following details:

UDEV  [1187654052.620855] add      /block/sda/sda1 (block)
UDEV_LOG=3
ACTION=add
DEVPATH=/block/sda/sda1
SUBSYSTEM=block
SEQNUM=1750
MINOR=1
MAJOR=8
PHYSDEVPATH=/devices/pci0000:00/0000:00:1f.1/host0/target0:0:0/0:0:0:0
PHYSDEVBUS=scsi
PHYSDEVDRIVER=sd
UDEVD_EVENT=1
DEVTYPE=partition
ID_VENDOR=ATA
ID_MODEL=SAMSUNG_MP0804H
ID_REVISION=UE10
ID_SERIAL=1ATA_SAMSUNG_MP0804H_S042J10Y257961
ID_SERIAL_SHORT=ATA_SAMSUNG_MP0804H_S042J10Y257961
ID_TYPE=disk
ID_BUS=scsi
ID_ATA_COMPAT=SAMSUNG_MP0804H_S042J10Y257961
ID_PATH=pci-0000:00:1f.1-scsi-0:0:0:0
ID_FS_USAGE=filesystem
ID_FS_TYPE=ext3
ID_FS_VERSION=1.0
ID_FS_UUID=f93b5509-6e68-4f2f-9d2f-fcff7a2dfb19
ID_FS_UUID_ENC=f93b5509-6e68-4f2f-9d2f-fcff7a2dfb19
ID_FS_LABEL=enki-root
ID_FS_LABEL_ENC=enki-root
ID_FS_LABEL_SAFE=enki-root
DEVNAME=/dev/sda1
DEVLINKS=/dev/disk/by-id/scsi-1ATA_SAMSUNG_MP0804H_S042J10Y257961-part1 
/dev/disk/by-id/ata-SAMSUNG_MP0804H_S042J10Y257961-part1 
/dev/disk/by-path/pci-0000:00:1f.1-scsi-0:0:0:0-part1 
/dev/disk/by-uuid/f93b5509-6e68-4f2f-9d2f-fcff7a2dfb19 
/dev/disk/by-label/enki-root

Note that the 'ID_FS_TYPE' value is ext3, the file system in the RAID
array, rather than identifying this disk as part of a RAID array.

The same misidentification is present for the swap RAID1 and the other
component; I can supply logs showing that if it matters.

The RAID array itself is a healthy RAID1 with version 1.0 metadata:

[EMAIL PROTECTED]:~$ cat /proc/mdstat
Personalities : [linear] [multipath] [raid0] [raid1] [raid6] [raid5] [raid4] 
[raid10]
md1 : active raid1 sda2[0] sdb2[1]
      1975984 blocks super 1.0 [2/2] [UU]

md0 : active raid1 sdb1[0] sda1[2]
      76204224 blocks super 1.0 [2/2] [UU]

unused devices: <none>

[EMAIL PROTECTED]:~$ sudo mdadm -D /dev/md0
[sudo] password for daniel:
/dev/md0:
        Version : 01.00.03
  Creation Time : Wed May 16 01:08:07 2007
     Raid Level : raid1
     Array Size : 76204224 (72.67 GiB 78.03 GB)
  Used Dev Size : 152408448 (72.67 GiB 78.03 GB)
   Raid Devices : 2
  Total Devices : 2
Preferred Minor : 0
    Persistence : Superblock is persistent

    Update Time : Tue Aug 21 10:21:04 2007
          State : active
 Active Devices : 2
Working Devices : 2
 Failed Devices : 0
  Spare Devices : 0

           Name : enki-root
           UUID : 3a6b05ca:c218089c:f9df5f76:731157e8
         Events : 802619

    Number   Major   Minor   RaidDevice State
       0       8       17        0      active sync   /dev/sdb1
       2       8        1        1      active sync   /dev/sda1

However, one very odd factor in this:

[EMAIL PROTECTED]:~$ sudo mdadm -E /dev/sda1
/dev/sda1:
          Magic : a92b4efc
        Version : 01
    Feature Map : 0x0
     Array UUID : 3a6b05ca:c218089c:f9df5f76:731157e8
           Name : enki-root
  Creation Time : Wed May 16 01:08:07 2007
     Raid Level : raid1
   Raid Devices : 2

  Used Dev Size : 152408448 (72.67 GiB 78.03 GB)
     Array Size : 152408448 (72.67 GiB 78.03 GB)
   Super Offset : 152408576 sectors
          State : active
    Device UUID : fc1de0a0:88071249:b753809d:3cd1beee

    Update Time : Tue Aug 21 10:21:24 2007
       Checksum : 655ae10b - correct
         Events : 802619


    Array Slot : 2 (0, failed, 1)
   Array State : uU 1 failed
[EMAIL PROTECTED]:~$ sudo mdadm -E /dev/sdb1
/dev/sdb1:
          Magic : a92b4efc
        Version : 01
    Feature Map : 0x0
     Array UUID : 3a6b05ca:c218089c:f9df5f76:731157e8
           Name : enki-root
  Creation Time : Wed May 16 01:08:07 2007
     Raid Level : raid1
   Raid Devices : 2

  Used Dev Size : 152408448 (72.67 GiB 78.03 GB)
     Array Size : 152408448 (72.67 GiB 78.03 GB)
   Super Offset : 152408576 sectors
          State : active
    Device UUID : 71575927:ef34548e:5693969d:1b8f6631

    Update Time : Tue Aug 21 10:21:26 2007
       Checksum : 73d44564 - correct
         Events : 802619


    Array Slot : 0 (0, failed, 1)
   Array State : Uu 1 failed


It looks like the MD device has a happy RAID1 header but the individual 
components have metadata that indicates that /both/ of them are part of a 
failed RAID array.

In any case the misidentification of the component use means that mdadm
is never called on my system.

Please let me know if I can assist in debugging this further or in
providing any specific testing.  I have no concerns building whatever
tools with debugging, etc, as needed.

** Affects: udev (Ubuntu)
     Importance: Undecided
         Status: New

-- 
[gutsy] partitions no longer detected as RAID components after repairing 
degraded RAID 1 mirror
https://bugs.launchpad.net/bugs/133773
You received this bug notification because you are a member of Ubuntu
Bugs, which is the bug contact for Ubuntu.

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to