All,

I saw TB's patch posts to the kernel-raid mailing list. I believe I experienced the same systemd-mdadm error which resulted in ~440 libraries being left as 0-byte files and pacman losing all reference to what files where owned by what packages. In my case dmesg showed the system never made any attempt to activate sda7 as part of the root array (md1) on boot. Following a --fail, --remove and --add to re-add sda7 worked as advertised. However, on next reboot, grub was found and launched the kernel, only to have the system crash due to /usr/lib/libkmod.so "file too short" (0-byte).

  Attempting to re-install the kernel to attempt to address the issue resulted 
in:

[2015-08-31 15:58] [ALPM-SCRIPTLET] ==> WARNING: No modules were added to the image. This is probably not what you want. [2015-08-31 15:58] [ALPM-SCRIPTLET] ==> Creating gzip-compressed initcpio image: /boot/initramfs-linux.img [2015-08-31 15:58] [ALPM-SCRIPTLET] ==> WARNING: errors were encountered during the build. The image may not be complete. [2015-08-31 15:58] [ALPM-SCRIPTLET] ==> Building image from preset: /etc/mkinitcpio.d/linux.preset: 'fallback' [2015-08-31 15:58] [ALPM-SCRIPTLET] -> -k /boot/vmlinuz-linux -c /etc/mkinitcpio.conf -g /boot/initramfs-linux-fallback.img -S autodetect
[2015-08-31 15:58] [ALPM-SCRIPTLET] ==> Starting build: 4.1.6-1-ARCH
[2015-08-31 15:58] [ALPM-SCRIPTLET]   -> Running build hook: [base]
[2015-08-31 15:58] [ALPM-SCRIPTLET]   -> Running build hook: [udev]
[2015-08-31 15:58] [ALPM-SCRIPTLET]   -> Running build hook: [modconf]
[2015-08-31 15:58] [ALPM-SCRIPTLET]   -> Running build hook: [block]
[2015-08-31 15:58] [ALPM-SCRIPTLET] ==> ERROR: module not found: `eata'
[2015-08-31 15:58] [ALPM-SCRIPTLET] ==> ERROR: module not found: `a100u2w'
[2015-08-31 15:58] [ALPM-SCRIPTLET] ==> ERROR: module not found: `3w_sas'
[2015-08-31 15:58] [ALPM-SCRIPTLET] ==> ERROR: module not found: `bnx2fc'
<snip> lots of modules not found

Booting the install media, chrooting and investigating revealed some 440 libraries in /usr/lib that were 0-byte. pacman had also lost its package-file database. E.g.

  # pacman -Ql unixodbc
  # (showed no files)

all of this occurred with mdadm 3.3.2-2, which is still the current package.

  TB's original post at the kernel raid list regarding the patch is:

[PATCH 0/2] mdadm: Fix udev rule interaction with systemd,
http://www.spinics.net/lists/raid/msg49678.html

  The explanation of the losses on my system posted to the list are:

Re-add of raid1 drive resulted in strange loss of data on Archlinux?
http://www.spinics.net/lists/raid/msg49704.html

Are these the symptoms (or related symptoms) that the TB's patch is intended to address?

This has only occurred once since the current mdadm was installed back on 2015-05-26. What is the recommended work-around until a fix is in place? (validate all arrays with /proc/mdstat) But then how to handle a degraded boot? I did --fail, --remove --add, and ended up with a mess. Is there some other suggested way to handle this?


--
David C. Rankin, J.D.,P.E.

Reply via email to