Public bug reported:

# lsb_release -rd
Description:    Ubuntu 18.04 LTS
Release:        18.04

udisks2: 2.7.6-3
libblockdev-mdraid2: 2.16-2

Ubuntu 18.04, freshly upgraded from 16.04 via 17.10; the system ran OK for
about 72h before the issue appeared.
The filesystem on the faulty RAID is XFS, and the array is RAID 10 with
far-copies:

md3 : active raid10 sdb4[2] sda4[4]
      524155904 blocks super 1.2 512K chunks 2 far-copies [2/2] [UU]
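
For anyone who wants to check the array state from a script, here is a minimal sketch that parses an mdstat-style excerpt like the one above. The function name `parse_mdstat` and the dictionary keys are illustrative choices, not part of any existing tool, and the pattern only covers the "active" form shown in this report.

```python
import re

# Sample copied from the report; on a live system this text would
# normally be read from /proc/mdstat.
MDSTAT = """\
md3 : active raid10 sdb4[2] sda4[4]
      524155904 blocks super 1.2 512K chunks 2 far-copies [2/2] [UU]
"""


def parse_mdstat(text):
    """Return {array_name: info} for each md array in mdstat-style text."""
    arrays = {}
    current = None
    for line in text.splitlines():
        # Header line: "<name> : <state> <level> <member>[n] ..."
        head = re.match(r"^(md\d+) : (\S+) (\S+) (.*)$", line)
        if head:
            name, state, level, members = head.groups()
            current = {
                "state": state,
                "level": level,
                "members": re.findall(r"(\w+)\[\d+\]", members),
            }
            arrays[name] = current
            continue
        # Status line: "[expected/working] [UU]"; "_" marks a missing member.
        status = re.search(r"\[(\d+)/(\d+)\] \[([U_]+)\]", line)
        if status and current is not None:
            current["expected"] = int(status.group(1))
            current["working"] = int(status.group(2))
            current["healthy"] = "_" not in status.group(3)
    return arrays


info = parse_mdstat(MDSTAT)["md3"]
print(info["level"], info["members"], info["healthy"])
```

A degraded array would show fewer working devices than expected and a `_` in the bracketed status, which the sketch reports as `healthy == False`.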

What happened:

A "git commit" could not complete, and the process was impossible to
kill. The load reported by xload reached very high levels. The system
remained responsive, but it was impossible to halt: shutdown waited for
a disk sync that never completed. I had to do a hard reboot.

After the reboot, /home could not be mounted, and the boot process
dropped me to a root console. mdadm reported an ongoing reconstruction,
but it was frozen at some point. In the logs:

May  6 01:35:35 localhost kernel: [58614.835510] INFO: task xfsaild/md3:1056 blocked for more than 120 seconds.
May  6 01:35:35 localhost kernel: [58614.835521]       Tainted: G           OE    4.15.0-20-lowlatency #21-Ubuntu
May  6 01:35:35 localhost kernel: [58614.835524] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
May  6 01:35:35 localhost kernel: [58614.835529] xfsaild/md3     D    0  1056      2 0x80000000
May  6 01:35:35 localhost kernel: [58614.835535] Call Trace:
May  6 01:35:35 localhost kernel: [58614.835552]  __schedule+0x299/0x8c0
May  6 01:35:35 localhost kernel: [58614.835560]  schedule+0x33/0x90
May  6 01:35:35 localhost kernel: [58614.835570]  wait_barrier+0x134/0x190 [raid10]
May  6 01:35:35 localhost kernel: [58614.835578]  ? wait_woken+0x80/0x80
May  6 01:35:35 localhost kernel: [58614.835585]  raid10_write_request+0x77/0x950 [raid10]
...
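
To spot this kind of hang in a larger log, the hung-task warning can be matched with a regular expression. The following is a rough sketch, assuming each log entry sits on a single line; the name `find_hung_tasks` is made up for illustration.

```python
import re

# Matches the kernel hung-task warning, e.g.
# "INFO: task xfsaild/md3:1056 blocked for more than 120 seconds."
HUNG_TASK = re.compile(
    r"INFO: task (?P<task>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(lines):
    """Yield (task, pid, seconds) for each hung-task warning found."""
    for line in lines:
        m = HUNG_TASK.search(line)
        if m:
            yield m.group("task"), int(m.group("pid")), int(m.group("secs"))


# Two entries taken from the log excerpt in this report.
sample = [
    "May  6 01:35:35 localhost kernel: [58614.835510] INFO: task "
    "xfsaild/md3:1056 blocked for more than 120 seconds.",
    "May  6 01:35:35 localhost kernel: [58614.835535] Call Trace:",
]

for task, pid, secs in find_hung_tasks(sample):
    print(task, pid, secs)
```

On a real system the lines would come from the kernel log (for example a syslog file) rather than the inline sample.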

Another line of interest:

May  6 02:05:59 localhost udisksd[1480]: failed to load module mdraid: libbd_mdraid.so.2: cannot open shared object file: No such file or directory
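
A similar sketch can list which udisksd modules failed to load and which shared library was missing. The helper name `missing_modules` is hypothetical; the pattern is derived only from the single log entry quoted above, so other udisks2 versions may word the message differently.

```python
import re

# Pattern based on the one udisksd entry quoted in this report.
MODULE_FAIL = re.compile(
    r"udisksd\[\d+\]: failed to load module (?P<module>\w+): (?P<lib>\S+):"
)


def missing_modules(log_text):
    """Return {module: library} for udisksd module-load failures."""
    # Collapse whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    return {m.group("module"): m.group("lib")
            for m in MODULE_FAIL.finditer(flat)}


sample = """May  6 02:05:59 localhost udisksd[1480]: failed to load module mdraid:
libbd_mdraid.so.2: cannot open shared object file: No such file or
directory"""

print(missing_modules(sample))
```

In this report the missing library turned out to belong to the libblockdev-mdraid2 package; the sketch does not try to map library names to packages.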

It turned out that the package libblockdev-mdraid2 was not installed. I
installed it, along with libblockdev-mpath2, and rebooted. mdadm
restarted the reconstruction, which was OK when I came back from sleep.

The system has looked stable since then.

** Affects: udisks2 (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Desktop
Packages, which is subscribed to udisks2 in Ubuntu.
https://bugs.launchpad.net/bugs/1769553

Title:
  xfs blocks on unresponsive raid - blockdev module is missing

Status in udisks2 package in Ubuntu:
  New


To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/udisks2/+bug/1769553/+subscriptions
