apport information

** Attachment added: "UdevDb.txt"
   https://bugs.launchpad.net/bugs/1882312/+attachment/5380832/+files/UdevDb.txt

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1882312

Title:
  RAID5 reshape stuck due to same badblock on multiple devices

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  Linux [hostname removed] 5.3.0-55-generic #49-Ubuntu SMP Thu May 21
  12:47:19 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux

  Ubuntu release: 19.10 (although the same issue is present in 18.04 and
  20.04 as well).

  A RAID5 reshape from 3 to 4 devices got stuck:

  md127 : active raid5 sde1[5] sdd1[4] sdc1[0] sdf1[3]
        7813769216 blocks super 1.2 level 5, 512k chunk, algorithm 2 [4/4] 
[UUUU]
        [>....................]  reshape =  1.8% (72261116/3906884608) 
finish=1663133.7min speed=38K/sec
        bitmap: 0/30 pages [0KB], 65536KB chunk

  with the following stack trace:

  [54979.996871] INFO: task md127_reshape:7090 blocked for more than 1208 
seconds.
  [54979.996922]       Tainted: P           OE     5.3.0-55-generic #49-Ubuntu
  [54979.996967] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables 
this message.
  [54979.997018] md127_reshape   D    0  7090      2 0x80004080
  [54979.997019] Call Trace:
  [54979.997022]  __schedule+0x2b9/0x6c0
  [54979.997023]  schedule+0x42/0xb0
  [54979.997027]  reshape_request+0x878/0x950 [raid456]
  [54979.997028]  ? wait_woken+0x80/0x80
  [54979.997030]  raid5_sync_request+0x302/0x3b0 [raid456]
  [54979.997032]  md_do_sync.cold+0x3ef/0x999
  [54979.997034]  ? ecryptfs_write_begin+0x70/0x280
  [54979.997034]  ? __switch_to_asm+0x40/0x70
  [54979.997035]  ? __switch_to_asm+0x34/0x70
  [54979.997035]  ? __switch_to_asm+0x40/0x70
  [54979.997036]  ? __switch_to_asm+0x34/0x70
  [54979.997036]  ? __switch_to_asm+0x40/0x70
  [54979.997037]  ? __switch_to_asm+0x34/0x70
  [54979.997038]  md_thread+0x97/0x160
  [54979.997040]  kthread+0x104/0x140
  [54979.997040]  ? md_start_sync+0x60/0x60
  [54979.997041]  ? kthread_park+0x80/0x80
  [54979.997042]  ret_from_fork+0x35/0x40

  No other hardware errors were reported and the reshape got stuck at
  somewhat different blocks every time it was restarted (all within the
  same vicinity of each others). It turns out that md had injected the
  same exact sector into the badblock log of multiple devices at some
  point before the reshape was started. This could be seen with "mdadm
  --examine-badblocks /dev/sdXY". The original cause for the badblocks
  entries was probably a loose cable as the reported sectors were fully
  readable with the "dd" and "badblocks" command.

  The problem was eventually resolved by removing the badblock log on
  the RAID5 device using "mdadm --assemble /dev/md0 --update=force-no-
  bbl". Having removed the badblock log, reshape progressed beyond the
  previously troublesome area of blocks.

  I would have expected at least an error message in the kernel log rather than 
just a "hung task" message, probably before the reshape was allowed to be 
initiated (aka early termination). Furthermore, it would be beneficial if mdadm 
could allow the badblock log to be cleared for a device rather than removed on 
the array with "update=force-no-bbl".
  --- 
  ProblemType: Bug
  AlsaVersion: Advanced Linux Sound Architecture Driver Version 
k5.3.0-55-generic.
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
  ApportVersion: 2.20.11-0ubuntu8.9
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 
'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path', 
'/dev/snd/controlC0', '/dev/snd/hwC0D2', '/dev/snd/hwC0D0', 
'/dev/snd/pcmC0D9p', '/dev/snd/pcmC0D8p', '/dev/snd/pcmC0D7p', 
'/dev/snd/pcmC0D3p', '/dev/snd/pcmC0D2c', '/dev/snd/pcmC0D1p', 
'/dev/snd/pcmC0D0c', '/dev/snd/pcmC0D0p', '/dev/snd/seq', '/dev/snd/timer'] 
failed with exit code 1:
  Card0.Amixer.info: Error: [Errno 2] No such file or directory: 'amixer': 
'amixer'
  Card0.Amixer.values: Error: [Errno 2] No such file or directory: 'amixer': 
'amixer'
  DistroRelease: Ubuntu 19.10
  HibernationDevice: RESUME=/dev/mapper/vg0-swap
  IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
  Lsusb:
   Bus 004 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 003 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
   Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
   Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
  MachineType: System manufacturer System Product Name
  NonfreeKernelModules: zfs zunicode zavl icp zcommon znvpair
  Package: linux (not installed)
  ProcEnviron:
   LC_CTYPE=en_US.UTF-8
   TERM=xterm
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 i915drmfb
  ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-5.3.0-55-generic 
root=/dev/mapper/vg0-root ro swapaccount=1 acpi_enforce_resources=lax 
intel_iommu=on pci=assign-busses
  ProcVersionSignature: Ubuntu 5.3.0-55.49-generic 5.3.18
  RelatedPackageVersions:
   linux-restricted-modules-5.3.0-55-generic N/A
   linux-backports-modules-5.3.0-55-generic  N/A
   linux-firmware                            1.183.5
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
  Tags:  eoan
  Uname: Linux 5.3.0-55-generic x86_64
  UnreportableReason: This report is about a package that is not installed.
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups:
   
  _MarkForUpload: False
  dmi.bios.date: 03/15/2018
  dmi.bios.vendor: American Megatrends Inc.
  dmi.bios.version: 1302
  dmi.board.asset.tag: Default string
  dmi.board.name: PRIME Z270-A
  dmi.board.vendor: ASUSTeK COMPUTER INC.
  dmi.board.version: Rev 1.xx
  dmi.chassis.asset.tag: Default string
  dmi.chassis.type: 3
  dmi.chassis.vendor: Default string
  dmi.chassis.version: Default string
  dmi.modalias: 
dmi:bvnAmericanMegatrendsInc.:bvr1302:bd03/15/2018:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnPRIMEZ270-A:rvrRev1.xx:cvnDefaultstring:ct3:cvrDefaultstring:
  dmi.product.family: To be filled by O.E.M.
  dmi.product.name: System Product Name
  dmi.product.sku: SKU
  dmi.product.version: System Version
  dmi.sys.vendor: System manufacturer

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1882312/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to