** Description changed: mdmonitor reacts on md events, it pools on /proc/mdstat file. Those events are generated if a change on any mddevice is observed in kernel. This is done asynchronously and can be caused by user space process (mdadm called by udev or user), or by kernel itself (drive is removed - because it have to much errors). + because it has to many errors). The problem here is that mdmonitor isn't dealing with user space or udev. When drive with metadata is inserted, mdadm adds it to mddevice (it is done by udev). Md Event is generated then and mdmonitor may try to move drive to other mddevice if needed. It relies on by-path links, but this link to newly appeared device is not created yet, udev is still working on. As a result recovery doesn't start immediately. Observed on Ubuntu 20.04. Steps to reproduce: 1. Create RAID volume: # mdadm --create /dev/md/imsm0 --metadata=imsm --raid-devices=4 /dev/nvme6n1 /dev/nvme1n1 /dev/nvme7n1 /dev/nvme3n1 --run - # mdadm --create /dev/md/r10d4s64-20_A --level=10 --chunk 64 --raid-devices=4 /dev/nvme6n1 /dev/nvme1n1 /dev/nvme7n1 /dev/nvme3n1 --run + # mdadm --create /dev/md/r10d4s64-20_A --level=10 --chunk 64 --raid-devices=4 /dev/nvme6n1 /dev/nvme1n1 /dev/nvme7n1 /dev/nvme3n1 --run 2. Add spare to container: # mdadm --add /dev/md/imsm0 /dev/nvme0n1 3. Create appropriate policy line in /etc/mdadm/mdadm.conf. POLICY domain=RAID_DOMAIN_1 path=* action=spare-same-slot - 4. Disconnect spare from container. 5. Start mdadm monitor with big delay (ex. 10 minutes): # mdadm --monitor --delay 6000 --scan --mail=root@localhost --daemonize --syslog 6. Hot remove disk from array (physical disconnect). 7. Connect previously prepared spare. - + Expected results: Rebuild should start. Actual results: Rebuild does not start, added spare is in separate container.
-- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1888812 Title: mdmonitor doesn't start recovery immediately To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/mdadm/+bug/1888812/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs