At this point I'd appreciate some input from the kernel team what this message actually means, and what the likely cause could be. It does not really seem specific to either hdparm nor libatasmart, all they do is things like https://bugs.launchpad.net/ubuntu/karmic/+source/libatasmart/+bug/445852/comments/202 (i. e. using the normal SCSI ioctls).
One possible workaround might be to not run hdparm on SSD devices. This only might fix the symptom, but it might be an appropriate SRU for lucid. ** Description changed: - Will be added later. + This is related to bug 445852, it causes the same effects, but under + different circumstances. + + During boot on an SSD system and when logging in and starting something + rather IO intense, like firefox, the system freezes for 30 seconds, and + afterwards dmesg shows an error like + + ata2: lost interrupt (Status 0x58) + ata2: drained 16384 bytes to clear DRQ. + ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen + ata2.00: BMDMA stat 0x4 + ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in + res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation) + ata2.00: status: { DRDY DRQ } + ata2: soft resetting link + ata2.00: configured for UDMA/66 + ata2: EH complete + + The main cause of bug 445828 was fixed now, but there are still some + users who get those HSM violations/30 second hangs during boot. + + Martin Pitt got ssh access to jarikvh's machine which is still affected + (he's willing to provide access to other people for debugging). + + A lot of different commands were tried to reproduce this at runtime, + like + + # for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done + # udevadm trigger --action=change --sysname-match=sda # (also in a loop) + # (/lib/udev/udisks-probe-ata-smart /dev/sda &); /lib/udev/udisks-probe-ata-smart /dev/sda + + and so on, but it seems impossible to reproduce at runtime + unfortunately. I also tried those commands while a "grep -r . /usr" was + running in the background to induce I/O and disk reading activity. + + The interesting thing is that the bug goes away if you either disable + /lib/udev/rules.d/85-hdparm.rules, or udisks-probe-ata-smart in + /lib/udev/rules.d/80-udisks.rules. So this seems to happen in situations + where there is something reading a lot of files from the disk, and + hdparm or libatasmart send their ioctls to the drive. + ProblemType: Bug DistroRelease: Ubuntu 10.04 Package: linux-image-2.6.32-21-generic 2.6.32-21.32 Regression: No Reproducible: Yes ProcVersionSignature: Ubuntu 2.6.32-21.32-generic 2.6.32.11+drm33.2 Uname: Linux 2.6.32-21-generic i686 AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21. AplayDevices: - **** List of PLAYBACK Hardware Devices **** - card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog] - Subdevices: 1/1 - Subdevice #0: subdevice #0 + **** List of PLAYBACK Hardware Devices **** + card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog] + Subdevices: 1/1 + Subdevice #0: subdevice #0 Architecture: i386 ArecordDevices: - **** List of CAPTURE Hardware Devices **** - card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog] - Subdevices: 1/1 - Subdevice #0: subdevice #0 + **** List of CAPTURE Hardware Devices **** + card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog] + Subdevices: 1/1 + Subdevice #0: subdevice #0 AudioDevicesInUse: - USER PID ACCESS COMMAND - /dev/snd/controlC0: jarik 1395 F.... pulseaudio + USER PID ACCESS COMMAND + /dev/snd/controlC0: jarik 1395 F.... pulseaudio Card0.Amixer.info: - Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16' - Mixer name : 'Realtek ALC268' - Components : 'HDA:10ec0268,1025015b,00100101' - Controls : 8 - Simple ctrls : 5 + Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16' + Mixer name : 'Realtek ALC268' + Components : 'HDA:10ec0268,1025015b,00100101' + Controls : 8 + Simple ctrls : 5 Date: Mon May 3 15:21:38 2010 InstallationMedia: Ubuntu-Netbook-Remix 9.10 "Karmic Koala" - Release i386 (20091028.4) MachineType: Acer AOA110 ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-21-generic root=UUID=994b1074-44a7-4871-9553-fe61b94182cf ro quiet splash pciehp.pciehp_force=1 elevator=noop ProcEnviron: - LANG=en_US.utf8 - SHELL=/bin/bash + LANG=en_US.utf8 + SHELL=/bin/bash RelatedPackageVersions: linux-firmware 1.34 RfKill: - 0: phy0: Wireless LAN - Soft blocked: no - Hard blocked: no + 0: phy0: Wireless LAN + Soft blocked: no + Hard blocked: no SourcePackage: linux dmi.bios.date: 05/09/2008 dmi.bios.vendor: INSYDE dmi.bios.version: v0.3109 dmi.board.asset.tag: Base Board Asset Tag dmi.board.name: Base Board Product Name dmi.board.vendor: Intel Corp. dmi.board.version: Base Board Version dmi.chassis.type: 1 dmi.chassis.vendor: Chassis Manufacturer dmi.chassis.version: Chassis Version dmi.modalias: dmi:bvnINSYDE:bvrv0.3109:bd05/09/2008:svnAcer:pnAOA110:pvr1:rvnIntelCorp.:rnBaseBoardProductName:rvrBaseBoardVersion:cvnChassisManufacturer:ct1:cvrChassisVersion: dmi.product.name: AOA110 dmi.product.version: 1 dmi.sys.vendor: Acer ** Changed in: linux (Ubuntu) Status: New => Confirmed ** Changed in: linux (Ubuntu) Importance: Undecided => High ** Changed in: linux (Ubuntu) Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team) ** Description changed: This is related to bug 445852, it causes the same effects, but under different circumstances. During boot on an SSD system and when logging in and starting something rather IO intense, like firefox, the system freezes for 30 seconds, and afterwards dmesg shows an error like - ata2: lost interrupt (Status 0x58) - ata2: drained 16384 bytes to clear DRQ. - ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen - ata2.00: BMDMA stat 0x4 - ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in - res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation) - ata2.00: status: { DRDY DRQ } - ata2: soft resetting link - ata2.00: configured for UDMA/66 - ata2: EH complete + ata2: lost interrupt (Status 0x58) + ata2: drained 16384 bytes to clear DRQ. + ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen + ata2.00: BMDMA stat 0x4 + ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in + res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation) + ata2.00: status: { DRDY DRQ } + ata2: soft resetting link + ata2.00: configured for UDMA/66 + ata2: EH complete The main cause of bug 445828 was fixed now, but there are still some users who get those HSM violations/30 second hangs during boot. - Martin Pitt got ssh access to jarikvh's machine which is still affected + Martin Pitt got ssh access to Jarige's machine which is still affected (he's willing to provide access to other people for debugging). A lot of different commands were tried to reproduce this at runtime, like - # for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done - # udevadm trigger --action=change --sysname-match=sda # (also in a loop) - # (/lib/udev/udisks-probe-ata-smart /dev/sda &); /lib/udev/udisks-probe-ata-smart /dev/sda + # for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done + # udevadm trigger --action=change --sysname-match=sda # (also in a loop) + # (/lib/udev/udisks-probe-ata-smart /dev/sda &); /lib/udev/udisks-probe-ata-smart /dev/sda and so on, but it seems impossible to reproduce at runtime unfortunately. I also tried those commands while a "grep -r . /usr" was running in the background to induce I/O and disk reading activity. The interesting thing is that the bug goes away if you either disable /lib/udev/rules.d/85-hdparm.rules, or udisks-probe-ata-smart in /lib/udev/rules.d/80-udisks.rules. So this seems to happen in situations where there is something reading a lot of files from the disk, and hdparm or libatasmart send their ioctls to the drive. - ProblemType: Bug DistroRelease: Ubuntu 10.04 Package: linux-image-2.6.32-21-generic 2.6.32-21.32 Regression: No Reproducible: Yes ProcVersionSignature: Ubuntu 2.6.32-21.32-generic 2.6.32.11+drm33.2 Uname: Linux 2.6.32-21-generic i686 AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21. AplayDevices: **** List of PLAYBACK Hardware Devices **** card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 Architecture: i386 ArecordDevices: **** List of CAPTURE Hardware Devices **** card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog] Subdevices: 1/1 Subdevice #0: subdevice #0 AudioDevicesInUse: USER PID ACCESS COMMAND /dev/snd/controlC0: jarik 1395 F.... pulseaudio Card0.Amixer.info: Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16' Mixer name : 'Realtek ALC268' Components : 'HDA:10ec0268,1025015b,00100101' Controls : 8 Simple ctrls : 5 Date: Mon May 3 15:21:38 2010 InstallationMedia: Ubuntu-Netbook-Remix 9.10 "Karmic Koala" - Release i386 (20091028.4) MachineType: Acer AOA110 ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-21-generic root=UUID=994b1074-44a7-4871-9553-fe61b94182cf ro quiet splash pciehp.pciehp_force=1 elevator=noop ProcEnviron: LANG=en_US.utf8 SHELL=/bin/bash RelatedPackageVersions: linux-firmware 1.34 RfKill: 0: phy0: Wireless LAN Soft blocked: no Hard blocked: no SourcePackage: linux dmi.bios.date: 05/09/2008 dmi.bios.vendor: INSYDE dmi.bios.version: v0.3109 dmi.board.asset.tag: Base Board Asset Tag dmi.board.name: Base Board Product Name dmi.board.vendor: Intel Corp. dmi.board.version: Base Board Version dmi.chassis.type: 1 dmi.chassis.vendor: Chassis Manufacturer dmi.chassis.version: Chassis Version dmi.modalias: dmi:bvnINSYDE:bvrv0.3109:bd05/09/2008:svnAcer:pnAOA110:pvr1:rvnIntelCorp.:rnBaseBoardProductName:rvrBaseBoardVersion:cvnChassisManufacturer:ct1:cvrChassisVersion: dmi.product.name: AOA110 dmi.product.version: 1 dmi.sys.vendor: Acer ** Also affects: hdparm (Ubuntu) Importance: Undecided Status: New -- udisks-probe-ata-smart causes HSM violations https://bugs.launchpad.net/bugs/574462 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs