At this point I'd appreciate some input from the kernel team what this
message actually means, and what the likely cause could be. It does not
really seem specific to either hdparm nor libatasmart, all they do is
things like
https://bugs.launchpad.net/ubuntu/karmic/+source/libatasmart/+bug/445852/comments/202
(i. e. using the normal SCSI ioctls).

One possible workaround might be to not run hdparm on SSD devices. This
only might fix the symptom, but it might be an appropriate SRU for
lucid.

** Description changed:

- Will be added later.
+ This is related to bug 445852, it causes the same effects, but under
+ different circumstances.
+ 
+ During boot on an SSD system and when logging in and starting something
+ rather IO intense, like firefox, the system freezes for 30 seconds, and
+ afterwards dmesg shows an error like
+ 
+   ata2: lost interrupt (Status 0x58)
+   ata2: drained 16384 bytes to clear DRQ.
+   ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
+   ata2.00: BMDMA stat 0x4
+   ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
+   res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
+   ata2.00: status: { DRDY DRQ }
+   ata2: soft resetting link
+   ata2.00: configured for UDMA/66
+   ata2: EH complete
+ 
+ The main cause of bug 445828 was fixed now, but there are still some
+ users who get those HSM violations/30 second hangs during boot.
+ 
+ Martin Pitt got ssh access to jarikvh's machine which is still affected
+ (he's willing to provide access to other people for debugging).
+ 
+ A lot of different commands were tried to reproduce this at runtime,
+ like
+ 
+   # for i in `seq 50`; do skdump --can-smart  /dev/sda; hdparm -B254 
/dev/sda; sleep 0.2; done
+   # udevadm trigger --action=change --sysname-match=sda # (also in a loop)
+   # (/lib/udev/udisks-probe-ata-smart /dev/sda &); 
/lib/udev/udisks-probe-ata-smart /dev/sda
+ 
+ and so on, but it seems impossible to reproduce at runtime
+ unfortunately. I also tried those commands while a "grep -r . /usr" was
+ running in the background to induce I/O and disk reading activity.
+ 
+ The interesting thing is that the bug goes away if you either disable
+ /lib/udev/rules.d/85-hdparm.rules, or udisks-probe-ata-smart in
+ /lib/udev/rules.d/80-udisks.rules. So this seems to happen in situations
+ where there is something reading a lot of files from the disk, and
+ hdparm or libatasmart send their ioctls to the drive.
+ 
  
  ProblemType: Bug
  DistroRelease: Ubuntu 10.04
  Package: linux-image-2.6.32-21-generic 2.6.32-21.32
  Regression: No
  Reproducible: Yes
  ProcVersionSignature: Ubuntu 2.6.32-21.32-generic 2.6.32.11+drm33.2
  Uname: Linux 2.6.32-21-generic i686
  AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
  AplayDevices:
-  **** List of PLAYBACK Hardware Devices ****
-  card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
-    Subdevices: 1/1
-    Subdevice #0: subdevice #0
+  **** List of PLAYBACK Hardware Devices ****
+  card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
+    Subdevices: 1/1
+    Subdevice #0: subdevice #0
  Architecture: i386
  ArecordDevices:
-  **** List of CAPTURE Hardware Devices ****
-  card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
-    Subdevices: 1/1
-    Subdevice #0: subdevice #0
+  **** List of CAPTURE Hardware Devices ****
+  card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
+    Subdevices: 1/1
+    Subdevice #0: subdevice #0
  AudioDevicesInUse:
-  USER        PID ACCESS COMMAND
-  /dev/snd/controlC0:  jarik      1395 F.... pulseaudio
+  USER        PID ACCESS COMMAND
+  /dev/snd/controlC0:  jarik      1395 F.... pulseaudio
  Card0.Amixer.info:
-  Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16'
-    Mixer name : 'Realtek ALC268'
-    Components : 'HDA:10ec0268,1025015b,00100101'
-    Controls      : 8
-    Simple ctrls  : 5
+  Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16'
+    Mixer name : 'Realtek ALC268'
+    Components : 'HDA:10ec0268,1025015b,00100101'
+    Controls      : 8
+    Simple ctrls  : 5
  Date: Mon May  3 15:21:38 2010
  InstallationMedia: Ubuntu-Netbook-Remix 9.10 "Karmic Koala" - Release i386 
(20091028.4)
  MachineType: Acer AOA110
  ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-21-generic 
root=UUID=994b1074-44a7-4871-9553-fe61b94182cf ro quiet splash 
pciehp.pciehp_force=1 elevator=noop
  ProcEnviron:
-  LANG=en_US.utf8
-  SHELL=/bin/bash
+  LANG=en_US.utf8
+  SHELL=/bin/bash
  RelatedPackageVersions: linux-firmware 1.34
  RfKill:
-  0: phy0: Wireless LAN
-       Soft blocked: no
-       Hard blocked: no
+  0: phy0: Wireless LAN
+   Soft blocked: no
+   Hard blocked: no
  SourcePackage: linux
  dmi.bios.date: 05/09/2008
  dmi.bios.vendor: INSYDE
  dmi.bios.version: v0.3109
  dmi.board.asset.tag: Base Board Asset Tag
  dmi.board.name: Base Board Product Name
  dmi.board.vendor: Intel Corp.
  dmi.board.version: Base Board Version
  dmi.chassis.type: 1
  dmi.chassis.vendor: Chassis Manufacturer
  dmi.chassis.version: Chassis Version
  dmi.modalias: 
dmi:bvnINSYDE:bvrv0.3109:bd05/09/2008:svnAcer:pnAOA110:pvr1:rvnIntelCorp.:rnBaseBoardProductName:rvrBaseBoardVersion:cvnChassisManufacturer:ct1:cvrChassisVersion:
  dmi.product.name: AOA110
  dmi.product.version: 1
  dmi.sys.vendor: Acer

** Changed in: linux (Ubuntu)
       Status: New => Confirmed

** Changed in: linux (Ubuntu)
   Importance: Undecided => High

** Changed in: linux (Ubuntu)
     Assignee: (unassigned) => Canonical Kernel Team (canonical-kernel-team)

** Description changed:

  This is related to bug 445852, it causes the same effects, but under
  different circumstances.
  
  During boot on an SSD system and when logging in and starting something
  rather IO intense, like firefox, the system freezes for 30 seconds, and
  afterwards dmesg shows an error like
  
-   ata2: lost interrupt (Status 0x58)
-   ata2: drained 16384 bytes to clear DRQ.
-   ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
-   ata2.00: BMDMA stat 0x4
-   ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
-   res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
-   ata2.00: status: { DRDY DRQ }
-   ata2: soft resetting link
-   ata2.00: configured for UDMA/66
-   ata2: EH complete
+   ata2: lost interrupt (Status 0x58)
+   ata2: drained 16384 bytes to clear DRQ.
+   ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
+   ata2.00: BMDMA stat 0x4
+   ata2.00: cmd c8/00:40:cb:60:32/00:00:00:00:00/e0 tag 0 dma 32768 in
+   res 58/00:40:cb:60:32/00:00:00:00:00/e0 Emask 0x2 (HSM violation)
+   ata2.00: status: { DRDY DRQ }
+   ata2: soft resetting link
+   ata2.00: configured for UDMA/66
+   ata2: EH complete
  
  The main cause of bug 445828 was fixed now, but there are still some
  users who get those HSM violations/30 second hangs during boot.
  
- Martin Pitt got ssh access to jarikvh's machine which is still affected
+ Martin Pitt got ssh access to Jarige's machine which is still affected
  (he's willing to provide access to other people for debugging).
  
  A lot of different commands were tried to reproduce this at runtime,
  like
  
-   # for i in `seq 50`; do skdump --can-smart  /dev/sda; hdparm -B254 
/dev/sda; sleep 0.2; done
-   # udevadm trigger --action=change --sysname-match=sda # (also in a loop)
-   # (/lib/udev/udisks-probe-ata-smart /dev/sda &); 
/lib/udev/udisks-probe-ata-smart /dev/sda
+   # for i in `seq 50`; do skdump --can-smart  /dev/sda; hdparm -B254 
/dev/sda; sleep 0.2; done
+   # udevadm trigger --action=change --sysname-match=sda # (also in a loop)
+   # (/lib/udev/udisks-probe-ata-smart /dev/sda &); 
/lib/udev/udisks-probe-ata-smart /dev/sda
  
  and so on, but it seems impossible to reproduce at runtime
  unfortunately. I also tried those commands while a "grep -r . /usr" was
  running in the background to induce I/O and disk reading activity.
  
  The interesting thing is that the bug goes away if you either disable
  /lib/udev/rules.d/85-hdparm.rules, or udisks-probe-ata-smart in
  /lib/udev/rules.d/80-udisks.rules. So this seems to happen in situations
  where there is something reading a lot of files from the disk, and
  hdparm or libatasmart send their ioctls to the drive.
- 
  
  ProblemType: Bug
  DistroRelease: Ubuntu 10.04
  Package: linux-image-2.6.32-21-generic 2.6.32-21.32
  Regression: No
  Reproducible: Yes
  ProcVersionSignature: Ubuntu 2.6.32-21.32-generic 2.6.32.11+drm33.2
  Uname: Linux 2.6.32-21-generic i686
  AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
  AplayDevices:
   **** List of PLAYBACK Hardware Devices ****
   card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
     Subdevices: 1/1
     Subdevice #0: subdevice #0
  Architecture: i386
  ArecordDevices:
   **** List of CAPTURE Hardware Devices ****
   card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
     Subdevices: 1/1
     Subdevice #0: subdevice #0
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC0:  jarik      1395 F.... pulseaudio
  Card0.Amixer.info:
   Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16'
     Mixer name : 'Realtek ALC268'
     Components : 'HDA:10ec0268,1025015b,00100101'
     Controls      : 8
     Simple ctrls  : 5
  Date: Mon May  3 15:21:38 2010
  InstallationMedia: Ubuntu-Netbook-Remix 9.10 "Karmic Koala" - Release i386 
(20091028.4)
  MachineType: Acer AOA110
  ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.32-21-generic 
root=UUID=994b1074-44a7-4871-9553-fe61b94182cf ro quiet splash 
pciehp.pciehp_force=1 elevator=noop
  ProcEnviron:
   LANG=en_US.utf8
   SHELL=/bin/bash
  RelatedPackageVersions: linux-firmware 1.34
  RfKill:
   0: phy0: Wireless LAN
    Soft blocked: no
    Hard blocked: no
  SourcePackage: linux
  dmi.bios.date: 05/09/2008
  dmi.bios.vendor: INSYDE
  dmi.bios.version: v0.3109
  dmi.board.asset.tag: Base Board Asset Tag
  dmi.board.name: Base Board Product Name
  dmi.board.vendor: Intel Corp.
  dmi.board.version: Base Board Version
  dmi.chassis.type: 1
  dmi.chassis.vendor: Chassis Manufacturer
  dmi.chassis.version: Chassis Version
  dmi.modalias: 
dmi:bvnINSYDE:bvrv0.3109:bd05/09/2008:svnAcer:pnAOA110:pvr1:rvnIntelCorp.:rnBaseBoardProductName:rvrBaseBoardVersion:cvnChassisManufacturer:ct1:cvrChassisVersion:
  dmi.product.name: AOA110
  dmi.product.version: 1
  dmi.sys.vendor: Acer

** Also affects: hdparm (Ubuntu)
   Importance: Undecided
       Status: New

-- 
udisks-probe-ata-smart causes HSM violations
https://bugs.launchpad.net/bugs/574462
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to