Re: [Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Steve Langasek
On Thu, Oct 06, 2011 at 01:18:27AM -, Serge Hallyn wrote:

 see comment #42 for one disk layout that reproduces it for me in a kvm
 VM.

This describes a physical disk layout... it doesn't describe a partition or
LV layout, which is what is key to reproducing the LVM-related hang.  Is
this VM using LVM?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Steve Langasek
On Thu, Oct 06, 2011 at 01:20:03AM -, Serge Hallyn wrote:
 also see bug 833891 as a udev bug specifically for the LVM case.

Do you mean you're already tracking the LVM case on that bug instead, and
that in the setup you're using there is *no* use of LVM?  I guess no one
told Adam that this is a different bug, because it was his LVM-based setup
that I was responding to... :)

 In fact, Eduard toward the end speculated precisely the semaphore as a
 cause as you just did.

That was Eduard's speculation, copied from the *other* bug I linked to.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread James Hunt
@Adam/@Serge: could you try the following to see if it solves / reduces
the occurrence of the problem. Also, can you report back if you see any
interesting processes as logged by the change below:

1) Change initramfs to bind mount /dev and dump devices and running
processes before udevd exits:

$ sudo sed -i.bak -e '/^udevadm control --exit/i echo XXX: ps=`ps -ef`, 
\/dev=`ls -l \/dev`  /dev/kmsg' -e 's!^mount -n -o move /dev!mount -n -o bind 
/dev!g' /usr/share/initramfs-tools/scripts/init-bottom/udev
$ sudo mkinitramfs -o /boot/initramfs.foo

2) Reboot setting initrd /boot/initramfs.foo in grub.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Serge Hallyn
@Steve,

(re comment #52) I'm not sure what you were asking for then, but as I
said in that comment, partition 1 is a simple ext3 filesystem.
Partition 2 is just an extended.  Partition 5 (the only one on the
extended) is swap.  There is no LVM.  With standard (non-instrumented)
udev it hangs roughly every 5 boots.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Serge Hallyn
Quoting Steve Langasek (steve.langa...@canonical.com):
 On Thu, Oct 06, 2011 at 01:20:03AM -, Serge Hallyn wrote:
  also see bug 833891 as a udev bug specifically for the LVM case.
 
 Do you mean you're already tracking the LVM case on that bug instead, and

I was, yes.  I filed that one some time ago to track the hang on my own
laptop.  I realize there are too many open bugs on this, and we should
mark some as dupes.

 that in the setup you're using there is *no* use of LVM?  I guess no
one

Yes.

 told Adam that this is a different bug, because it was his LVM-based setup
 that I was responding to... :)

Right.

  In fact, Eduard toward the end speculated precisely the semaphore as a
  cause as you just did.
 
 That was Eduard's speculation, copied from the *other* bug I linked to.

Right.

NOTE though:

The LVM authors have admonished us for calling vgscan/vgchange in a udev
rule, as that has its own deadlock potential.  My debdiff in bug 833891
is meant to address that.  It sounds like that deadlock is separate from
the speculation above.  Although perhaps not - such a deadlock could
explain why a semaphore in udevd is not getting Vd (since udevd never
finishes its processing of that rule, but just gets killed on timeout)?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Serge Hallyn
Quoting James Hunt (818...@bugs.launchpad.net):
 @Adam/@Serge: could you try the following to see if it solves / reduces
 the occurrence of the problem.

It certainly didn't solve it, hung on first try :)  I'm not sure what you
would deem interesting, but here is the output from the script as obtained
from dmesg.

...

[0.651771] udevd[79]: starting version 173
[0.812351] FDC 0 is a S82078B
[0.815764] EXT3-fs: barriers not enabled
[0.818981] kjournald starting.  Commit interval 5 seconds
[0.819002] EXT3-fs (vda1): mounted filesystem with ordered data mode
[0.839767] XXX: ps=  PID USER   VSZ STAT COMMAND
[0.839768] 1 0 4456 S/bin/sh /init ro quiet
[0.839769] 2 00 SW   [kthreadd]
[0.839770] 3 00 SW   [ksoftirqd/0]
[0.839771] 4 00 SW   [kworker/0:0]
[0.839771] 5 00 SW   [kworker/u:0]
[0.839772] 6 00 SW   [migration/0]
[0.839773] 7 00 SW  [cpuset]
[0.839773] 8 00 SW  [khelper]
[0.839774] 9 00 SW  [netns]
[0.839774]10 00 SW   [sync_supers]
[0.839775]11 00 SW   [bdi-default]
[0.839776]12 00 SW  [kintegrityd]
[0.839776]13 00 SW  [kblockd]
[0.839777]14 00 SW  [ata_sff]
[0.839778]15 00 SW   [khubd]
[0.839778]16 00 SW  [md]
[0.839779]17 00 SW   [kworker/0:1]
[0.839780]18 00 SW   [khungtaskd]
[0.839780]19 00 SW   [kworker/u:1]
[0.839781]20 00 SW   [kswapd0]
[0.839781]21 00 SWN  [ksmd]
[0.839782]22 00 SW   [fsnotify_mark]
[0.839783]23 00 SW   [ecryptfs-kthrea]
[0.839783]24 00 SW  [crypto]
[0.839784]32 00 SW  [kthrotld]
[0.839785]33 00 SW   [aemon --resolve-names=never
[0.839807]96 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839808]97 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839808]98 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839809]99 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839810]   100 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839811]   101 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839812]   102 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839813]   103 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839816]   104 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839817]   105 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839817]   106 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839818]   107 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839819]   108 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839820]   109 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839821]   110 012752 S/sb   /sbin/udevd --daemon 
--resolve-names=never
[0.839833]   115 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839834]   116 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839834]   120 012752 S/sbin/udevd --daemon 
--resolve-names=never
[0.839835]   121 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839836]   122 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839837]   123 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839838]   124 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839839]   125 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839840]   126 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839840]   127 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839841]   128 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839842]   129 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839843]   130 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839844]   131 012884 S/sbin/udevd --daemon 
--resolve-names=never
[0.839845]   132 0 012984 S/sbin/udevd --daemon 
--resolve-names=never
[0.839854]   175 012988 S/sbin/udevd --daemon 
--resolve-names=never
[0.839855]   176 012968 S/sbin/udevd --daemon 
--resolve-names=never
[0.839856]   177 012964 S/sbin/udevd --daemon 
--resolve-names=never
[0.839857]   178 012976 S/sbin/udevd --daemon 
--resolve-names=never
[0.839858]   179 012796 S/sbin/udevd --daemon 

[Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Steve Langasek
Serge,

 I'm not sure what you were asking for then, but as I said in that
 comment, partition 1 is a simple ext3 filesystem.

Oh, doh - I read the wrong comment.  Sorry. :)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-06 Thread Andy Whitcroft
Ok.  I have a machine here which triggers somethign similar to this
pretty often.  For me udev finds /dev is read-only and halts boot.  This
is presumably because our devtmpfs /dev has not made it into / when it
starts.  I was getting a failed boot about every 5-6 boots, 4 in 20
overall.  I then used the work around in comment #54, with that in place
I have 35 consecutive successful boots.

The conjecture is that /dev in the initramfs is busy and cannot be moved
when needed.  From the ps output in comment #57 it seems udev has not
actually quit when requested, which might well account for this.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Adam Gandelman
After modifying initramfs similar to
https://bugs.launchpad.net/ubuntu/oneiric/+source/udev/+bug/833783/comments/17
, ive managed to hit the bug. Attached is output from a failed boot and
successful boot.


** Attachment added: 'udevadm monitor -e' log of a failed boot
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+attachment/2514534/+files/udev.failed

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Adam Gandelman
** Attachment added: 'udevadm monitor -e log of a successful boot
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+attachment/2514537/+files/udev.success

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Steve Langasek
analysis of the two logs shows the following events missing from udev in
the failed case.

UDEV  [4.229209] add  
/devices/pci:00/:00:01.1/host2/target2:0:1/2:0:1:0/block/sdc (block)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/host2/target2:0:1/2:0:1:0/block/sdc
SUBSYSTEM=block
DEVNAME=/dev/sdc
DEVTYPE=disk
SEQNUM=770
ID_ATA=1
ID_TYPE=disk
ID_BUS=ata
ID_MODEL=QEMU_HARDDISK
ID_MODEL_ENC=QEMU\x20HARDDISK\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20\x20
ID_REVISION=0.14.0
ID_SERIAL=QEMU_HARDDISK_QM4
ID_SERIAL_SHORT=QM4
ID_ATA_WRITE_CACHE=1
ID_ATA_WRITE_CACHE_ENABLED=0
ID_ATA_FEATURE_SET_SMART=1
ID_ATA_FEATURE_SET_SMART_ENABLED=1
ID_SCSI_COMPAT=SATA_QEMU_HARDDISK_QM4
ID_PATH=pci-:00:01.1-scsi-1:0:1:0
ID_PATH_TAG=pci-_00_01_1-scsi-1_0_1_0
ID_FS_UUID=4qMaMd-jD2v-vzpj-ScTe-Rsz7-CDw0-3DCuLn
ID_FS_UUID_ENC=4qMaMd-jD2v-vzpj-ScTe-Rsz7-CDw0-3DCuLn
ID_FS_VERSION=LVM2\x20001
ID_FS_TYPE=LVM2_member
ID_FS_USAGE=raid
MAJOR=8
MINOR=32
DEVLINKS=/dev/disk/by-id/ata-QEMU_HARDDISK_QM4 
/dev/disk/by-id/scsi-SATA_QEMU_HARDDISK_QM4 
/dev/disk/by-path/pci-:00:01.1-scsi-1:0:1:0

UDEV  [4.229522] add  /devices/pci:00/:00:01.1 (pci)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1
SUBSYSTEM=pci
DRIVER=ata_piix
PCI_CLASS=10180
PCI_ID=8086:7010
PCI_SUBSYS_ID=1AF4:1100
PCI_SLOT_NAME=:00:01.1
MODALIAS=pci:v8086d7010sv1AF4sd1100bc01sc01i80
SEQNUM=851

UDEV  [4.230831] add  /devices/pci:00/:00:01.1/ata1/ata_port/ata1 
(ata_port)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata1/ata_port/ata1
SUBSYSTEM=ata_port
SEQNUM=852

UDEV  [4.230937] add  
/devices/pci:00/:00:01.1/ata1/link1/ata_link/link1 (ata_link)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata1/link1/ata_link/link1
SUBSYSTEM=ata_link
SEQNUM=853

UDEV  [4.229854] add  
/devices/pci:00/:00:01.1/ata1/link1/dev1.0/ata_device/dev1.0 
(ata_device)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata1/link1/dev1.0/ata_device/dev1.0
SUBSYSTEM=ata_device
SEQNUM=854

UDEV  [4.229959] add  
/devices/pci:00/:00:01.1/ata1/link1/dev1.1/ata_device/dev1.1 
(ata_device)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata1/link1/dev1.1/ata_device/dev1.1
SUBSYSTEM=ata_device
SEQNUM=855

UDEV  [4.230742] add  /devices/pci:00/:00:01.1/ata2/ata_port/ata2 
(ata_port)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata2/ata_port/ata2
SUBSYSTEM=ata_port
SEQNUM=856

UDEV  [4.230209] add  /devices/pci:00/:00:01.1/ata2/link2/ata_link/l
ink2 (ata_link)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata2/link2/ata_link/link2
SUBSYSTEM=ata_link
SEQNUM=857

UDEV  [4.230318] add  /devices/pci:00/:00:01.1/ata2/link2/dev2.0/ata
_device/dev2.0 (ata_device)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata2/link2/dev2.0/ata_device/dev2.0
SUBSYSTEM=ata_device
SEQNUM=858

UDEV  [4.230428] add  
/devices/pci:00/:00:01.1/ata2/link2/dev2.1/ata_device/dev2.1 
(ata_device)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/ata2/link2/dev2.1/ata_device/dev2.1
SUBSYSTEM=ata_device
SEQNUM=859

UDEV  [4.230531] add  /devices/pci:00/:00:01.1/host1 (scsi)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/host1
SUBSYSTEM=scsi
DEVTYPE=scsi_host
SEQNUM=860

UDEV  [4.230638] add  /devices/pci:00/:00:01.1/host2 (scsi)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/host2
SUBSYSTEM=scsi
DEVTYPE=scsi_host
SEQNUM=868

UDEV  [4.231230] add  
/devices/pci:00/:00:01.1/host2/scsi_host/host2 (scsi_host)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/host2/scsi_host/host2
SUBSYSTEM=scsi_host
SEQNUM=869

UDEV  [4.231140] add  /devices/pci:00/:00:01.1/host2/target2:0:1 
(scsi)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/pci:00/:00:01.1/host2/target2:0:1
SUBSYSTEM=scsi
DEVTYPE=scsi_target
SEQNUM=870

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Steve Langasek
The corresponding kernel events *are* present, so this seems to be
definitively a udev bug and not a kernel bug.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Steve Langasek
The last udev event shown in the failure case is this one:

UDEV  [3.735566] change   /devices/virtual/block/dm-0 (block)
UDEV_LOG=3
ACTION=change
DEVPATH=/devices/virtual/block/dm-0
SUBSYSTEM=block
DM_COOKIE=4228816
DEVNAME=/dev/dm-0
DEVTYPE=disk
SEQNUM=1092
DM_UDEV_PRIMARY_SOURCE_FLAG=1
DM_NAME=test--vg-test--lv
DM_UUID=LVM-4iWtHXttRCwYKe6cxdfiFUSSU9pFQ9Zelj82pgIWg6GreSTESCmwv43yf6Nrf5s2
DM_SUSPENDED=0
DM_UDEV_RULES=1
DM_VG_NAME=test-vg
DM_LV_NAME=test-lv
DEVLINKS=/dev/mapper/test--vg-test--lv 
/dev/disk/by-id/dm-name-test--vg-test--lv 
/dev/disk/by-id/dm-uuid-LVM-4iWtHXttRCwYKe6cxdfiFUSSU9pFQ9Zelj82pgIWg6GreSTESCmwv43yf6Nrf5s2
 /dev/test-vg/test-lv
MAJOR=252
MINOR=0

So that points to an lvm problem in this case, at least.

I also notice a difference in this event - DM_SUSPENDED=0 vs. =1:

UDEV  [3.709786] add  /devices/virtual/block/dm-0 (block)
UDEV_LOG=3
ACTION=add
DEVPATH=/devices/virtual/block/dm-0
SUBSYSTEM=block
DEVNAME=/dev/dm-0
DEVTYPE=disk
SEQNUM=1091
DM_SUSPENDED=0
DM_UDEV_RULES=1
MAJOR=252
MINOR=0
DEVLINKS=/dev/disk/by-id/dm-name-

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Steve Langasek
** Changed in: linux (Ubuntu Oneiric)
   Status: Incomplete = Invalid

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Steve Langasek
Looking at bugs on lvm2 turns up this gem in bug #802626:

 Just a wild speculation, because I haven't yet digged into the interactions 
 between kernel
 and udevd, but the semaphore decrementation event might be lost when 
 transitioning from
 the initrd-udevd to the rootfs-udevd.

In cases where people are seeing initrd problems and lvm is in use, are
there always LVM VGs *other* than for the root fs?  If there's a VG
that's not a dependency of the rootfs, then indeed the vgchange call can
be racing the main initramfs sequence.

Twiddling the lvm2 package to insert a delay may help with reproducing
the issue.

None of this explains any issues with device nodes being absent from
/dev on the root filesystem when lvm is *not* in use.  The original bug
report did not mention LVM at all.  Can someone comment on the disk
configuration used there?  Maybe we need to split this into two bugs.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Serge Hallyn
@Steve,

see comment #42 for one disk layout that reproduces it for me in a kvm
VM.

(the vm was created with 'vm-new oneiric amd64 clean' - well,
technically with a customized vm-new using the mini iso which hasn't yet
been merged into ubuntu-qa-tools)

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-05 Thread Serge Hallyn
@Steve,

also see bug 833891 as a udev bug specifically for the LVM case.  In
fact, Eduard toward the end speculated precisely the semaphore as a
cause as you just did.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread Serge Hallyn
@JamesHunt,

you mention LVs in comment #34, but assuming this is the same bug
causing my hangs and read-only rootfs on VMs, it does not require LVs.
Unfortunately, like you, whenever I've instrumented grub to print out
the list of pending events, I can't reproduce it :)

Perhaps kgdb is the way to go here.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread Serge Hallyn
@JamesHunt,

have you pursued trying to reproduce with a version of udevd which
continues to process events when udev_exit==1?

I'm unclear as to whether (1) we need to continue to process inotify
events as well (so that udev workers don't get hung), and (2) whether
that just means that kobject_uevent messages from the kernel will still
get lost between the exit of the initramfs' udevd and the rootfs one.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread Adam Gandelman
@James: Im currently at a conference with limited wifi, but I checked
yesterday and I can consistently reproduce on my thinkpad + kvm.  Any
chance you can publish those modded udev packages to a branch or PPA?
I'm happy to test and see if I can get anything useful

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread James Hunt
@Adam: Not quite yet, but will work on that when I get a chance tomorrow
if possible. I'm intrigued by your setup though as I have a thinkpad +
kvm but only see the problem very infrequently (and I've tried setting
up images as you specify). How exactly are you invoking kvm? To clarify,
are you using the server image?

@Serge: what was your storage configuration when you saw the problem
without LVs? I think I've seen the problem a couple of times simply by
providing 2 extra raw disks to the system but at that point my udevd
debug wasn't helpful. I have tried modifying udevd to continue
processing events but it then hangs. I really need to be able to
recreate the problem more reliably on a system I've got control over
before fathoming this one out.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread Adam Gandelman
Hi James-

I'm using the server image, kvm+libvirt for the VM, here is the
corresponding XML config for the VM.  http://paste.ubuntu.com/702405/
Should note that the root disk is a qcow2 image, the two additional
images are raw dd'd images, each 100MB.  I've also been sure to provide
2 CPUs to the VM, tho I'm not certain this affects reproducibility.

The corresponding KVM process that gets spawned looks like this:
http://paste.ubuntu.com/702411

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread Serge Hallyn
Quoting James Hunt (818...@bugs.launchpad.net):
 @Serge: what was your storage configuration when you saw the problem
 without LVs? I think I've seen the problem a couple of times simply by
 providing 2 extra raw disks to the system but at that point my udevd
 debug wasn't helpful. I have tried modifying udevd to continue
 processing events but it then hangs. I really need to be able to
 recreate the problem more reliably on a system I've got control over
 before fathoming this one out.

I have simple vms, created with the vm-new from lp:ubuntu-qa-tools,
whose root disk looks like:

   Device Boot  Start End  Blocks   Id  System
   /dev/vda1   *204815728639 7863296   83  Linux
   /dev/vda21573068616775167  5222415  Extended
   /dev/vda51573068816775167  522240   82  Linux swap / Solaris

with vda1 being ext3.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


Re: [Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread Serge Hallyn
Quoting James Hunt (818...@bugs.launchpad.net):
 @Serge: what was your storage configuration when you saw the problem
 without LVs? I think I've seen the problem a couple of times simply by
 providing 2 extra raw disks to the system but at that point my udevd
 debug wasn't helpful. I have tried modifying udevd to continue
 processing events but it then hangs. I really need to be able to
 recreate the problem more reliably on a system I've got control over
 before fathoming this one out.

Here is the xml for that domain, for what it's worth.  disk0.img is
raw format:

serge@sergelap:~/vms/kvm/c$ qemu-img info disk0.img 
image: disk0.img
file format: raw
virtual size: 8.0G (8589934592 bytes)
disk size: 3.3G

serge@sergelap:~/vms/kvm/c$ virsh dumpxml c
domain type='kvm' id='2'
  namec/name
  uuid15405141-d078-c266-ae6f-8ca6b72a08e8/uuid
  memory524288/memory
  currentMemory524288/currentMemory
  vcpu1/vcpu
  os
type arch='x86_64' machine='pc-0.14'hvm/type
boot dev='hd'/
  /os
  features
acpi/
apic/
pae/
  /features
  clock offset='utc'/
  on_poweroffdestroy/on_poweroff
  on_rebootrestart/on_reboot
  on_crashrestart/on_crash
  devices
emulator/usr/bin/kvm/emulator
disk type='file' device='disk'
  driver name='qemu' type='raw'/
  source file='/home/serge/vms/kvm/c/disk0.img'/
  target dev='vda' bus='virtio'/
  alias name='virtio-disk0'/
  address type='pci' domain='0x' bus='0x00' slot='0x04' 
function='0x0'/
/disk
disk type='block' device='cdrom'
  driver name='qemu' type='raw'/
  target dev='hdc' bus='ide'/
  readonly/
  alias name='ide0-1-0'/
  address type='drive' controller='0' bus='1' unit='0'/
/disk
controller type='ide' index='0'
  alias name='ide0'/
  address type='pci' domain='0x' bus='0x00' slot='0x01' 
function='0x1'/
/controller
interface type='network'
  mac address='00:16:3e:26:26:74'/
  source network='default'/
  target dev='vnet0'/
  model type='virtio'/
  alias name='net0'/
  address type='pci' domain='0x' bus='0x00' slot='0x03' 
function='0x0'/
/interface
serial type='pty'
  source path='/dev/pts/4'/
  target port='0'/
  alias name='serial0'/
/serial
console type='pty' tty='/dev/pts/4'
  source path='/dev/pts/4'/
  target type='serial' port='0'/
  alias name='serial0'/
/console
input type='mouse' bus='ps2'/
graphics type='vnc' port='5900' autoport='yes'/
video
  model type='vmvga' vram='9216' heads='1'/
  alias name='video0'/
  address type='pci' domain='0x' bus='0x00' slot='0x02' 
function='0x0'/
/video
memballoon model='virtio'
  alias name='balloon0'/
  address type='pci' domain='0x' bus='0x00' slot='0x05' 
function='0x0'/
/memballoon
  /devices
  seclabel type='dynamic' model='apparmor'
labellibvirt-15405141-d078-c266-ae6f-8ca6b72a08e8/label
imagelabellibvirt-15405141-d078-c266-ae6f-8ca6b72a08e8/imagelabel
  /seclabel
/domain


The host is a core i7 laptop with 8G ram and showing 8 cpus.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread James Hunt
@Dave: I've modded udevd to display some internal details, but cannot
now make the images I have fail to boot reliably. Currently working with
@jamespage who has a machine that fails to boot most times.

I've tried to force more frequent failures by installing with lots of
LVs, but that doesn't seem to help either on virtual h/w (kvm) or the
limited physical h/w I have access to. I've even gone as far as forcing
particular LVs to respond *extremely* slowly (multi-second response
time), but that hasn't helped trigger the problem either).

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread James Hunt
BTW - I've also looked at how Fedora 15 stops udev with dracut and using
Stefans terminology, they club it to death like we used to. They also
pepper the code with frequent calls to settle and add a few sub-second
sleeps here and there which feels horribly wrong IMHO.

I have an off-beat idea as to how we could establish if messages are
getting lost which I may be forced to pursue unless I can overcome the
Heisenberg nature of debugging udevd.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-04 Thread James Hunt
@Adam: from comment #18, do you still have an image that fails to boot 1
in 5 times?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-10-03 Thread Dave Walker
@James, How did you get on with the debug version of udevd?  When we
tried to do the same, we were unable to reproduce the bug, as the debug
statements seemed to slow down udev, hiding the race.  Was this the same
behaviour you encountered?

Thanks.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-09-29 Thread Steve Langasek
Andrew, I'm not sure you're experiencing the same issue; I would say in
fact that you have some unrelated kernel bug, since there's no excuse
for it taking 2 minutes to settle the kernel event queue.  Getting a
dump of 'udevadm monitor -e' from this initramfs (which would need to be
started *before* this point to capture the actual events) should be
instructive.  Could you please open a new bug report for this and link
it back here?

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-09-28 Thread James Hunt
It looks like udevd.c is rather aggressive when handling the exit
scenario. I'm currently building a debug version of udevd + initramfs to
try and see if and how messages are getting lost.

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-09-27 Thread Dave Walker
** Summary changed:

- HP DL380G5 root disk mounted read-only on boot and boot fails
+ boot failures caused by udev race

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs


[Bug 818177] Re: boot failures caused by udev race

2011-09-27 Thread Andrew Glen-Young
I have a few machines throwing a kernel panic while netbooting oneiric
with a similar error message.

I have attached the boot message log with the panic.


** Attachment added: udev-race.log
   
https://bugs.launchpad.net/ubuntu/+source/udev/+bug/818177/+attachment/2471068/+files/udev-race.log

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/818177

Title:
  boot failures caused by udev race

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/818177/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs