On Wed, Mar 18, 2020 at 11:39:37AM +0100, Dan Horák wrote:
> On Wed, 18 Mar 2020 09:50:24 +0000
> "Richard W.M. Jones" <rjo...@redhat.com> wrote:
> 
> > On Wed, Mar 18, 2020 at 10:46:19AM +0100, Dan Horák wrote:
> > > On Wed, 18 Mar 2020 09:34:45 +0000
> > > "Richard W.M. Jones" <rjo...@redhat.com> wrote:
> > > 
> > > > 
> > > > This might be a bug in the package itself, but has anyone seen
> > > > builds hanging in weird places, in Rawhide, especially on armv7
> > > > and s390x?
> > > > 
> > > > This packge build has hung 3 times in the same place, once on
> > > > armv7 and twice on s390x:
> > > > 
> > > > https://koji.fedoraproject.org/koji/taskinfo?taskID=42570766
> > > > 
> > > > It's hard to explain how it could hang at that place in the build
> > > > unless something fundamental is broken like make.
> > > 
> > > let me try the rebuild locally on s390x ...
> > 
> > Note that the build did succeed once on s390x (that was when it hung
> > on armv7 instead).  So it's not 100% reproducible.  Also if our theory
> > about tooling is correct then you would need all Rawhide packages.
> 
> it's a deadlock in the tests, not in make. Reproduced with
> "fedpkg local" in a cycle.
> 
> sharkcz  1649225  0.0  0.0 222288  3904 pts/5    S+   06:24   0:00 /bin/sh -e 
> /var/tmp/rpm-tmp.RXcMRr
> sharkcz  1649230  0.0  0.0  10372  3248 pts/5    S+   06:24   0:00 make -j4 
> check
> sharkcz  1658088  0.0  0.0 251236  3400 pts/5    Sl+  06:25   0:00 
> /home/sharkcz/nbdkit/nbdkit-1.19.3/server/nbdkit -v -P test-nbd-tls-psk.pid1 
> -U /tmp/tmp.7e7Gv5MPmZ --tls=require --tls-psk=keys.psk -- 
> /home/sharkcz/nbdkit/nbdkit-1.19.3/plugins/example1/.libs/nbdkit-example1-plugin.so
> sharkcz  1658091  0.0  0.1 192944  4464 pts/5    Sl+  06:25   0:00 
> /home/sharkcz/nbdkit/nbdkit-1.19.3/server/nbdkit -v -P test-nbd-tls-psk.pid2 
> -U /tmp/tmp.yp61yXx09y --tls=off -- 
> /home/sharkcz/nbdkit/nbdkit-1.19.3/plugins/nbd/.libs/nbdkit-nbd-plugin.so 
> tls=require tls-psk=keys.psk tls-username=qemu socket=/tmp/tmp.7e7Gv5MPmZ
> 
> the 2 nbdkit processes are stuck in the futex() syscall
> 
> Some years ago there was a kernel bug with the same symptoms. All
> arches were affected, but mostly visible on s390x and armv7.

In fact this happens on x86-64.  I was able to reproduce it
locally.  Investigating now.

Thanks,

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming and virtualization blog: http://rwmj.wordpress.com
Fedora Windows cross-compiler. Compile Windows programs, test, and
build Windows installers. Over 100 libraries supported.
http://fedoraproject.org/wiki/MinGW
_______________________________________________
devel mailing list -- devel@lists.fedoraproject.org
To unsubscribe send an email to devel-le...@lists.fedoraproject.org
Fedora Code of Conduct: 
https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org

Reply via email to