----- Original Message -----
> From: "Dan Kenigsberg" <dan...@redhat.com>
> To: from...@redhat.com, smizr...@redhat.com, ybron...@redhat.com
> Cc: ee...@redhat.com, aba...@redhat.com, vdsm-patches@lists.fedorahosted.org, 
> dc...@redhat.com, lara...@redhat.com,
> fsimo...@redhat.com
> Sent: Friday, May 30, 2014 3:00:26 PM
> Subject: Re: [oVirt Jenkins] vdsm_master_unit_tests - Build # 3043 - Failure!
> 
> Francesco, Yaniv,
> 
> Have you found the time to dig into the coredump
> of these Segmentation faults?
> 
> (It is unrelated to Liron's new patches)

No much progress since last report. I'm very annoyed by this issue but I'm 
having hard time
wrapping my head around it.

Let me summarize what I currently know:

* the segfault should be reproduceable on any box running nose >= 1.3.0, just 
using
$ cd vdsm
$ ./configure && make
$ NOSE_WITH_XUNIT=1 make check
or at least I can reproduce the issue on all the boxes I tried locally (vanilla 
F20, F19)

* if we run each testunit separately, we do NOT observe the failure.
This triggers the segfault:
$ cd tests
$ ./run_tests_local.sh ./*.py

This does not:
$ cd tests
$ for TEST in `ls ./*.py`; do ./run_tests_local.sh $TEST; done

* the stack traces I observed are huge, more than 750 levels deep.
This suggests the stack exausted, and this in turn probably triggered by some 
kind of recursion
gone wild. Note the offending stack trace is just on one thread; all the others 
are quiet.

* I tried to reproduce the issue with a simpler use case with no luck so far.

At the moment I don't have better suggestions than bite the bullet
and dig in the huge stack trace looking for repetitive patterns or some sort of 
hint.

Suggestions welcome!

-- 
Francesco Romani
RedHat Engineering Virtualization R & D
Phone: 8261328
IRC: fromani
_______________________________________________
vdsm-patches mailing list
vdsm-patches@lists.fedorahosted.org
https://lists.fedorahosted.org/mailman/listinfo/vdsm-patches

Reply via email to