Re: [OMPI devel] HWLOC issue

2015-09-14 Thread George Bosilca
Brice, I confirm your patch solves the issue I reported earlier for OMPI. I did not try it on a standalone HWLOC, so I am not sure that it maintains the coherency of the output. If you want, I can give it a try. Thanks, George. On Thu, Sep 10, 2015 at 6:08 PM, Brice Goglin wrote: > Try this

Re: [OMPI devel] HWLOC issue

2015-09-10 Thread Brice Goglin
Try this patch (it applies to hwloc v1.9-v1.11, and it should be OK against OMPI's tree). Your bridge 22:00.0 says it contains the master bus 00. This causes a cycle in hwloc's insert algorithm, which is caught by the assertion. The patch just removes this invalid bridge entirely. Brice On 10/09/2015 21:23, G
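
To illustrate the idea described above, here is a minimal sketch of filtering out a bridge whose reported bus range cannot be valid. It is not the actual hwloc patch: the `Bridge` type, the `is_valid` rule (the buses behind a bridge must be numbered higher than the bus the bridge sits on), and the sample bus numbers are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bridge:
    # Hypothetical model of a PCI bridge, not an hwloc structure.
    bus: int          # bus the bridge device itself sits on
    secondary: int    # first bus number behind the bridge
    subordinate: int  # last bus number behind the bridge


def is_valid(bridge: Bridge) -> bool:
    # Assumed sanity rule: buses behind a bridge are numbered above the
    # bridge's own bus, so the range can never reach back to bus 0 or to
    # the bridge's own bus. A range that does would make the bridge an
    # ancestor of itself when building the tree.
    return bridge.bus < bridge.secondary <= bridge.subordinate


def drop_invalid_bridges(bridges):
    # Remove invalid bridges entirely instead of trying to repair them.
    return [b for b in bridges if is_valid(b)]


bridges = [
    Bridge(bus=0x00, secondary=0x01, subordinate=0x21),
    # Made-up values modeling a bridge on bus 22 that claims to contain bus 00.
    Bridge(bus=0x22, secondary=0x00, subordinate=0x00),
]
kept = drop_invalid_bridges(bridges)
```

With these made-up values, only the bridge on bus 00 is kept, so a tree built from `kept` cannot contain the cycle.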

Re: [OMPI devel] HWLOC issue

2015-09-10 Thread Brice Goglin
I guess it could be some invalid bus information in PCI bridges. Maybe try to shut down the node completely and restart it. I've seen other strange PCI issues disappear like this in the past... Otherwise, please send the tarball generated by "hwloc-gather-topology --io foo". Send it only to me, it

Re: [OMPI devel] HWLOC issue

2015-09-10 Thread George Bosilca
It used to work. I don't know exactly when I last updated the trunk version on the cluster, but it was no more than 10 days ago. lstopo fails with the same assert. Interestingly enough, the same binary succeeds on the other nodes of the same cluster ... George. On Thu, Sep 10, 2015 at 3:20 P

Re: [OMPI devel] HWLOC issue

2015-09-10 Thread Brice Goglin
Did it work on the same machine before? Or did OMPI enable hwloc's PCI discovery recently? Does lstopo fail with the same complaint? Brice On 10/09/2015 21:10, George Bosilca wrote: > With the current trunk version I keep getting an assert deep down in > orted. > > orted: > ../../../../../../../ompi/op