[OMPI users] Error in file base/plm_base_launch_support.c: OPAL_HWLOC_TOPO

2018-07-21 Thread Brian Smith
Greetings, I'm having trouble getting openmpi 2.1.2 to work when launching a process from debian 8 on a remote debian 9 host. To keep things simple in this example, I'm just launching date on the remote host. deb8host$ mpirun -H deb9host date [deb8host:01552] [[32763,0],0] ORTE_ERROR_LOG: Error i

Re: [OMPI users] Error in file base/plm_base_launch_support.c: OPAL_HWLOC_TOPO

2018-07-21 Thread r...@open-mpi.org
More than likely the problem is the difference in hwloc versions - sounds like the topology to/from xml is different between the two versions, and the older one doesn’t understand the new one. > On Jul 21, 2018, at 12:04 PM, Brian Smith > wrote: > > Greetings, > > I'm having trouble getting

Re: [OMPI users] Error in file base/plm_base_launch_support.c: OPAL_HWLOC_TOPO

2018-07-21 Thread Gilles Gouaillardet
Brian, As Ralph already stated, this is likely a hwloc API issue. From debian9, you can lstopo --of xml | ssh debian8 lstopo --if xml -i - that will likely confirm the API error. If you are willing to get a bit more details, you can add some printf in opal_hwloc_unpack (from opal/mca/hwloc/base/