Hi Brice,

> When you assemble multiple nodes' topologies into a single one, the
> resulting topology cannot be used for binding. Binding is only
> possible when using objects/cpusets that correspond to the current
> node.

Ah, that explains it.

> Open-MPI does not support these cases, hence the crash. I see that
> individual XMLs worked fine. So why did you try this?

I was trying to figure out a way to fix buggy BIOS topologies for all
nodes from the server side. With our current setup, all the nodes are
diskless, they boot up an identical image from NFS and I don't know how
to tell each one what XML file to use. The reason I know that the XML
file fixes the problem is by logging onto that node, exporting
HWLOC_XMLFILE and running mpirun from there. Any suggestions on how to
do that system-wide?

Thanks,
Andrej

Reply via email to