Just realized my email wasn't sent to the archive.

On Sat, Jun 23, 2018 at 5:34 PM, carlos aguni <aguni...@gmail.com> wrote:

> Hi!
>
> Thank you all for your reply Jeff, Gilles and rhc.
>
> Thank you Jeff and rhc for clarifying to me some of the openmpi's
> internals.
>
> >> FWIW: we never send interface names to other hosts - just dot addresses
> > Should have clarified - when you specify an interface name for the MCA
> param, then it is the interface name that is transferred as that is the
> value of the MCA param. However, once we determine our address, we only
> transfer dot addresses between ourselves
>
> If only dot addresses are sent to the hosts then why doesn't openmpi use
> the default route like `ip route get <other host IP>` instead of choosing a
> random one? Is it an expected behaviour? Can it be changed?
>
> Sorry. As Gilles pointed out I forgot to mention which openmpi version I
> was using. I'm using openmpi 3.0.0 gcc 7.3.0 from openhpc. Centos 7.5.
>
> > mpirun—mca oob_tcp_if_exclude 192.168.100.0/24 ...
>
> I cannot just exclude that interface cause after that I want to add
> another computer that's on a different network. And this is where things
> get messy :( I cannot just include and exclude networks cause I have
> different machines on different networks.
> This is what I want to achieve:
>
>
> compute01
>
> compute02
>
> compute03
>
> ens3
>
> 192.168.100.104/24
>
> 10.0.0.227/24
>
> 192.168.100.105/24
>
> ens8
>
> 10.0.0.228/24
>
> 172.21.1.128/24
>
> ---
>
> ens9
>
> 172.21.1.155/24
>
> ---
>
> ---
>
> So I'm in compute01 MPI_spawning another process on compute02 and
> compute03.
> With both MPI_Spawn and `mpirun -n 3 -host compute01,compute02,compute03
> hostname`
>
> Then when I include the mca parameters I get this:
> `mpirun --oversubscribe --allow-run-as-root -n 3 --mca oob_tcp_if_include
> 10.0.0.0/24,192.168.100.0/24 -host compute01,compute02,compute03 hostname`
> WARNING: An invalid value was given for oob_tcp_if_include.  This value
> will be ignored.
> ...
> Message:    Did not find interface matching this subnet
>
> This would all work if it were to use the system's internals like `ip
> route`.
>
> Best regards,
> Carlos.
>
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Reply via email to