Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-29 Thread George Bosilca
You can use tcp_if_include/tcp_if_exclude with address ranges instead of interface names. ompi_info --mca btl tcp gives you some hints: > MCA btl: parameter "btl_tcp_if_include" (current value: <none>, data source: default value) > Comm

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-18 Thread Krzysztof Zarzycki
We just discovered this ticket, which might describe the same problem that we have: https://svn.open-mpi.org/trac/ompi/ticket/1505 It seems to be unresolved... do you have a workaround for it? I've seen the "-mca opal_net_private_ipv4" parameter, but I don't know exactly how to use it... At least my e

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-17 Thread Grzegorz Maj
2010/11/11 Jeff Squyres: > On Nov 11, 2010, at 3:23 PM, Krzysztof Zarzycki wrote: > >> No, unfortunately specification of interfaces is a little more >> complicated... eth0/1/2 is not common for both machines. > > Can you define "common"? Do you mean that eth0 on one machine is on a > differen

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Jeff Squyres
On Nov 11, 2010, at 3:23 PM, Krzysztof Zarzycki wrote: > No, unfortunately specification of interfaces is a little more complicated... > eth0/1/2 is not common for both machines. Can you define "common"? Do you mean that eth0 on one machine is on a different network than eth0 on the other mac

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Krzysztof Zarzycki
No, unfortunately the specification of interfaces is a little more complicated... eth0/1/2 is not common to both machines. I've tried to play with (oob/btl)_tcp_if_include, but actually... I don't know exactly how. Anyway, do you have any ideas on how to debug the communication problem further? Chee

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Ralph Castain
There are two connections to be specified: -mca oob_tcp_if_include xxx -mca btl_tcp_if_include xxx On Nov 11, 2010, at 12:04 PM, Krzysztof Zarzycki wrote: > Hi, > I'm working with Grzegorz on the mentioned problem. > If I'm correct on checking the firewall settings, "iptables --list" shows an

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Jeff Squyres
On Nov 11, 2010, at 2:04 PM, Krzysztof Zarzycki wrote: > I'm working with Grzegorz on the mentioned problem. > If I'm correct on checking the firewall settings, "iptables --list" shows an > empty list of rules. > The second host does not have iptables installed at all. > > So what can be a next

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Krzysztof Zarzycki
Hi, I'm working with Grzegorz on the mentioned problem. If I'm checking the firewall settings correctly, "iptables --list" shows an empty list of rules. The second host does not have iptables installed at all. So what could be the next possible cause of this problem? By the way, how can I force mpirun to

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-11 Thread Jeff Squyres
I'd check the firewall settings. The stack trace indicates that the one host is trying to connect to the other (Open MPI initiates non-blocking TCP connections that can be polled on later). On Nov 10, 2010, at 12:46 PM, David Zhang wrote: > Have you double checked your firewall settings, TCP/

Re: [OMPI users] Problem with sending messages from one of the machines

2010-11-10 Thread David Zhang
Have you double-checked that your firewall settings, TCP/IP settings, and SSH keys are all set up correctly for all machines, including the host? On Wed, Nov 10, 2010 at 2:57 AM, Grzegorz Maj wrote: > Hi all, > I've got a problem with sending messages from one of my machines. It > appears during MPI_Se

[OMPI users] Problem with sending messages from one of the machines

2010-11-10 Thread Grzegorz Maj
Hi all, I've got a problem with sending messages from one of my machines. It appears during MPI_Send/MPI_Recv and MPI_Bcast. The simplest case I've found is two processes, rank 0 sending a simple message and rank 1 receiving this message. I execute these processes using mpirun with -np 2. - when bo