[OMPI users] mca_oob_tcp_accept: accept() failed with errno 24

2006-10-13 Thread Adam Moody
Hello, I'm trying to run a 500 node job using mpirun / slurm with OpenMPI-1.1.1 and see the following errors at startup: [rhea342:09444] [0,1,318]-[0,0,0] mca_oob_tcp_peer_recv_blocking: recv() failed with errno=104 [rhea32:13463] mca_oob_tcp_accept: accept() failed with errno 24. [rhea32:134

[OMPI users] problem building static w/ 1.1.2_rc4

2006-10-13 Thread Josh England
Hi, I'm trying to statically compile a simple program, but have run into some difficulties so far. Trying with my own link line I get many complaints about symbols being redefined: [root@dlogin2 examples]# gcc -static -o cpi cpi.c -I/apps/x86_64/mpi/openmpi/gcc-3.4.6/openmpi-1.1.2_rc4-ofed/inclu

Re: [OMPI users] problem building static w/ 1.1.2_rc4

2006-10-13 Thread Scott Weitzenkamp (sweitzen)
Shouldn't mpicc always link system libraries (like -ldl and -lsysfs) dynamically? Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cisco Systems > -Original Message- > From: users-boun...@open-mpi.org > [mailto:users-boun...@open-mpi.org] On Behalf Of Josh E

Re: [OMPI users] mca_oob_tcp_accept: accept() failed with errno 24

2006-10-13 Thread Aleph One
errno 24 means "Too many open files". Looks like you may be hitting the upper limit for the number of open file descriptors. Check /proc/sys/fs/file-max and see if you need to bump it up. Not sure if you need to bump up "ulimit -n", but worth a try. -Aleph On 10/14/06, Adam Moody wrote: Hello