Re: [O-MPI devel] MPI_Barrier in Netpipe causes segfault

2005-10-27 Thread Jeff Squyres
On Oct 27, 2005, at 7:12 PM, Brian Barrett wrote:
> I think we just committed a fix for this one on both the trunk and v1.0 branch. Will be in the tarballs tomorrow morning.
Minor clarification just so that everyone is on the same sheet of music -- these fixes will be in the nightly tarballs t

Re: [O-MPI devel] MPI_Barrier in Netpipe causes segfault

2005-10-27 Thread Brian Barrett
On Oct 24, 2005, at 10:21 PM, Troy Benjegerdes wrote:
> On Mon, Oct 24, 2005 at 06:03:02PM -0500, Troy Benjegerdes wrote:
>> troy@opteron1:/usr/src/netpipe3-dev$ mpirun -np 2 -mca btl_base_exclude openib NPmpi
>> 1: opteron1
>> 0: opteron1
>> mpirun noticed that job rank 1 with PID 352 on node "localhost"

Re: [O-MPI devel] MPI_Barrier in Netpipe causes segfault

2005-10-25 Thread Jeff Squyres
Troy -- We've managed to replicate this problem and are looking into it. Thanks for reporting it!
Troy Benjegerdes wrote:
> On Mon, Oct 24, 2005 at 06:03:02PM -0500, Troy Benjegerdes wrote:
>> troy@opteron1:/usr/src/netpipe3-dev$ mpirun -np 2 -mca btl_base_exclude openib NPmpi
>> 1: opteron1
>> 0: o

Re: [O-MPI devel] MPI_Barrier in Netpipe causes segfault

2005-10-25 Thread Jeff Squyres
I'm assuming that this is a production version of NP, right? (i.e., not a development version) Can you run the MPI processes through valgrind to see where the error really occurs? This corefile only shows the final results, not the actual cause.
Troy Benjegerdes wrote:
> On Mon, Oct 24, 20

[O-MPI devel] MPI_Barrier in Netpipe causes segfault

2005-10-24 Thread Troy Benjegerdes
On Mon, Oct 24, 2005 at 06:03:02PM -0500, Troy Benjegerdes wrote:
> troy@opteron1:/usr/src/netpipe3-dev$ mpirun -np 2 -mca btl_base_exclude openib NPmpi
> 1: opteron1
> 0: opteron1
> mpirun noticed that job rank 1 with PID 352 on node "localhost" exited on signal 11.
> 1 process killed (possibl