Does it hang when you run with -np 4? Sent from my phone. No type good.
On Sep 18, 2013, at 4:10 PM, "Ralph Castain" <r...@open-mpi.org> wrote: > Strange - it works fine for me on my Mac. However, I see one difference - I > only run it with np=1 > > On Sep 18, 2013, at 2:22 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com> > wrote: > >> On Sep 18, 2013, at 9:33 AM, George Bosilca <bosi...@icl.utk.edu> wrote: >> >>> 1. sm doesn't work between spawned processes. So you must have another >>> network enabled. >> >> I know :-). I have tcp available as well (OMPI will abort if you only run >> with sm,self because the comm_spawn will fail with unreachable errors -- I >> just tested/proved this to myself). >> >>> 2. Don't use the test case attached to my email, I left an xterm based >>> spawn and the debugging. It can't work without xterm support. Instead try >>> using the test case from the trunk, the one committed by Ralph. >> >> I didn't see any "xterm" strings in there, but ok. :-) I ran with >> orte/test/mpi/intercomm_create.c, and that hangs for me as well: >> >> ----- >> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >> ❯❯❯ mpirun -np 4 intercomm_create >> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank >> 4] >> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank >> 5] >> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank >> 6] >> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) [rank >> 7] >> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 4] >> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 5] >> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 6] >> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 7] >> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >> [hang] >> ----- >> >> Similarly, on my Mac, it hangs with no output: >> >> ----- >> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >> ❯❯❯ mpirun -np 4 intercomm_create >> [hang] >> ----- >> >>> George. >>> >>> On Sep 18, 2013, at 07:53 , "Jeff Squyres (jsquyres)" <jsquy...@cisco.com> >>> wrote: >>> >>>> George -- >>>> >>>> When I build the SVN trunk (r29201) on 64 bit linux, your attached test >>>> case hangs: >>>> >>>> ----- >>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>> ❯❯❯ mpirun -np 4 intercomm_create >>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>> [rank 4] >>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>> [rank 5] >>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>> [rank 6] >>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>> [rank 7] >>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 4] >>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 5] >>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 6] >>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank 7] >>>> [hang] >>>> ----- >>>> >>>> On my Mac, it hangs without printing anything: >>>> >>>> ----- >>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>> ❯❯❯ mpirun -np 4 intercomm_create >>>> [hang] >>>> ----- >>>> >>>> >>>> On Sep 18, 2013, at 1:48 AM, George Bosilca <bosi...@icl.utk.edu> wrote: >>>> >>>>> Here is a quick (and definitively not the cleanest) patch that addresses >>>>> the MPI_Intercomm issue at the MPI level. It should be applied after >>>>> removal of 29166. >>>>> >>>>> I also added the corrected test case stressing the corner cases by doing >>>>> barriers at every inter-comm creation and doing a clean disconnect. >>>> >>>> >>>> -- >>>> Jeff Squyres >>>> jsquy...@cisco.com >>>> For corporate legal information go to: >>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>> >>>> _______________________________________________ >>>> devel mailing list >>>> de...@open-mpi.org >>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>> >>> _______________________________________________ >>> devel mailing list >>> de...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >> >> >> -- >> Jeff Squyres >> jsquy...@cisco.com >> For corporate legal information go to: >> http://www.cisco.com/web/about/doing_business/legal/cri/ >> >> _______________________________________________ >> devel mailing list >> de...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/devel > > _______________________________________________ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel