Just to close my end of this loop: as of trunk r29213, it all works for me. Thanks!
On Sep 18, 2013, at 12:52 PM, Ralph Castain <[email protected]> wrote: > Thanks George - much appreciated > > On Sep 18, 2013, at 9:49 AM, George Bosilca <[email protected]> wrote: > >> The test case was broken. I just pushed a fix. >> >> George. >> >> On Sep 18, 2013, at 16:49 , Ralph Castain <[email protected]> wrote: >> >>> Hangs with any np > 1 >>> >>> However, I'm not sure if that's an issue with the test vs the underlying >>> implementation >>> >>> On Sep 18, 2013, at 7:40 AM, "Jeff Squyres (jsquyres)" <[email protected]> >>> wrote: >>> >>>> Does it hang when you run with -np 4? >>>> >>>> Sent from my phone. No type good. >>>> >>>> On Sep 18, 2013, at 4:10 PM, "Ralph Castain" <[email protected]> wrote: >>>> >>>>> Strange - it works fine for me on my Mac. However, I see one difference - >>>>> I only run it with np=1 >>>>> >>>>> On Sep 18, 2013, at 2:22 AM, Jeff Squyres (jsquyres) <[email protected]> >>>>> wrote: >>>>> >>>>>> On Sep 18, 2013, at 9:33 AM, George Bosilca <[email protected]> wrote: >>>>>> >>>>>>> 1. sm doesn't work between spawned processes. So you must have another >>>>>>> network enabled. >>>>>> >>>>>> I know :-). I have tcp available as well (OMPI will abort if you only >>>>>> run with sm,self because the comm_spawn will fail with unreachable >>>>>> errors -- I just tested/proved this to myself). >>>>>> >>>>>>> 2. Don't use the test case attached to my email, I left an xterm based >>>>>>> spawn and the debugging. It can't work without xterm support. Instead >>>>>>> try using the test case from the trunk, the one committed by Ralph. >>>>>> >>>>>> I didn't see any "xterm" strings in there, but ok. :-) I ran with >>>>>> orte/test/mpi/intercomm_create.c, and that hangs for me as well: >>>>>> >>>>>> ----- >>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>> [rank 4] >>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>> [rank 5] >>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>> [rank 6] >>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>> [rank 7] >>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>>> 4] >>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>>> 5] >>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>>> 6] >>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>>> 7] >>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>> [hang] >>>>>> ----- >>>>>> >>>>>> Similarly, on my Mac, it hangs with no output: >>>>>> >>>>>> ----- >>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>>> [hang] >>>>>> ----- >>>>>> >>>>>>> George. >>>>>>> >>>>>>> On Sep 18, 2013, at 07:53 , "Jeff Squyres (jsquyres)" >>>>>>> <[email protected]> wrote: >>>>>>> >>>>>>>> George -- >>>>>>>> >>>>>>>> When I build the SVN trunk (r29201) on 64 bit linux, your attached >>>>>>>> test case hangs: >>>>>>>> >>>>>>>> ----- >>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>>> [rank 4] >>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>>> [rank 5] >>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>>> [rank 6] >>>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>>> [rank 7] >>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>>> [rank 4] >>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>>> [rank 5] >>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>>> [rank 6] >>>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>>> [rank 7] >>>>>>>> [hang] >>>>>>>> ----- >>>>>>>> >>>>>>>> On my Mac, it hangs without printing anything: >>>>>>>> >>>>>>>> ----- >>>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>>>>> [hang] >>>>>>>> ----- >>>>>>>> >>>>>>>> >>>>>>>> On Sep 18, 2013, at 1:48 AM, George Bosilca <[email protected]> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Here is a quick (and definitively not the cleanest) patch that >>>>>>>>> addresses the MPI_Intercomm issue at the MPI level. It should be >>>>>>>>> applied after removal of 29166. >>>>>>>>> >>>>>>>>> I also added the corrected test case stressing the corner cases by >>>>>>>>> doing barriers at every inter-comm creation and doing a clean >>>>>>>>> disconnect. >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Jeff Squyres >>>>>>>> [email protected] >>>>>>>> For corporate legal information go to: >>>>>>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> devel mailing list >>>>>>>> [email protected] >>>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>>>>> >>>>>>> _______________________________________________ >>>>>>> devel mailing list >>>>>>> [email protected] >>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>>>> >>>>>> >>>>>> -- >>>>>> Jeff Squyres >>>>>> [email protected] >>>>>> For corporate legal information go to: >>>>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>>>> >>>>>> _______________________________________________ >>>>>> devel mailing list >>>>>> [email protected] >>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>>> >>>>> _______________________________________________ >>>>> devel mailing list >>>>> [email protected] >>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>> _______________________________________________ >>>> devel mailing list >>>> [email protected] >>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>> >>> _______________________________________________ >>> devel mailing list >>> [email protected] >>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >> >> _______________________________________________ >> devel mailing list >> [email protected] >> http://www.open-mpi.org/mailman/listinfo.cgi/devel > > _______________________________________________ > devel mailing list > [email protected] > http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Jeff Squyres [email protected] For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/
