Thanks George - much appreciated On Sep 18, 2013, at 9:49 AM, George Bosilca <bosi...@icl.utk.edu> wrote:
> The test case was broken. I just pushed a fix. > > George. > > On Sep 18, 2013, at 16:49 , Ralph Castain <r...@open-mpi.org> wrote: > >> Hangs with any np > 1 >> >> However, I'm not sure if that's an issue with the test vs the underlying >> implementation >> >> On Sep 18, 2013, at 7:40 AM, "Jeff Squyres (jsquyres)" <jsquy...@cisco.com> >> wrote: >> >>> Does it hang when you run with -np 4? >>> >>> Sent from my phone. No type good. >>> >>> On Sep 18, 2013, at 4:10 PM, "Ralph Castain" <r...@open-mpi.org> wrote: >>> >>>> Strange - it works fine for me on my Mac. However, I see one difference - >>>> I only run it with np=1 >>>> >>>> On Sep 18, 2013, at 2:22 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com> >>>> wrote: >>>> >>>>> On Sep 18, 2013, at 9:33 AM, George Bosilca <bosi...@icl.utk.edu> wrote: >>>>> >>>>>> 1. sm doesn't work between spawned processes. So you must have another >>>>>> network enabled. >>>>> >>>>> I know :-). I have tcp available as well (OMPI will abort if you only >>>>> run with sm,self because the comm_spawn will fail with unreachable errors >>>>> -- I just tested/proved this to myself). >>>>> >>>>>> 2. Don't use the test case attached to my email, I left an xterm based >>>>>> spawn and the debugging. It can't work without xterm support. Instead >>>>>> try using the test case from the trunk, the one committed by Ralph. >>>>> >>>>> I didn't see any "xterm" strings in there, but ok. :-) I ran with >>>>> orte/test/mpi/intercomm_create.c, and that hangs for me as well: >>>>> >>>>> ----- >>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>> [rank 4] >>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>> [rank 5] >>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>> [rank 6] >>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>> [rank 7] >>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>> 4] >>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>> 5] >>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>> 6] >>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) [rank >>>>> 7] >>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>> [hang] >>>>> ----- >>>>> >>>>> Similarly, on my Mac, it hangs with no output: >>>>> >>>>> ----- >>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>> [hang] >>>>> ----- >>>>> >>>>>> George. >>>>>> >>>>>> On Sep 18, 2013, at 07:53 , "Jeff Squyres (jsquyres)" >>>>>> <jsquy...@cisco.com> wrote: >>>>>> >>>>>>> George -- >>>>>>> >>>>>>> When I build the SVN trunk (r29201) on 64 bit linux, your attached test >>>>>>> case hangs: >>>>>>> >>>>>>> ----- >>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>> [rank 4] >>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>> [rank 5] >>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>> [rank 6] >>>>>>> b: MPI_Intercomm_create( intra, 0, intra, MPI_COMM_NULL, 201, &inter) >>>>>>> [rank 7] >>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>> a: MPI_Intercomm_create( ab_intra, 0, ac_intra, 0, 201, &inter) (0) >>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>> [rank 4] >>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>> [rank 5] >>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>> [rank 6] >>>>>>> c: MPI_Intercomm_create( MPI_COMM_WORLD, 0, intra, 0, 201, &inter) >>>>>>> [rank 7] >>>>>>> [hang] >>>>>>> ----- >>>>>>> >>>>>>> On my Mac, it hangs without printing anything: >>>>>>> >>>>>>> ----- >>>>>>> ❯❯❯ mpicc intercomm_create.c -o intercomm_create >>>>>>> ❯❯❯ mpirun -np 4 intercomm_create >>>>>>> [hang] >>>>>>> ----- >>>>>>> >>>>>>> >>>>>>> On Sep 18, 2013, at 1:48 AM, George Bosilca <bosi...@icl.utk.edu> wrote: >>>>>>> >>>>>>>> Here is a quick (and definitively not the cleanest) patch that >>>>>>>> addresses the MPI_Intercomm issue at the MPI level. It should be >>>>>>>> applied after removal of 29166. >>>>>>>> >>>>>>>> I also added the corrected test case stressing the corner cases by >>>>>>>> doing barriers at every inter-comm creation and doing a clean >>>>>>>> disconnect. >>>>>>> >>>>>>> >>>>>>> -- >>>>>>> Jeff Squyres >>>>>>> jsquy...@cisco.com >>>>>>> For corporate legal information go to: >>>>>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>>>>> >>>>>>> _______________________________________________ >>>>>>> devel mailing list >>>>>>> de...@open-mpi.org >>>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>>>> >>>>>> _______________________________________________ >>>>>> devel mailing list >>>>>> de...@open-mpi.org >>>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>>> >>>>> >>>>> -- >>>>> Jeff Squyres >>>>> jsquy...@cisco.com >>>>> For corporate legal information go to: >>>>> http://www.cisco.com/web/about/doing_business/legal/cri/ >>>>> >>>>> _______________________________________________ >>>>> devel mailing list >>>>> de...@open-mpi.org >>>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>>> >>>> _______________________________________________ >>>> devel mailing list >>>> de...@open-mpi.org >>>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >>> _______________________________________________ >>> devel mailing list >>> de...@open-mpi.org >>> http://www.open-mpi.org/mailman/listinfo.cgi/devel >> >> _______________________________________________ >> devel mailing list >> de...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/devel > > _______________________________________________ > devel mailing list > de...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/devel