Sorry, my output listing was incomplete -- the program did run after the "No OpenFabrics" message, but (I presume) ran over Ethernet rather than InfiniBand. So I can't really say what was causing it to fail.

Andy

On 05/05/2016 06:09 PM, Nathan Hjelm wrote:
It should work fine with ob1 (the default). Did you determine what was
causing it to fail?

-Nathan

On Thu, May 05, 2016 at 06:04:55PM -0400, Andy Riebs wrote:
   For anyone like me who happens to google this in the future, the solution
   was to set OMPI_MCA_pml=yalla

   Many thanks Josh!

   On 05/05/2016 12:52 PM, Joshua Ladd wrote:

     We are working with Andy offline.

     Josh
     On Thu, May 5, 2016 at 7:32 AM, Andy Riebs <andy.ri...@hpe.com> wrote:

       I've built 1.10.2 with all my favorite configuration options, but I
       get messages such as this (one for each rank with
       orte_base_help_aggregate=0) when I try to run on a MOFED system:

       $ shmemrun -H hades02,hades03 $PWD/shmem.out
       --------------------------------------------------------------------------
       No OpenFabrics connection schemes reported that they were able to be
       used on a specific port.  As such, the openib BTL (OpenFabrics
       support) will be disabled for this port.

         Local host:           hades03
         Local device:         mlx4_0
         Local port:           2
         CPCs attempted:       rdmacm, udcm
       --------------------------------------------------------------------------

       My configure options:
       config_opts="--prefix=${INSTALL_DIR} \
               --without-mpi-param-check \
               --with-knem=/opt/mellanox/hpcx/knem \
               --with-mxm=/opt/mellanox/mxm  \
               --with-mxm-libdir=/opt/mellanox/mxm/lib \
               --with-fca=/opt/mellanox/fca \
               --with-pmi=${INSTALL_ROOT}/slurm \
               --without-psm --disable-dlopen \
               --disable-vt \
               --enable-orterun-prefix-by-default \
               --enable-debug-symbols"

       There aren't any obvious error messages in the build log -- what am I
       missing?

       Andy

       --
       Andy Riebs
       andy.ri...@hpe.com
       Hewlett-Packard Enterprise
       High Performance Computing Software Engineering
       +1 404 648 9024
       My opinions are not necessarily those of HPE

       _______________________________________________
       users mailing list
       us...@open-mpi.org
       Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
       Link to this post:
       http://www.open-mpi.org/community/lists/users/2016/05/29094.php

 _______________________________________________
 users mailing list
 us...@open-mpi.org
 Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
 Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29100.php

      
_______________________________________________
users mailing list
us...@open-mpi.org
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29101.php

      

_______________________________________________
users mailing list
us...@open-mpi.org
Subscription: https://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post: http://www.open-mpi.org/community/lists/users/2016/05/29102.php

Reply via email to