FWIW, the error message is telling you exactly what is wrong and provides a 
link to the FAQ item on how to fix it.

It's a bit inelegant that it's segv'ing after that, but the real issue is what 
is described in the help message.


> On Mar 6, 2019, at 3:07 PM, Zhifeng Yang <yangz...@gmail.com> wrote:
> 
> Hi
> 
> I compiled openmpi-3.1.2 using pgi 18.10 on our IBM power9 machine. After 
> that, I used openmpi-3.1.2 to compile parallel-netcdf-1.8.1. However, I got 
> the following error.
> 
> ./nc_test -c    -d .
> [c699login01:12104] mca_base_component_repository_open: unable to open 
> mca_plm_lsf: libbat.so: cannot open shared object file: No such file or 
> directory (ignored)
> [c699login01:12104] mca_base_component_repository_open: unable to open 
> mca_ras_lsf: libbat.so: cannot open shared object file: No such file or 
> directory (ignored)
> --------------------------------------------------------------------------
> WARNING: There are more than one active ports on host 'c699login01', but the
> default subnet GID prefix was detected on more than one of these
> ports.  If these ports are connected to different physical IB
> networks, this configuration will fail in Open MPI.  This version of
> Open MPI requires that every physically separate IB subnet that is
> used between connected MPI processes must have different subnet ID
> values.
> 
> Please see this FAQ entry for more details:
> 
>   http://www.open-mpi.org/faq/?category=openfabrics#ofa-default-subnet-gid
> 
> NOTE: You can turn off this warning by setting the MCA parameter
>       btl_openib_warn_default_gid_prefix to 0.
> --------------------------------------------------------------------------
> [c699login01:12103] *** Process received signal ***
> [c699login01:12103] Signal: Segmentation fault (11)
> [c699login01:12103] Signal code:  (3)
> [c699login01:12103] Failing at address: 0x615f6c61706f0064
> [c699login01:12103] [ 0] [0x2000000504d8]
> [c699login01:12103] [ 1] [0x34333164]
> [c699login01:12103] [ 2] /lib64/libc.so.6(__sbrk+0x98)[0x200000729b28]
> [c699login01:12103] [ 3] 
> /lib64/libc.so.6(__default_morecore+0x18)[0x2000006aece8]
> [c699login01:12103] [ 4] /lib64/libc.so.6(+0x9511c)[0x2000006a511c]
> [c699login01:12103] [ 5] /lib64/libc.so.6(+0x96ff4)[0x2000006a6ff4]
> [c699login01:12103] [ 6] /lib64/libc.so.6(__libc_malloc+0x8c)[0x2000006a938c]
> [c699login01:12103] [ 7] 
> /home/vy57456/application/pgi/18.10/openmpi-3.1.2/lib/libopen-pal.so.40(opal_show_help_yylex+0x98)[0x20000099a8e0]
> [c699login01:12103] [ 8] 
> /home/vy57456/application/pgi/18.10/openmpi-3.1.2/lib/libopen-pal.so.40(opal_show_help_vstring+0x25c)[0x20000099a2f4]
> [c699login01:12103] [ 9] 
> /home/vy57456/application/pgi/18.10/openmpi-3.1.2/lib/libopen-rte.so.40(orte_show_help+0x70)[0x200000838f48]
> [c699login01:12103] [10] 
> /home/vy57456/application/pgi/18.10/openmpi-3.1.2/lib/openmpi/mca_btl_openib.so(+0x160d4)[0x2000036060d4]
> 
> For detail about error, please see the attached file.
> 
> Best,
> Zhifeng
> 
> <make_check.log>_______________________________________________
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users


-- 
Jeff Squyres
jsquy...@cisco.com

_______________________________________________
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Reply via email to