Lenny --

Did this get fixed? We were mucking with some mca param stuff on the trunk yesterday; not sure if it was related to this failure or not.


On Mar 26, 2008, at 10:34 AM, Lenny Verkhovsky wrote:
Hi, all

I compiled and builded source from trunk
and it causes segfault

/home/USERS/lenny/OMPI_ORTE_NEW/bin/mpirun -np 1 -H witch17 /home/ USERS/lenny/TESTS/ORTE/mpi_p01_NEW -t lt

--------------------------------------------------------------------------
It looks like MPI_INIT failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or environment
problems.  This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):
  mca_mpi_register_params() failed
  --> Returned "Error" (-1) instead of "Success" (0)
--------------------------------------------------------------------------
[witch17:01220] *** Process received signal ***
[witch17:01220] Signal: Segmentation fault (11)
[witch17:01220] Signal code:  (128)
[witch17:01220] Failing at address: (nil)
[witch17:01220] [ 0] /lib64/libpthread.so.0 [0x2aadf7072c10]
[witch17:01220] [ 1] /home/USERS/lenny/OMPI_ORTE_NEW/lib/libopen- pal.so.0(free+0x56) [0x2aadf6acb6d6] [witch17:01220] [ 2] /home/USERS/lenny/OMPI_ORTE_NEW/lib/libopen- pal.so.0(opal_argv_free+0x25) [0x2aadf6ab9635] [witch17:01220] [ 3] /home/USERS/lenny/OMPI_ORTE_NEW/lib/libmpi.so.0 [0x2aadf67f4206] [witch17:01220] [ 4] /home/USERS/lenny/OMPI_ORTE_NEW/lib/libmpi.so. 0(MPI_Init+0xf0) [0x2aadf68117c0] [witch17:01220] [ 5] /home/USERS/lenny/TESTS/ORTE/mpi_p01_NEW(main +0xef) [0x40109f] [witch17:01220] [ 6] /lib64/libc.so.6(__libc_start_main+0xf4) [0x2aadf7199154] [witch17:01220] [ 7] /home/USERS/lenny/TESTS/ORTE/mpi_p01_NEW [0x400ee9]
[witch17:01220] *** End of error message ***
--------------------------------------------------------------------------
mpirun noticed that process rank 0 with PID 1220 on node witch17 exited on signal 11 (Segmentation fault).
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel


--
Jeff Squyres
Cisco Systems

Reply via email to