That looks out-of-date to me - there have been commits all week long. Best 
guess is that the git mirror is behind

On Sep 20, 2013, at 6:48 AM, Victor Kocheganov <victor.kochega...@itseez.com> 
wrote:

> I have HEAD on git revision:
> commit 4c282fe5bc8a4143a8c6ac5c0f8d4af591277f6f
> Author: Ralph Castain <r...@open-mpi-git-mirror.example.com>
> Date:   Sun Sep 15 15:33:51 2013 +0000
> 
> May be there is a difference in PMI? I have PMI-1 on machine.
> 
> 
> On Fri, Sep 20, 2013 at 5:37 PM, Ralph Castain <r...@open-mpi.org> wrote:
> What revision level are you at? I just checked and it worked fine for me
> 
> On Sep 20, 2013, at 2:33 AM, Victor Kocheganov <victor.kochega...@itseez.com> 
> wrote:
> 
>> Hi folks!
>> 
>> I am trying to launch MPI master branch with srun (simple send/recv program, 
>> see attach) and using openib, but unfortunately I get a segfault.
>> 
>> Below is my workflow.
>> 1) I configured ompi/master with following line:
>> ./autogen.sh &&  ./configure --prefix=$PWD/install --with-openib --with-pmi 
>> && make -j3 && make install -j3
>> 
>> 2) exported (along with PATH and LD_LIBRARY_PATH) OMPI_MCA_btl variable:
>> export OMPI_MCA_btl=self,openib
>> 
>> 3) and launched with following line:
>> mpicc ~/usefull_tests/mpi_init.c && srun -n 2 ./a.out
>> 
>> Eventually I get following error: 
>> srun: error: mir6: task 1: Segmentation fault (core dumped)
>> srun: Terminating job step 17309.2
>> 
>> with following backtrace:
>> #0  0x00007f856c47b1d0 in ?? ()
>> #1  <signal handler called>
>> #2  0x00007f856d12d721 in rml_recv_cb (status=0, process_name=0x2027c50, 
>> buffer=0x7f857084ed10,
>>     tag=102, cbdata=0x0) at connect/btl_openib_connect_oob.c:823
>> #3  0x00007f857553ffb0 in orte_rml_base_process_msg (fd=-1, flags=4, 
>> cbdata=0x2027b80)
>>     at base/rml_base_msg_handlers.c:172
>> #4  0x00007f857522a6c6 in event_process_active_single_queue (base=0x1ed6c60, 
>> activeq=0x1ec9210)
>>     at event.c:1367
>> #5  0x00007f857522a93e in event_process_active (base=0x1ed6c60) at 
>> event.c:1437
>> #6  0x00007f857522afbc in opal_libevent2021_event_base_loop (base=0x1ed6c60, 
>> flags=1) at event.c:1645
>> #7  0x00007f85754ccc19 in orte_progress_thread_engine (obj=0x7f857577cf20) 
>> at runtime/orte_init.c:180
>> #8  0x0000003b5a6077f1 in start_thread () from /lib64/libpthread.so.0
>> #9  0x0000003b59ee570d in clone () from /lib64/libc.so.6
>> 
>> 
>> Can anybody please help with a reason of such failure?
>> 
>> P.s. I use  Red Hat Enterprise Linux Server release 6.2 with InfiniBand 
>> cards.
>> 
>> Thanks in advance,
>> Victor Kocheganov.
>> <mpi_test.c>_______________________________________________
>> devel mailing list
>> de...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to