Re: [OMPI devel] [OMPI users] bus error with openmpi-1.8.2 and gcc-4.9.0

2014-09-02 Thread Kawashima, Takahiro
Hi Siegmar, Ralph,

I forgot to follow the previous report, sorry.
The patch I suggested is not included in Open MPI 1.8.2.
The backtrace Siegmar reported points the problem that I fixed
in the patch.

  http://www.open-mpi.org/community/lists/users/2014/08/24968.php

Siegmar:
Could you try my patch again?

Ralph (or someone committer):
Open MPI 1.8 needs custom patch that I posted. See my previous mail.
Could you review it and commit it to v1.8 branch?

Regards,
Takahiro

> Hi,
> 
> yesterday I installed openmpi-1.8.2 on my machines (Solaris 10 Sparc
> (tyr), Solaris 10 x86_64 (sunpc0), and openSUSE Linux 12.1 x86_64
> (linpc0)) with gcc-4.9.0. A small program works on some machines,
> but breaks with a bus error on Solaris 10 Sparc.
> 
> 
> tyr small_prog 118 which mpicc
> /usr/local/openmpi-1.8.2_64_gcc/bin/mpicc
> tyr small_prog 119 ompi_info | grep MPI:
> Open MPI: 1.8.2
> tyr small_prog 120 mpiexec -np 1 --host linpc0 init_finalize
> Hello!
> tyr small_prog 121 mpiexec -np 1 --host sunpc0 init_finalize
> Hello!
> tyr small_prog 122 mpiexec -np 1 --host tyr init_finalize
> [tyr:28081] *** Process received signal ***
> [tyr:28081] Signal: Bus Error (10)
> [tyr:28081] Signal code: Invalid address alignment (1)
> [tyr:28081] Failing at address: 7fffd304
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-pal.so.6.2.0:opal_backtrace_print+0x2c
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-pal.so.6.2.0:0xcd118
> /lib/sparcv9/libc.so.1:0xd8b98
> /lib/sparcv9/libc.so.1:0xcc70c
> /lib/sparcv9/libc.so.1:0xcc918
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/openmpi/mca_db_hash.so:0x3ee8
>  [ Signal 10 (BUS)]
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-pal.so.6.2.0:opal_db_base_store+0xc8
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-rte.so.7.0.4:orte_util_decode_pidmap+0x798
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-rte.so.7.0.4:orte_util_nidmap_init+0x3cc
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/openmpi/mca_ess_env.so:0x226c
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-rte.so.7.0.4:orte_init+0x308
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libmpi.so.1.5.2:ompi_mpi_init+0x31c
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libmpi.so.1.5.2:MPI_Init+0x2a8
> /home/fd1026/SunOS/sparc/bin/init_finalize:main+0x10
> /home/fd1026/SunOS/sparc/bin/init_finalize:_start+0x7c
> [tyr:28081] *** End of error message ***
> --
> mpiexec noticed that process rank 0 with PID 28081 on node tyr exited on 
> signal 10 (Bus Error).
> --
> tyr small_prog 123 
> 
> 
> 
> gdb shows the following backtrace.
> 
> tyr small_prog 123 /usr/local/gdb-7.6.1_64_gcc/bin/gdb 
> /usr/local/openmpi-1.8.2_64_gcc/bin/mpiexec  
> GNU gdb (GDB) 7.6.1
> Copyright (C) 2013 Free Software Foundation, Inc.
> License GPLv3+: GNU GPL version 3 or later 
> This is free software: you are free to change and redistribute it.
> There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
> and "show warranty" for details.
> This GDB was configured as "sparc-sun-solaris2.10".
> For bug reporting instructions, please see:
> ...
> Reading symbols from 
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/bin/orterun...done.
> (gdb) run -np 1 --host tyr init_finalize
> Starting program: /usr/local/openmpi-1.8.2_64_gcc/bin/mpiexec -np 1 --host 
> tyr init_finalize
> [Thread debugging using libthread_db enabled]
> [New Thread 1 (LWP 1)]
> [New LWP2]
> [tyr:28099] *** Process received signal ***
> [tyr:28099] Signal: Bus Error (10)
> [tyr:28099] Signal code: Invalid address alignment (1)
> [tyr:28099] Failing at address: 7fffd244
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-pal.so.6.2.0:opal_backtrace_print+0x2c
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-pal.so.6.2.0:0xcd118
> /lib/sparcv9/libc.so.1:0xd8b98
> /lib/sparcv9/libc.so.1:0xcc70c
> /lib/sparcv9/libc.so.1:0xcc918
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/openmpi/mca_db_hash.so:0x3ee8
>  [ Signal 10 (BUS)]
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-pal.so.6.2.0:opal_db_base_store+0xc8
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-rte.so.7.0.4:orte_util_decode_pidmap+0x798
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-rte.so.7.0.4:orte_util_nidmap_init+0x3cc
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/openmpi/mca_ess_env.so:0x226c
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libopen-rte.so.7.0.4:orte_init+0x308
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libmpi.so.1.5.2:ompi_mpi_init+0x31c
> /export2/prog/SunOS_sparc/openmpi-1.8.2_64_gcc/lib64/libmpi.so.1.5.2:MPI_Init+0x2

[OMPI devel] Migrating Trac wiki and tickets to Github

2014-09-02 Thread Jeff Squyres (jsquyres)
As part of moving the final OMPI SVN repo to Github, we're also converting the 
Trac wiki and tickets.

I have a first attempt at converting the OMPI trac wiki to Github's Markdown:

   https://github.com/jsquyres/ompi-test/wiki

*** Note: there's a lot of stale content there -- because there's a lot of 
stale content on the Trac wiki.  I really didn't make an effort to update any 
stale (Trac) content, other than adding a small number "this page needs to be 
re-written for git/github.  The intent here is to make sure that there are no 
egregious Trac->Github wiki conversion errors.

Please feel free to make any changes/updates you want -- it's easy to move this 
wiki to its final location (i.e., when we establish the actual/final OMPI 
Github repo).

(ticket conversion is still in the works)

-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/