Thanks Paul,

can you please try the patch available at https://patch-diff.githubusercontent.com/raw/open-mpi/ompi-release/pull/1357.patch ?


Cheers,


Gilles


On 9/1/2016 2:12 AM, Paul Hargrove wrote:
On an emulated UltraSPARC system running Linux (and using V9 ABI) I was able to build the RC, but get a SIGBUS when running ring_c. The problem is an unaligned 64-bit access, as shown by the gdb session below.

I have not tried, but it *might* be possible to reproduce on PPC64 via "prctl --unaligned=signal".

-Paul


Core was generated by `examples/ring_c'.
Program terminated with signal 10, Bus error.
#0  0xf630ed64 in component_set_addr (peer=0xf6bb7114, uris=0x90ec8)
at /home/phargrov/OMPI/openmpi-2.0.1rc2-linux-sparcv9/openmpi-2.0.1rc2/orte/mca/oob/usock/oob_usock_component.c:318 318 if (OPAL_SUCCESS != opal_hash_table_get_value_uint64(&mca_oob_usock_module.peers,

(gdb) l
313         if (ORTE_PROC_IS_APP) {
314 /* if this is my daemon, then take it - otherwise, ignore */
315             if (ORTE_PROC_MY_DAEMON->jobid == peer->jobid &&
316 ORTE_PROC_MY_DAEMON->vpid == peer->vpid) {
317 ui64 = (uint64_t*)peer;
318 if (OPAL_SUCCESS != opal_hash_table_get_value_uint64(&mca_oob_usock_module.peers,
319  (*ui64), (void**)&pr) || NULL == pr) {
320   pr = OBJ_NEW(mca_oob_usock_peer_t);
321   pr->name = *peer;
322 opal_hash_table_set_value_uint64(&mca_oob_usock_module.peers, (*ui64), pr);

(gdb) print ui64
$1 = (uint64_t *) 0xf6bb7114

--
Paul H. Hargrove phhargr...@lbl.gov <mailto:phhargr...@lbl.gov>
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department               Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900


_______________________________________________
devel mailing list
devel@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/devel

_______________________________________________
devel mailing list
devel@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/devel

Reply via email to