Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-23 Thread Paul Hargrove
Some progress: I fixed IA64.asm but still saw failures. I realized I'd not checked the ia64/atomic.h file. Lo and behold the origin of the bogus "sxt4" is a pair of improper casts, removed by the following: --- opal/include/opal/sys/ia64/atomic.h~2014-01-23 13:04:03.0 -0800 +++ op

Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Paul Hargrove
On Wed, Jan 22, 2014 at 2:22 PM, Paul Hargrove wrote: > My ia64 asm is a bit rusty, but I'll give a quick look if/when I can. I had a look (in v1.7) and this is what I see: $cat -n IA64.asm | grep -A14 opal_atomic_cmpset_acq_64: 70 opal_atomic_cmpset_acq_64: 71 .prologue

Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Paul Hargrove
On Wed, Jan 22, 2014 at 1:59 PM, Ralph Castain wrote: > Huh - afraid I can't see anything wrong so far. All looks normal and then > it just hangs. Any chance you can "gdb" to the proc and see where it is > stuck? > Ralph, The gstack output below looks like one thread is spinning on an atomic of

Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Ralph Castain
Huh - afraid I can't see anything wrong so far. All looks normal and then it just hangs. Any chance you can "gdb" to the proc and see where it is stuck? On Jan 22, 2014, at 11:39 AM, Paul Hargrove wrote: > Ralph, > > Attached is the requested output with the addition of "-mca > grpcomm_base_

Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Paul Hargrove
Ralph, Attached is the requested output with the addition of "-mca grpcomm_base_verbose 5". I have also attached a 2nd output with the further addition of "-mca oob_tcp_if_include lo" to ensure that this is not related to the firewall issues I've seen on other hosts. I have use of this host until

Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Ralph Castain
Weird - everything looks completely normal. Can you add -mca grpcomm_base_verbose 5 to your cmd line? On Jan 22, 2014, at 1:38 AM, Paul Hargrove wrote: > Following-up as promised: > > Output from an --enable-debug build is attached. > > -Paul > > > On Tue, Jan 21, 2014 at 11:25 PM, Paul Ha

Re: [OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Paul Hargrove
Following-up as promised: Output from an --enable-debug build is attached. -Paul On Tue, Jan 21, 2014 at 11:25 PM, Paul Hargrove wrote: > Yes, this is familiar. See: > http://www.open-mpi.org/community/lists/devel/2013/11/13182.php > > If I understand correctly, the thread ended with: > > On

[OMPI devel] 1.7.4rc: mpirun hangs on ia64

2014-01-22 Thread Paul Hargrove
Yes, this is familiar. See: http://www.open-mpi.org/community/lists/devel/2013/11/13182.php If I understand correctly, the thread ended with: On 03 December 2013, Sylvestre Ledru wrote: > FYI, Debian has stopped supporting ia64 for its next release > So, I stopped working on that issue. We