It appears that with Ralph's oob_tcp patches (paul.diff) everything is now OK on Solaris-11/x86-64.
On Solaris-10/SPARC I needed to fix guess_strlen() (or change "%u" to "%d" to avoid the issue) or else I didn't get very far at all (SEGV in orterun). However, with that issue resolved things are still not "golden". I have applied the oob_tcp patches and rebuilt on the Solaris-10/SPARC system. I had hoped it would fix an interrupted select warning I'd seen. However, it is still there along with the loopback-if warning and one about a failed accept(). Output is below. -Paul $ mpirun -mca btl sm,self -np 2 examples/ring_c -------------------------------------------------------------------------- WARNING: No loopback interface was found. This can cause problems when we spawn processes as they are likely to be unable to connect back to their host daemon. Sadly, it may take awhile for the connect attempt to fail, so you may experience a significant hang time. You may wish to ctrl-c out of your job and activate loopback support on at least one interface before trying again. -------------------------------------------------------------------------- select: Interrupted system call -------------------------------------------------------------------------- WARNING: No loopback interface was found. This can cause problems when we spawn processes as they are likely to be unable to connect back to their host daemon. Sadly, it may take awhile for the connect attempt to fail, so you may experience a significant hang time. You may wish to ctrl-c out of your job and activate loopback support on at least one interface before trying again. -------------------------------------------------------------------------- -------------------------------------------------------------------------- WARNING: No loopback interface was found. This can cause problems when we spawn processes as they are likely to be unable to connect back to their host daemon. Sadly, it may take awhile for the connect attempt to fail, so you may experience a significant hang time. You may wish to ctrl-c out of your job and activate loopback support on at least one interface before trying again. -------------------------------------------------------------------------- [xxx.xxx.xxx.xxx:09934] mca_oob_tcp_accept: accept() failed: Resource temporarily unavailable (11). Process 0 sending 10 to 1, tag 201 (2 processes in ring) Process 0 sent to 1 Process 0 decremented value: 9 Process 0 decremented value: 8 Process 0 decremented value: 7 Process 0 decremented value: 6 Process 0 decremented value: 5 Process 0 decremented value: 4 Process 0 decremented value: 3 Process 0 decremented value: 2 Process 0 decremented value: 1 Process 0 decremented value: 0 Process 0 exiting Process 1 exiting -- Paul H. Hargrove phhargr...@lbl.gov Computer Languages & Systems Software (CLaSS) Group Computer Science Department Tel: +1-510-495-2352 Lawrence Berkeley National Laboratory Fax: +1-510-486-6900