Hi Edgar,
this sounds reasonable. Looking at the program with the debugger, I can
see that 15/16 processes wait in PMPI_Allreduce whereas the other one is
in PMPI_Wait.
Also, the program works with mvapich and I guess the ADIO source tree is
more or less the same (correct me if I'm wrong)?!
I am wondering whether this is really due to the usage of
File_write_all. We had a bug in in 1.3 series so far (which will be
fixed in 1.3.4) where we lost message segments and thus had a deadlock
in Comm_dup if there was communication occurring *right after* the
Comm_dup. File_open executes a