On Jun 29, 2006, at 11:16 PM, Graham E Fagg wrote:
On Thu, 29 Jun 2006, Doug Gregor wrote:

When I use algorithm 6, I get:

[odin003.cs.indiana.edu:14174] *** An error occurred in MPI_Bcast
[odin005.cs.indiana.edu:10510] *** An error occurred in MPI_Bcast
Broadcasting integers from root 0...[odin004.cs.indiana.edu:11752]
*** An error occurred in MPI_Bcast

ops.. my mistake!.. only 0-5 are valid for bcast I have to change the error message:
[reliant:06935] coll:tuned:bcast_intra_do_forced algorithm 6
[reliant:06935] coll:tuned:bcast_intra_do_forced attempt to select algorithm 6 when only 0-6 is valid?

I'm still trying to find out why it hangs.. let you know as soon as I find anything but right now I am testing using TCP.

FWIW, I was able to reproduce the problem with both tcp and mvapi.

Can you let me know the exact path and LD_LIBRARY_PATH your using on odin?

PATH=/san/mpi/openmpi-1.1-gcc/bin:/u/dgregor/bin:/usr/kerberos/bin:/ usr/local/bin:/bin:/usr/bin:/usr/X11R6/bin:/san/intel/cc/9.0/bin:/san/ intel/fc/9.0/bin:/san/intel/idb/9.0/bin:/san/pathscale/bin:/usr/local/ sched/slurm/bin

LD_LIBRARY_PATH=/san/mpi/openmpi-1.1-gcc/lib:/san/intel/cc/9.0/lib:


        Doug

Reply via email to