On Wed, 2009-11-18 at 01:28 -0800, Bill Broadley wrote: > A rather stable production code that has worked with various versions > of MPI > on various architectures started hanging with gcc-4.4.2 and openmpi > 1.3.33 > > Which lead me to this thread.
If you're investigating hangs in a parallel job take a look at the tool linked to below (padb), it should be able to give you a parallel stack trace and the message queues for the job. http://padb.pittman.org.uk/full-report.html Ashley, -- Ashley Pittman, Bath, UK. Padb - A parallel job inspection tool for cluster computing http://padb.pittman.org.uk