David Zhang wrote:
When my MPI code fails (seg fault), it usually cause the rest of the mpi
process to abort as well. Perhaps rather than calling abort(), perhaps
you could do a divide-by-zero operation to halt the program?
David Zhang
University of California, San Diego
>
On Thu, Aug 12, 2010 at 6:49 PM, David Ronis <david.ro...@mcgill.ca
<mailto:david.ro...@mcgill.ca>> wrote:
I've got a mpi program that is supposed to to generate a core file if
problems arise on any of the nodes. I tried to do this by adding a
call to abort() to my exit routines but this doesn't work; I get no core
file, and worse, mpirun doesn't detect that one of my nodes has
aborted(?) and doesn't kill off the entire job, except in the trivial
case where the number of processors I'm running on is 1. I've replaced
abort with MPI_Abort, which kills everything off, but leaves no core
file. Any suggestions how I can get one and still have mpi exit?
Thanks in advance.
David
Also, make sure your computers' coredumpsize / core file size
limit is not zero, which is sometimes the case.
Gus Correa