[OMPI users] Anything on fault tolerance?

2007-07-16 Thread Mohammad Huwaidi
Hi, What's the latest status on Open-MPI fault tolerance? Is there any progress? I am only interested to intercept problems when they occur (such as a node crash) without taking the whole MPI_WORLD down with it. At least, I want to cope with such situation. I did use ERRORS_THROW_EXCEPTIONS;

Re: [OMPI users] mpi with icc, icpc and ifort :: segfault (Jeff Squyres)

2007-07-16 Thread Ricardo Reis
On Fri, 13 Jul 2007, Jeff Squyres wrote: Ah -- intel 10. I think you said this before but I blew right past it. I have not personally tested with intel 10; I don't know if anyone else on the team has. I've just installed, compiled and tested OMPI with intel 10. in a PentIV with EMT64 and it

Re: [OMPI users] DataTypes with "holes" for writing files

2007-07-16 Thread Brian Barrett
I wouldn't worry about it. 1.2.3 has no ROMIO fixes over 1.2.2. Brian On Jul 16, 2007, at 9:42 AM, jody wrote: Brian, I am using OpenMPI 1.2.2, so i am lagging a bit behind. Should i update to 1.2.3 and do the test again? Thanks for the info Jody On 7/16/07, Brian Barrett wrote: Jody -

Re: [OMPI users] DataTypes with "holes" for writing files

2007-07-16 Thread jody
Brian, I am using OpenMPI 1.2.2, so i am lagging a bit behind. Should i update to 1.2.3 and do the test again? Thanks for the info Jody On 7/16/07, Brian Barrett wrote: Jody - I usually update the ROMIO package before each major release (1.0, 1.1, 1.2, etc.) and then only within a major r

[OMPI users] mpirun hangs on remote nodes -- how to find where and why?

2007-07-16 Thread Bill Johnstone
Hello. I'm trying to use Open MPI 1.2.3 on a cluster of dual-processor AMD64 nodes. These nodes are all connected via gigabit ethernet on a private, self-contained IP network. The OS is GNU/Linux, gcc 4.1.2, kernel 2.6.21 . Open MPI was configured with --prefix=/usr/local and installed via make

Re: [OMPI users] DataTypes with "holes" for writing files

2007-07-16 Thread Brian Barrett
Jody - I usually update the ROMIO package before each major release (1.0, 1.1, 1.2, etc.) and then only within a major release series when a bug is found that requires an update. This seems to be one of those times ;). Just to make sure we're all on the same page, which version of Open

Re: [OMPI users] end-to-end data reliability

2007-07-16 Thread Brian Barrett
On Jul 15, 2007, at 10:05 PM, Isaac Huang wrote: Hello, I read from the FAQ that current Open MPI releases don't support end-to-end data reliability. But I still have some confusing that can't be solved by googling or reading the FAQ: 1. I read from "MPI - The Complete Reference" that "MPI prov

Re: [OMPI users] DataTypes with "holes" for writing files

2007-07-16 Thread jody
Rob, thanks for your info. Do you know whether OpenMPI will use a newer version of ROMIO sometimes soon? Jody On 7/13/07, Robert Latham wrote: On Tue, Jul 10, 2007 at 04:36:01PM +, jody wrote: > Error: Unsupported datatype passed to ADIOI_Count_contiguous_blocks > [aim-nano_02:9] MPI

[OMPI users] end-to-end data reliability

2007-07-16 Thread Isaac Huang
Hello, I read from the FAQ that current Open MPI releases don't support end-to-end data reliability. But I still have some confusing that can't be solved by googling or reading the FAQ: 1. I read from "MPI - The Complete Reference" that "MPI provides the user with reliable message transmission. A