Dear ALL,
I am trying to checkpoint MPI application using the self
component. I had a look at the OPEN MPI FT user's guide Draft 1.4. but is still
unsure.
I have installed openmpi as follows:
jean$ ./configure --prefix=/home/jean/openmpi/ --enable-debug
--enable-mpi-profile -
Hi All,
I'm a trying to install openmpi with self. However, I am experiencing
some problems with openmpi itself.
I have successfully installed the software and added the path in the .bashrc
file as follows:
export PATH="/home/jean/openmpisof/bin:$PATH"
export LD_LIBRARY_PATH="/home/je
2009, 2:39 PM
What OMPI version are you using?
On Wed, Aug 19, 2009 at 8:36 AM, Jean Potsam wrote:
Hi All,
I'm a trying to install openmpi with self. However, I am experiencing
some problems with openmpi itself.
I have successfully installed the software and added the path
Dear All,
I tried to install openmpi 1.3 in my home directory on one of the
nodes of our cluster. I have adminsitrative rights and is installing as a
superuser. The configure phase is fine. The make all install part give me the
following problems:
##
Dear All,
I have installed openmpi 1.3.2 on one on the nodes of our cluster
and is running a simple helloword mpi program. The program runs fine but I get
a lot of unexpected messages in between the result.
##
jean@n06:~/examples$ mpirun -n
Dear all,
I am trying to checkpoint an mpi application at specific points
in my program. So, i created a small function as follows:
void mychkpt()
{
system ("ompi-checkpoint -v `pidof mpirun`");
}
and I am calling it in my MPI application at specific points. e.g
##
pri
Dear All,
I have installed openmpi 1.3.2 in my home directory (
/home/jean/openmpisof/ ) and BLCR in /usr/local/blcr. I have added the
following in the .bashrc file
export PATH=/home/jean/openmpisof/bin/:$PATH
export LD_LIBRARY_PATH=/home/jean/openmpisof/lib/:$LD_LIBRARY_PATH
exp
Hi Everyone,
I wrote a small program with a function to trigger the
checkpointing mechanism as follows:
#include
#include
#include
#include
#include
void trigger_checkpoint();
int main(int argc, char **argv)
{
int rank,size;
Hi Everyone,
I noticed that it hangs just before displaying the following
while trying to checkpoint the application.
[sun06:15252] orte_checkpoint: notify_hnp: Requested a checkpoint of jobid
[INVALID]
###
Can it be re
ayed as an INVALID handle.
-- Josh
On Sep 11, 2009, at 9:50 AM, Jean Potsam wrote:
>
> Hi Everyone,
> I noticed that it hangs just before displaying the following
>while trying to checkpoint the application.
>
>
> [sun06:152
Hi Everyone,
I am trying to checkpoint an mpi application running on
multiple nodes. However, I get some error messages when i trigger the
checkpointing process.
Error: expected_component: PID information unavailable!
Error: expected_component: Component Name information u
--- On Mon, 11/1/10, Josh Hursey wrote:
From: Josh Hursey
Subject: Re: [OMPI users] checkpointing multi node and multi process
applications
To: "Open MPI Users"
List-Post: users@lists.open-mpi.org
Date: Monday, 11 January, 2010, 21:42
On Dec 19, 2009, at 7:
Dear All,
I am having a problem with openmpi . I have installed openmpi 1.4
and blcr 0.8.1
I have written a small mpi application as follows below:
###
#include
#include
#include
#include
#include
#include
#include
#include
#include
#include
#include
be a bug with BLCR. Either way let us know what you
find out.
Cheers,
Josh
On Mar 27, 2010, at 6:17 AM, jody wrote:
> I'm not sure if this is the cause of your problems:
> You define the constant BUFFER_SIZE, but in the code you use a constant
> called BUFSIZ...
> Jody
>
>
14 matches
Mail list logo