Re: [OMPI devel] Checkpoint/Restart svn trunk

2008-08-12 Thread Caciano Machado
Thanks. Now it's working. On Tue, Aug 12, 2008 at 8:21 AM, Jeff Squyres wrote: > Ralph committed a proper fix yesterday; see if that works for you. > > On Aug 11, 2008, at 7:44 PM, Caciano Machado wrote: > >> Jeff, >> >> Here is an ugly hack that I'm using to get this working in Linux until >> Jo

Re: [OMPI devel] Checkpoint/Restart svn trunk

2008-08-12 Thread Jeff Squyres
Ralph committed a proper fix yesterday; see if that works for you. On Aug 11, 2008, at 7:44 PM, Caciano Machado wrote: Jeff, Here is an ugly hack that I'm using to get this working in Linux until Josh returns. ## --- ompi-trunk/orte/util

Re: [OMPI devel] Checkpoint/Restart svn trunk

2008-08-11 Thread Caciano Machado
Jeff, Here is an ugly hack that I'm using to get this working in Linux until Josh returns. ## --- ompi-trunk/orte/util/hnp_contact.c 2008-08-12 12:10:07.0 +0200 +++ ompi-trunk-caciano/orte/util/hnp_contact.c 2008-08-12 12:08:52.000

Re: [OMPI devel] Checkpoint/Restart svn trunk

2008-08-11 Thread Jeff Squyres
This is likely to two things: - we just made some minor changes to the session directory stuff - the checkpoint/restart guy (Josh) is off on vacation for about 3 weeks I'll file a ticket about this so that he's aware of it and can fix it when he returns. Thanks for the heads-up! On Aug 11,

Re: [OMPI devel] Checkpoint/Restart svn trunk

2008-08-11 Thread Caciano Machado
I found that open mpi is looking for the file contact.txt in the wrong directory. It always searches the file in the directory "/tmp/openmpi-sessions-root@debian_0//" but this file exists only in "/tmp/openmpi-sessions-root@debian_0//0". When I copy contact.txt to the directory where open mpi searc

[OMPI devel] Checkpoint/Restart svn trunk

2008-08-11 Thread Caciano Machado
Hi, I'm trying to run the last checkpoint/restart (rev 19235) but ompi is showing the following error in "ompi-checkpoint". It seems to be something in function "orte_list_local_hnps" of the file orte/util/hnp_contact.c. I'm using BLCR 0.7.2 and it's working correctly with the example application