Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Sharon Brunett
Thanks, I'll try the version you recommend below! Josh Hursey wrote: Your previous email indicted that you were using r18241. I committed in r18276 a patch that should fix this problem. Let me know if you still see it after that update. Cheers, Josh On Apr 29, 2008, at 3:18 PM, Sharon

Re: [OMPI users] "Permission denied" during MPI installation

2008-04-29 Thread Linwei Wang
Thank you for the advice, Jeff. Best, Linwei On Apr 29, 2008, at 4:26 PM, Jeff Squyres wrote: Alternatively, you can specify to install to a different location with the --prefix option to configure (possibly somewhere you do have write access). On Apr 29, 2008, at 3:17 PM, Linwei Wang

Re: [OMPI users] "Permission denied" during MPI installation

2008-04-29 Thread Jeff Squyres
Alternatively, you can specify to install to a different location with the --prefix option to configure (possibly somewhere you do have write access). On Apr 29, 2008, at 3:17 PM, Linwei Wang wrote: Yeah, it works. Thank you all for the help. Best, Linwei On Apr 29, 2008, at 4:00 PM,

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Sharon Brunett
Josh, I'm also having trouble using ompi-restart on a snapspot made from a run which was previously checkpointed. In other words, restarting a previously restarted run! (a) start the run mpirun -np 16 -am ft-enable-cr ./a.out <---do an ompi-checkpoint on the mpirun pid from (a) from

Re: [OMPI users] "Permission denied" during MPI installation

2008-04-29 Thread Linwei Wang
Yeah, it works. Thank you all for the help. Best, Linwei On Apr 29, 2008, at 4:00 PM, Alberto Giannetti wrote: Linwei, are you running the command as root? Try using sudo: # sudo make install It will ask you for an administrator password. On Apr 29, 2008, at 3:54 PM, Linwei Wang wrote:

Re: [OMPI users] "Permission denied" during MPI installation

2008-04-29 Thread Davi Vercillo C. Garcia
Hi, > Making install in etc > test -z "/usr/local/etc" || ../../config/install-sh -c -d "/usr/local/ > etc" > /usr/bin/install -c -m 644 openmpi-mca-params.conf /usr/local/etc/ > openmpi-mca-params.conf > install: /usr/local/etc/openmpi-mca-params.conf: Permission denied > make[3]: ***

Re: [OMPI users] "Permission denied" during MPI installation

2008-04-29 Thread Alberto Giannetti
Linwei, are you running the command as root? Try using sudo: # sudo make install It will ask you for an administrator password. On Apr 29, 2008, at 3:54 PM, Linwei Wang wrote: Dear all, I'm new to MPI... I'm trying to install open MPI on my mac (Leopard).. But during the installation (with

[OMPI users] "Permission denied" during MPI installation

2008-04-29 Thread Linwei Wang
Dear all, I'm new to MPI... I'm trying to install open MPI on my mac (Leopard).. But during the installation (with the command "make install") it keeps generating errors of "Permission denied" Anyone has an idea of the reasons? Part of the output is like: " Making install in etc test -z

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Sharon Brunett
Josh, Thanks for the quick response. I'll test against some key applications we would like to use blcr checkpointing/restarting against. Perhaps if we're lucky and careful, we'll be able to get some near term use out of what we have installed. Sharon Josh Hursey wrote: Sharon, This is,

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Sharon Brunett
Josh Hursey wrote: On Apr 29, 2008, at 12:55 AM, Sharon Brunett wrote: I'm finding that using ompi-checkpoint on an application which is very cpu bound takes a very very long time. For example, trying to checkpoint a 4 or 8 way Pallas MPI Benchmark application can take more than an

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Sharon Brunett
Josh, Thanks for your inputs. Yes, I'm able to restart properly outside the hostfile issues. The problems were with the permissions on /var/run/nscd/passwd The hostfile issues have now also been resolved...the problem was interactions with maui/torque's hostfile and getting a proper hostfile

Re: [OMPI users] setting the btl_tcp_eager_limit

2008-04-29 Thread George Bosilca
The btl_tcp_sndbuf and btl_tcp_rcvbuf are limited by the kernel (usually 128K), so there is no reason to set them to something huge, if the kernel is unable to support these values. The eager didn't get modified between 1.1 and 1.2, so it should work as expected. george. On Apr 28,

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Josh Hursey
On Apr 29, 2008, at 12:55 AM, Sharon Brunett wrote: I'm finding that using ompi-checkpoint on an application which is very cpu bound takes a very very long time. For example, trying to checkpoint a 4 or 8 way Pallas MPI Benchmark application can take more than an hour. The problem is not

Re: [OMPI users] blcr_checkpoint_peer: execvp returned -1

2008-04-29 Thread Leonardo Fialho
Josh, Yesterday at night I made some changes, checkout a new SVN version, and revise completely the BLCR installation. It´s working fine. I suspect 2 different things: 1) cache or old files (configured with older BLCR version path) in autom4te, configure or dependencies; 2) some miss

Re: [OMPI users] infiniband

2008-04-29 Thread Pavel Shamis (Pasha)
SLIM H.A. wrote: Is it possible to get information about the usage of hca ports similar to the result of the mx_endpoint_info command for Myrinet boards? The ibstat command gives information like this: Port 1: State: Active Physical state: LinkUp but does not say whether a job is actually

Re: [OMPI users] openmpi-1.3a1r18241 ompi-restart issue

2008-04-29 Thread Sharon Brunett
I'm finding that using ompi-checkpoint on an application which is very cpu bound takes a very very long time. For example, trying to checkpoint a 4 or 8 way Pallas MPI Benchmark application can take more than an hour. The problem is not where I'm dumping checkpoints (I've tried local and an nfs