Hi Belaid

PBS loves to read the node list backwards.
If you want to start with WN1,
put it last in the Torque/PBS "nodes" file.

Gus Correa
---------------------------------------------------------------------
Gustavo Correa
Lamont-Doherty Earth Observatory - Columbia University
Palisades, NY, 10964-8000 - USA
---------------------------------------------------------------------
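
For illustration only, here is a minimal Python sketch of the reordering Gus describes: it moves one host's entry to the end of the text of a Torque "nodes" file. The function name move_node_last and the "np=2" attribute values are assumptions made for the example; only the host names WN1 and WN2 come from this thread. It works on text and does not touch the real file or restart pbs_server.

```python
def move_node_last(nodes_text, hostname):
    """Return nodes-file text with the entry for `hostname` moved to the end.

    Blank lines and comment lines are left where they are.
    """
    keep, moved = [], []
    for line in nodes_text.splitlines():
        fields = line.split()
        # The first whitespace-separated field of an entry is the host name.
        if fields and not fields[0].startswith("#") and fields[0] == hostname:
            moved.append(line)
        else:
            keep.append(line)
    return "\n".join(keep + moved) + "\n"


if __name__ == "__main__":
    # Hypothetical two-node file; the np= values are made up.
    sample = "WN1 np=2\nWN2 np=2\n"
    print(move_node_last(sample, "WN1"), end="")
```

Whether PBS really hands out nodes in reverse file order may depend on the scheduler in use, so treat the reordering as something to try rather than a guarantee.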


Belaid MOA wrote:
I actually tried both:
-- In interactive mode, as soon as I hit enter, PBS sends me to a worker node (WN2) that does not have TM support. I guess if I had added the head node to the list of PBS nodes, I would not have run into the problem. However, I am glad I did run into it: you guys helped me understand how mpirun interacts with PBS.
-- In batch mode, PBS chooses WN2 as the root node (rank 0) to run mpirun.
So both behaved the same in my case.

Many thanks, Ralph, for the clarification. I am now getting into the Open MPI spirit. By the way, I love it :)

~Belaid.



------------------------------------------------------------------------
From: r...@open-mpi.org
Date: Tue, 1 Dec 2009 19:44:38 -0700
To: us...@open-mpi.org
Subject: Re: [OMPI users] mpirun is using one PBS node only

Just to further show my confusion (since I wrote much of the TM support):

If you get an interactive allocation and then type "mpirun ....", mpirun will execute on the node upon which you are sitting. Jeff's statement is -only- true if you "qsub" the job - i.e., you run it in batch mode.

From your prior notes, I was under the impression you were running interactively. If this is true, then you only need OMPI to be built with TM support on the node which you are interactively working on.

If you are running it in batch mode, then Jeff is correct - you will need OMPI built with TM support on all nodes since you (in general) cannot know which node PBS will launch you on.

Hope that is clear
Ralph
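
Since Ralph's point turns on which nodes have an Open MPI built with TM support, a quick per-node check may help. The sketch below is only that: it scans the text printed by ompi_info for MCA components named "tm". It assumes the lines of interest look like "MCA <framework>: tm (...)", which may differ between Open MPI versions; the helper name tm_components is made up for this example.

```python
import shutil
import subprocess


def tm_components(ompi_info_text):
    """Return the MCA frameworks that list a 'tm' component, sorted."""
    found = []
    for line in ompi_info_text.splitlines():
        # Assumed line shape: "MCA ras: tm (MCA v2.0, ...)"
        parts = line.split()
        if len(parts) >= 3 and parts[0] == "MCA" and parts[2] == "tm":
            found.append(parts[1].rstrip(":"))
    return sorted(found)


if __name__ == "__main__":
    if shutil.which("ompi_info") is None:
        print("ompi_info not found on PATH")
    else:
        out = subprocess.run(["ompi_info"], capture_output=True, text=True).stdout
        frameworks = tm_components(out)
        if frameworks:
            print("TM components found in:", ", ".join(frameworks))
        else:
            print("no TM components found; this build probably lacks TM support")
```

Run it on the node where mpirun will start: the node you are sitting on for an interactive job, or every node that PBS might pick for a batch job.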

On Dec 1, 2009, at 7:35 PM, Belaid MOA wrote:

> Yes, this page is definitely incorrect if you want to run with PBS/TM
> support -- you definitely need to install with TM support on all nodes.
>
> The reason is that PBS will launch your script (and therefore
> "mpirun") on the first node of the job. This node must have an Open
> MPI mpirun with TM support so that it can query PBS to get the nodes
> to launch on, use the TM API to launch processes on the nodes
> allocated to the job, etc.

    That's exactly the sound logic I understood from the Open MPI
    documentation, but I thought the website was reliable, in the
    sense that it configured the server node with TM support.

    I am very grateful to be subscribed to this list. It is one of the
    most active lists I have ever seen.

    Thanks a lot Jeff.

    With Best Regards.
    ~Belaid.
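
Jeff's description (PBS starts the script on the first node of the job, and mpirun then asks PBS for the rest) can be checked from inside a job. The sketch below reads the file named by the PBS_NODEFILE environment variable, which Torque/PBS sets for a running job, and prints the host the script is running on next to the allocated hosts. The helper name allocated_hosts is an assumption for this example, and the script only reports; it does not launch anything.

```python
import os
import socket


def allocated_hosts(nodefile_text):
    """Return the unique hosts in a node file, in order of first appearance."""
    hosts = []
    for line in nodefile_text.splitlines():
        host = line.strip()
        # The node file may repeat a host once per allocated slot.
        if host and host not in hosts:
            hosts.append(host)
    return hosts


if __name__ == "__main__":
    path = os.environ.get("PBS_NODEFILE")
    if path is None:
        print("PBS_NODEFILE is not set; probably not inside a PBS job")
    else:
        with open(path) as fh:
            hosts = allocated_hosts(fh.read())
        print("running on:", socket.gethostname())
        print("allocated :", ", ".join(hosts))
```

If only one host shows up under "allocated", the job itself was given a single node, which is a different problem from mpirun lacking TM support.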

    _______________________________________________
    users mailing list
    us...@open-mpi.org <mailto:us...@open-mpi.org>
    http://www.open-mpi.org/mailman/listinfo.cgi/users




