|
Hi, All, I have a cluster with 1 head node and 3 slave nodes, and their hostnames are: master: m01.c01 slaves: s01.c01 s02.c01 s03.c01 So I wanna build a small grid. I installed mpich-g2, globus, torque into my cluster, and the slaves share the mpich-g2 installation on the head node. I have pasted the "hello world" example with the mpich package, which proves my installation of globus is OK. Then I interfaced torque and globus and submitted the "hello world" job above with a RSL file like this: +It worked very well and the output is: [[EMAIL PROTECTED] hello]$ globusrun -w -f hello.rslThen I set the $mpirun in pbs.pm to $MPICH-G2_HOME/bin/mpirun, and submitted a mpich-g2 job: the classical "cpi" program with the mpich package. But it failed. This is the RSL file: +The output is: [[EMAIL PROTECTED] examples]$ ./mpirun -globusrsl cpi.rslSo I googled it and someone said that I have to remove the line "(jobtype=mpi)" if I don't use Vender MPI. I did it and the errors were gone, but it seems like all the processes ran on the head nodes while none on the slaves: [gt@m01.c01 examples]$ ./mpirun -globusrsl cpi.rslCould anyone tell me what's wrong with it? Thanks in advance! Best Regards, Narisu, Beihang University, Beijing, China. Email:[EMAIL PROTECTED] |
- [gt-user] Problems with mpich-g2 + globus + torque 那日苏
