Hello List,

Just finished installing the oscar cluster.  All went well.

Then began testing mpich. As a local user I am able to use mpirun or the pbs system to run jobs.

I can use mpirun as one of my non-local, ldap in this case, users and the job runs.

But when I use the pbs system to submit the identical job, the job hangs in the queue. I am seeing the following in the mom_logs on the nodes involved:
=========================
02/15/2006 15:06:22;0008; pbs_mom;Job;6.master;No Password Entry for User tmac0501 02/15/2006 15:10:26;0008; pbs_mom;Job;6.master;ERROR: received request 'ABORT_JOB' from 192.168.0.6:1023 for job '6.master' (job does not exist locally)
========================

Not sure what I have to configure. I haven't seen anyhting in the pbs docs regarding authentication yet.

The ldap users can ssh to each node and their home is mounted.

Ideas / Suggestions?

TIA
--
Jim Summers
School of Computer Science-University of Oklahoma
-------------------------------------------------


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Oscar-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to