This is a known issue that occurs occasionally with PBS... to blow away persistent job state:
Stop the server
Remove contents of /var/spool/pbs/server_priv/jobs
Start server.

Jeremy

At 03:36 PM 2/7/2003 -0500, Brian Williams wrote:
I'm not exactly sure how, but one of my PBS jobs crashed ofr blew up and
won't finish executing. However, I can't seem to kill it, even after
restarting maui, pbs_mom and pbs_server.
qstat says:
Job id           Name             User             Time Use S Queue
---------------- ---------------- ---------------- -------- - -----
2313.athisl      spf425           testbed                 0 R workq

but when I try to qdel 2313 as either testbed or root, i get:
testbed@athisl:~/scripts/PBS/SPF[116] qdel 2313
qdel: Server could not connect to MOM 2313.athisl.quantumleap.us
qmgr seems to work fine, and I can add new things to the queue, but they
won't run. So somehow I've lost my connection to mom even tho the process is
running fine and has restarted fine. any help?
Thanks,
Brian

Brian E. Williams
Software Developer and Systems Administrator
Quantum Leap Innovations
(302)894-8036               [EMAIL PROTECTED]
http://copland.udel.edu/~brianw


-------------------------------------------------------
This SF.NET email is sponsored by:
SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See!
http://www.vasoftware.com
_______________________________________________
Oscar-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/oscar-users


-------------------------------------------------------
This SF.NET email is sponsored by:
SourceForge Enterprise Edition + IBM + LinuxWorld = Something 2 See!
http://www.vasoftware.com
_______________________________________________
Oscar-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to