Re: [gridengine users] queue instance "all.q@comp065.local" dropped because it is temporarily not available

2014-10-21 Thread Waleed Lutfi
Update: Going through the spool messages of comp065 I found this message: 10/21/2014 14:48:34| main|comp065|E|can't start job "155": can't open file /opt/gridengine/default/spool/comp065/active_jobs/155.1/pe_hostfile: No such file or Note that spool directory is a mounted NFS directory. I tried

[gridengine users] queue instance "all.q@comp065.local" dropped because it is temporarily not available

2014-10-21 Thread Waleed Lutfi
Dear all, I am currently configuring Grid Engine on a fresh install of Rocks cluster. I have 3 compute nodes. Whenever I submit any job it only runs on 1 of the nodes and the other nodes' jobs halt in 't' state. Running 'qconf -tsm', I get the following log: Tue Oct 21 14:36:49 2014|