Re: [gridengine users] The state of a queue

2014-04-22 Thread Joseph Farran
Thank you all for the helpful suggestions. Mark, your scripts are exactly what I was looking! Thanks. Joseph ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] Fwd: grid engine installation in ubuntu 12.04

2014-04-22 Thread Neelaya Dhatchayani
Hi, the issue was with the hostname. i changed the hostname of ip 127.0.0.1 in /etc/hosts from localhost to "myhostname" now i am able to connect to qmaster. the problem was doubled hostname thanks neelaya On Tue, Apr 22, 2014 at 5:04 PM, Reuti wrote: > Am 22.04.2014 um 07:33 schrieb Neela

Re: [gridengine users] can't get password entry for user "xxxx". Either the user does not exist or NIS error!

2014-04-22 Thread Ian Kaufman
Is qrsh using the SSH subsystem? Or straight rsh/rlogin? Does this happen with all users? Or a specific one? Have you tried -verbose or set SGE_DEBUG_LEVEL? Ian On Tue, Apr 22, 2014 at 7:53 AM, Prentice Bisbal wrote: > On 04/22/2014 03:13 AM, Mikael Brandström Durling wrote: >> >> 21 apr 2014

Re: [gridengine users] can't get password entry for user "xxxx". Either the user does not exist or NIS error!

2014-04-22 Thread Prentice Bisbal
On 04/22/2014 03:13 AM, Mikael Brandström Durling wrote: 21 apr 2014 kl. 19:59 skrev Prentice Bisbal : After one of these qrsh jobs fails, I get the following e-mail: Job 5326173 caused action: Job 5326173 set to ERROR User= Queue =pow1...@. Start Time = End Time

Re: [gridengine users] GE2011.11p1 on SLES 11.3 - execution host in error state

2014-04-22 Thread Reuti
Am 22.04.2014 um 11:11 schrieb Sve N: > thanks for your answer, Reuti: > The spool directory is a local folder, which exists and can be used (to > confirm I just tested this with the "KEEP_ACTIVE" parameter set - > interestingly the error occured not before the fourth of the small jobs, > which

Re: [gridengine users] h_vmem + CUDA

2014-04-22 Thread Reuti
Hi, Am 22.04.2014 um 01:17 schrieb Ilya M: >>> I have been using h_vmem as a consumable resource to limit the amount of >>> memory users can request and to make sure jobs don't use more than they >>> requested. It all has been working fine until we added nodes with GPU >>> modules. >>> >>> Th

Re: [gridengine users] Fwd: grid engine installation in ubuntu 12.04

2014-04-22 Thread Reuti
Am 22.04.2014 um 07:33 schrieb Neelaya Dhatchayani: > -- Forwarded message -- > From: Neelaya Dhatchayani > Date: Tue, Apr 22, 2014 at 11:02 AM > Subject: Re: [gridengine users] grid engine installation in ubuntu 12.04 > To: Marco Donauer > > > Hi Marco, > > thank you for your

Re: [gridengine users] GE2011.11p1 on SLES 11.3 - execution host in error state

2014-04-22 Thread Sve N
Hi, thanks for your answer, Reuti: The spool directory is a local folder, which exists and can be used (to confirm I just tested this with the "KEEP_ACTIVE" parameter set - interestingly the error occured not before the fourth of the small jobs, which indicates, that time (or load, or similar) a

Re: [gridengine users] can't get password entry for user "xxxx". Either the user does not exist or NIS error!

2014-04-22 Thread Mikael Brandström Durling
21 apr 2014 kl. 19:59 skrev Prentice Bisbal : > After one of these qrsh jobs fails, I get the following e-mail: > > Job 5326173 caused action: Job 5326173 set to ERROR > User= > Queue =pow1...@. > Start Time = > End Time= > failed assumedly before job:can't get p