Thank you all for the helpful suggestions.
Mark, your scripts are exactly what I was looking! Thanks.
Joseph
___
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users
Hi,
the issue was with the hostname.
i changed the hostname of ip 127.0.0.1 in /etc/hosts from localhost to
"myhostname"
now i am able to connect to qmaster.
the problem was doubled hostname
thanks
neelaya
On Tue, Apr 22, 2014 at 5:04 PM, Reuti wrote:
> Am 22.04.2014 um 07:33 schrieb Neela
Is qrsh using the SSH subsystem? Or straight rsh/rlogin?
Does this happen with all users? Or a specific one?
Have you tried -verbose or set SGE_DEBUG_LEVEL?
Ian
On Tue, Apr 22, 2014 at 7:53 AM, Prentice Bisbal
wrote:
> On 04/22/2014 03:13 AM, Mikael Brandström Durling wrote:
>>
>> 21 apr 2014
On 04/22/2014 03:13 AM, Mikael Brandström Durling wrote:
21 apr 2014 kl. 19:59 skrev Prentice Bisbal :
After one of these qrsh jobs fails, I get the following e-mail:
Job 5326173 caused action: Job 5326173 set to ERROR
User=
Queue =pow1...@.
Start Time =
End Time
Am 22.04.2014 um 11:11 schrieb Sve N:
> thanks for your answer, Reuti:
> The spool directory is a local folder, which exists and can be used (to
> confirm I just tested this with the "KEEP_ACTIVE" parameter set -
> interestingly the error occured not before the fourth of the small jobs,
> which
Hi,
Am 22.04.2014 um 01:17 schrieb Ilya M:
>>> I have been using h_vmem as a consumable resource to limit the amount of
>>> memory users can request and to make sure jobs don't use more than they
>>> requested. It all has been working fine until we added nodes with GPU
>>> modules.
>>>
>>> Th
Am 22.04.2014 um 07:33 schrieb Neelaya Dhatchayani:
> -- Forwarded message --
> From: Neelaya Dhatchayani
> Date: Tue, Apr 22, 2014 at 11:02 AM
> Subject: Re: [gridengine users] grid engine installation in ubuntu 12.04
> To: Marco Donauer
>
>
> Hi Marco,
>
> thank you for your
Hi,
thanks for your answer, Reuti:
The spool directory is a local folder, which exists and can be used (to confirm
I just tested this with the "KEEP_ACTIVE" parameter set - interestingly the
error occured not before the fourth of the small jobs, which indicates, that
time (or load, or similar) a
21 apr 2014 kl. 19:59 skrev Prentice Bisbal :
> After one of these qrsh jobs fails, I get the following e-mail:
>
> Job 5326173 caused action: Job 5326173 set to ERROR
> User=
> Queue =pow1...@.
> Start Time =
> End Time=
> failed assumedly before job:can't get p