On Wed, Jun 13, 2012 at 1:47 AM, Erik Soyez
<[email protected]> wrote:
> You probably need some kind of cronjob to suspend and unsuspend your
> parallel jobs correctly.  Or does anyone have a patch for this?

Erik,

So is/was it really working when you try it with SGE 6.2u5??

I have not looked into the code that handles parallel job suspension
in detail (we were working on "near-by" code in 2008 and Shannon was
also looking into the suspending parallel jobs at that time, and thus
we just relied on him to debug the code :-D ).

However, in order to properly handle the case you metioned, the
qmaster will need to keep track of the number of times subordination
happens to a job. And I can already think of issues if the accounting
code is not accurate enough.

Do you know if other batch systems handle the case you mentioned correctly?

Rayson



>
> Regards, Erik Soyez.
>
>
>
> On Tue, 12 Jun 2012, Joseph Farran wrote:
>
>> Well, for our needs, we *REALLY* need Parallel Job suspension.    It's
>> not even a choice for us.
>>
>> If Torque/Maui can do it, I am sure OGE can do it without issues.
>>
>> Can someone please tell me what patch I need to install to un-break /
>> turn-on Parallel job suspension?
>>
>> If you guys are that paranoid about PE suspension, how about adding an
>> on/off flag for this since the code is already there and let the admin pick?
>>
>>
>> On 06/12/2012 06:52 AM, Dave Love wrote:
>>>
>>> "Joseph A. Farran"<[email protected]>  writes:
>>>
>>>> If you guys are taking requests, *please* add suspension and ignore old
>>>> Sun recommendation.
>>>
>>> Support for suspension exists, it's just broken (per the issue Reuti
>>> pointed to).  The use of | is clearly wrong, but the other bit isn't
>>> clear.  It's one of the available patches I wanted to understand before
>>> applying (and had forgotten about).  Can anyone cast more light on it?
>
>
>
> --
>
>
>
>
>
>
> --
> Vorstandsvorsitzender/Chairman of the board of management:
> Gerd-Lothar Leonhart
> Vorstand/Board of Management:
> Dr. Bernd Finkbeiner, Michael Heinrichs, Dr. Arno Steitz, Dr. Ingrid Zech
> Vorsitzender des Aufsichtsrats/
> Chairman of the Supervisory Board:
> Philippe Miltin
> Sitz/Registered Office: Tuebingen
> Registergericht/Registration Court: Stuttgart
> Registernummer/Commercial Register No.: HRB 382196
>
>
> _______________________________________________
> users mailing list
> [email protected]
> https://gridengine.org/mailman/listinfo/users

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to