> Am 20.12.2016 um 02:45 schrieb John_Tai <[email protected]>:
>
> I spoke too soon. I can request PE and virtual_free separately, but I cannot
> request both:
>
>
>
> # qsub -V -b y -cwd -now n -pe cores 7 -l mem=10G -q all.q@ibm037 xclock
Above you request "mem" (which is a snapshot of the actual usage and may vary
over the runtime of other jobs [unless they request the total amount already at
the beginning of the job and stay with it]).
> Your job 180 ("xclock") has been submitted
> # qstat
> job-ID prior name user state submit/start at queue
> slots ja-task-ID
> -----------------------------------------------------------------------------------------------------------------
> 180 0.55500 xclock johnt qw 12/20/2016 09:43:41
> 7
> # qstat -j 180
> ==============================================================
> job_number: 180
> exec_file: job_scripts/180
> submission_time: Tue Dec 20 09:43:41 2016
> owner: johnt
> uid: 162
> group: sa
> gid: 4563
> sge_o_home: /home/johnt
> sge_o_log_name: johnt
> sge_o_path:
> /home/sge/sge8.1.9-1.el5/bin:/home/sge/sge8.1.9-1.el5/bin/lx-amd64:/bin:/usr/bin:/usr/local/bin:/usr/X11R6/bin:/home/johnt/bin:.
> sge_o_shell: /bin/tcsh
> sge_o_workdir: /home/johnt/sge8
> sge_o_host: ibm005
> account: sge
> cwd: /home/johnt/sge8
> hard resource_list: virtual_free=10G
10G times 7 = 70 GB
The node has this amount of memory installed and it is defined this way in
`qconf -me ibm037`?
-- Reuti
> mail_list: johnt@ibm005
> notify: FALSE
> job_name: xclock
> jobshare: 0
> hard_queue_list: all.q@ibm037
> env_list: TERM=xterm,DISPLAY=dsls11:3. [..]
> script_file: xclock
> parallel environment: cores range: 7
> binding: NONE
> job_type: binary
> scheduling info: cannot run in queue "sim.q" because it is not
> contained in its hard queue list (-q)
> cannot run in queue "pc.q" because it is not
> contained in its hard queue list (-q)
> cannot run in PE "cores" because it only offers 0
> slots
>
>
>
>
>
> -----Original Message-----
> From: Reuti [mailto:[email protected]]
> Sent: Saturday, December 17, 2016 10:16
> To: Reuti
> Cc: John_Tai; [email protected]; Coleman, Marcus [JRDUS Non-J&J]
> Subject: Re: [gridengine users] John's cores pe (Was: users Digest...)
>
>
> Am 17.12.2016 um 11:34 schrieb Reuti:
>
>>
>> Am 17.12.2016 um 02:01 schrieb John_Tai:
>>
>>> It is working!! Thank you to all that replied to me and helped me figure
>>> this out.
>>>
>>> I meant to set the default to 2G so that was my mistake. I changed it to:
>>>
>>> virtual_free mem MEMORY <= YES YES 2G
>>> 0
>>
>> That's strange. A plain "2" was for me always two bytes. A "h_vmem" of 2
>> bytes would crash the job instantly when it got scheduled, but for
>> "virtual_free" (which is only a guidance for SGE how to distribute jobs) it
>> shouldn't hinder the scheduling at all.
>>
>> `man sge_types` also lists:
>>
>> If no multiplier is present, the value is just counted in bytes.
>
> We have set "-w e" in /usr/sge/default/common/sge_request, and then I even
> face an "Unable to run job: error: no suitable queues." This happens whether
> the low 2 byte value is specified in the complex definition `qconf -mc` or on
> the command line as "-l virutal_free=2".
>
> It turns out, that the minimum value which is being accepted is: 33.
>
> -- Reuti
>
>
>>
>>> And it's working now. Although I'm not sure why it affected the PE.
>>>
>>> Also I didn't set a global one, what is the purpose of the global one?
>>> Should I set it?
>>
>> No, it was only one place I would have checked too. The global complexes
>> therein can for example be used for a limit in the number of licenses of an
>> application you have and which can be used floating in the cluster (one
>> could prefer to put such a limit in an RQS though).
>>
>> If you would have set it up there, it would have been the "overall limit of
>> memory which can be used in the complete cluster at the same time".
>>
>> -- Reuti
>>
>>
>>> # qconf -se global
>>> hostname global
>>> load_scaling NONE
>>> complex_values NONE
>>> load_values NONE
>>> processors 0
>>> user_lists NONE
>>> xuser_lists NONE
>>> projects NONE
>>> xprojects NONE
>>> usage_scaling NONE
>>> report_variables NONE
>>>
>>>
>>> -----Original Message-----
>>> From: Reuti [mailto:[email protected]]
>>> Sent: Friday, December 16, 2016 7:36
>>> To: John_Tai
>>> Cc: Christopher Heiny; [email protected]; Coleman, Marcus [JRDUS Non-J&J]
>>> Subject: Re: [gridengine users] John's cores pe (Was: users Digest...)
>>>
>>>
>>>> Am 16.12.2016 um 09:53 schrieb John_Tai <[email protected]>:
>>>>
>>>> virtual_free mem MEMORY <= YES YES 2
>>>> 0
>>>
>>> This would mean, that the default consumption is 2 bytes. I already feared
>>> that a high values was programmed here. More suitable would be a default of
>>> 1G or so.
>>>
>>> Is there any virtual_free complex defined on a global level: qconf -se
>>> global
>>>
>>> -- Reuti
>>> ________________________________
>>>
>>> This email (including its attachments, if any) may be confidential and
>>> proprietary information of SMIC, and intended only for the use of the named
>>> recipient(s) above. Any unauthorized use or disclosure of this email is
>>> strictly prohibited. If you are not the intended recipient(s), please
>>> notify the sender immediately and delete this email from your computer.
>>>
>>
>>
>> _______________________________________________
>> users mailing list
>> [email protected]
>> https://gridengine.org/mailman/listinfo/users
>
> ________________________________
>
> This email (including its attachments, if any) may be confidential and
> proprietary information of SMIC, and intended only for the use of the named
> recipient(s) above. Any unauthorized use or disclosure of this email is
> strictly prohibited. If you are not the intended recipient(s), please notify
> the sender immediately and delete this email from your computer.
>
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users