Re: [gridengine users] Separate priority calculation for GPU and non-GPU queues

pavel Tue, 02 May 2017 09:54:46 -0700

(there was a confusion with e-mails on and off the list, I apologize)Thanks, 
Reuti! I created a project and added it as a separate sub-tree:# qconf 
-sstreeid=0name=roottype=0shares=1childnodes=1,2id=1name=defaulttype=0shares=1childnodes=NONEid=2name=gpuprojtype=1shares=1childnodes=NONEIt
 is now enforced via JSV on all GPU jobs. Now, what do I need to change in my 
scheduler config?# qconf -ssconfalgorithm &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; defaultschedule_interval 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 0:0:10maxujobs &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp;1900queue_sort_method &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; seqnojob_load_adjustments &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp;np_load_avg=0.50load_adjustment_decay_time &nbsp; &nbsp; &nbsp; 
&nbsp;0:7:30load_formula &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp;np_load_avgschedd_job_info &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; trueflush_submit_sec &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;0flush_finish_sec &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;0params &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp;nonereprioritize_interval &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
0:0:0halftime &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp;48usage_weight_list &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; 
cpu=0.333000,mem=0.333000,io=0.334000compensation_factor &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; 5.000000weight_user &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 1.000000weight_project &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp;0.250000weight_department &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; 0.250000weight_job &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;0.000000weight_tickets_functional 
&nbsp; &nbsp; &nbsp; &nbsp; 9800weight_tickets_share &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp;200share_override_tickets &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp;TRUEshare_functional_shares &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; TRUEmax_functional_jobs_to_schedule &nbsp; 2000report_pjob_tickets 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; TRUEmax_pending_tasks_per_job 
&nbsp; &nbsp; &nbsp; &nbsp; 50halflife_decay_list &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; cpu=1440:mem=1440:io=1440policy_hierarchy &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;FOSweight_ticket &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
0.500000weight_waiting_time &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
1.000000weight_deadline &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; 0.000000weight_urgency &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp;0.000000weight_priority &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; 0.500000max_reservation &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 0default_duration &nbsp; &nbsp; &nbsp; 
&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; 
&nbsp;0:10:0Thanks!&nbsp;Pavel---------------------Sent 
from&nbsp;Reuti&nbsp;&lt;[email protected]&gt;&nbsp;on&nbsp;02.05.2017 
- 3:38 pm:Hi,


&gt; Am 02.05.2017 um 15:11 schrieb [email protected]:
&gt; 
&gt; we run SGE 8.1.9 in a cluster with many multi-core machines and some GPU 
machines. We were able to configure different queues with different complexes 
and the scheduling works fine in general. What I'm not yet happy with is the 
way the priority is calculated.
&gt; 
&gt; A typical user will submit a few GPU jobs and many non-GPU jobs. The two 
queues are strictly separated - no GPU jobs will run on a non-GPU machine and 
no non-GPU jobs will run on a GPU machine. Unfortunately, the regular CPU-jobs 
that run on non-GPU machines seem to reduce the priority of the pending GPU 
jobs. This seems to penalize GPU-users even if their GPU consumption has been 
very low for a long time.

Yes, that's understandable.


&gt; Since the GPU-per-user ratio is very low (~1.5), I would like the SGE to 
keep scheduling the CPU jobs as before, but do something different for GPU 
jobs. Ideally, I would like it to ignore all non-GPU usage when calculating 
priority for GPU jobs.
&gt; 
&gt; What is the best mechanism to achieve this separation? I could install two 
instances of SGE (one for GPU, one for non-GPU queues) so that their schedulers 
are 100% independent, but it is difficult to maintain and I would prefer 
anything more "native".

Are you using projects right now? Either voluntarily of forced by a JSV, the 
GPU jobs could be assigned to a (different) project in a share tree policy and 
get a different scheduling behavior this way.

-- Reuti

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] Separate priority calculation for GPU and non-GPU queues

Reply via email to