Hello,
We have a computer cluster at our faculty built from nodes equipped with
two Intel Xeon E5-2695 v3 processors (i.e. 2x14 = 28 physical = 56 logical
cores per node), where we use SGE, or more precisely OGS/GE (OGS/GE
2011.11p1), to run and distribute jobs.
On one of these nodes we would like to create a "high priority queue" that
gives CPU resources preferentially to jobs submitted through it. If
necessary, this should reduce the CPU share of already running jobs that
were submitted earlier to this node through the "ordinary queue".
So far we have only experimented with the SGE/OGE queue parameter
"priority", which sets the "nice" value for jobs run in the given queue.
First we tested the value -10 (which appeared to be fully sufficient on an
ordinary workstation with 12 logical CPU cores, tested there without SGE,
just using the "nice" parameter) and later also -19.
With the node nearly fully loaded (i.e. 54-55 busy CPU slots out of the 56
available) by jobs submitted through the "ordinary queue", we submitted one
parallel (24-slot) job through the "high priority queue". We hoped to see
an effect similar to the one on our 12-logical-core workstation, i.e. that
the high priority job would get nearly 24 x 100% CPU usage at the expense
of the running jobs submitted through the "ordinary queue".
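
To make this expectation concrete, here is a rough sketch of the share we
had in mind. It assumes the node runs a Linux kernel with the CFS
scheduler, where each nice step changes a task's weight by a factor of
about 1.25 (an approximation of the kernel's weight table). The function
name expected_share is ours and purely illustrative. The model covers only
two CPU-bound tasks competing for a single core; it ignores load balancing
across cores, hyper-threading and busy-waiting in MPI.

```python
def cfs_weight(nice: int) -> float:
    """Approximate CFS load weight for a nice value (nice 0 -> 1024).

    Assumption: each nice step scales the weight by about 1.25.
    """
    if not -20 <= nice <= 19:
        raise ValueError("nice must be in the range -20..19")
    return 1024.0 / (1.25 ** nice)


def expected_share(nice_high: int, nice_low: int = 0) -> float:
    """Fraction of ONE core that the first task gets when two CPU-bound
    tasks with the given nice values compete for that core."""
    w_high = cfs_weight(nice_high)
    w_low = cfs_weight(nice_low)
    return w_high / (w_high + w_low)


if __name__ == "__main__":
    for nice in (0, -10, -19):
        print(f"nice {nice:>3}: expected share {expected_share(nice):.2f}")
```

Under these assumptions a task at nice -10 competing with a task at nice 0
would get roughly 90% of the shared core, which is close to what we saw on
the workstation.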
We performed this test with a parallel MPI job (pmemd.MPI, molecular
dynamics) and then another test with a GAMESS job (QM), where
parallelization is accomplished using TCP/IP sockets and System V shared
memory.
Unfortunately, neither test met our expectations.
SGE successfully assigned the "nice" value -10, and later -19, to the job
submitted through the "high priority queue", but this was not reflected
properly in the allocation of CPU resources to that job. The result was
quite different from, and much less satisfactory than, our preliminary
experiments (without SGE, just using the "nice" parameter) on the ordinary
workstation with 12 logical CPU cores.
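
For reference, the nice value of a process can be checked with a minimal
Python sketch like the one below (Unix only). The helper name nice_of is
hypothetical, and the demo prints the value for the script itself rather
than for an SGE job.

```python
import os


def nice_of(pid: int = 0) -> int:
    """Return the nice value of process `pid` (0 = the calling process)."""
    return os.getpriority(os.PRIO_PROCESS, pid)


if __name__ == "__main__":
    # Replace os.getpid() with the PID of a job process to inspect it.
    pid = os.getpid()
    print(f"PID {pid}: nice = {nice_of(pid)}")
```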
Please see the relevant screenshots here:
http://physics.ujep.cz/~mmaly/SCREENS/
I would be grateful for any comments or tips that could help us solve our
problem with the high priority queue.
Thank you very much in advance,
Best wishes,
Marek