On 05/16/2013 02:07 AM, Reuti wrote:
Am 16.05.2013 um 01:00 schrieb Orion Poplawski:
On 05/15/2013 04:20 PM, Dave Love wrote:
Orion Poplawski <[email protected]> writes:
On 05/13/2013 09:40 AM, Orion Poplawski wrote:
Would it be possible for qhost -xml output to include the number of slots used
by a job on that host?
Okay, I see how it is done - there are multiple job entries for each
slot.
Indeed, like the non-XML version. The SGE support
in<https://oss.trac.surfsara.nl/jobmonarch> should provide an example of
similar processing from qstat output.
Job 31911 is a 4 slot pe job - why are there 5 entries (one of which
is a "MASTER")?
Because the PE has job_is_first_task false?
It is, but something doesn't jive:
job_is_first_task
The job_is_first_task parameter can be set to TRUE or FALSE. A value of
TRUE indicates
that the Sun Grid Engine job script already contains one of the tasks of
the parallel
application (the number of slots reserved for the job is the number of
slots requested
with the -pe switch), while a value of FALSE indicates that the job
script (and its
child processes) is not part of the parallel program (the number of
slots reserved for
the job is the number of slots requested with the -pe switch + 1).
My -pe mpi 4 job:
qstat:
job-ID prior name user state submit/start at queue
slots ja-task-ID
-----------------------------------------------------------------------------------------------------------------
31923 0.60188 osu_bw-int orion r 05/15/2013 16:56:58
[email protected] 4
With `qstat -g t` you get the similar output to the one below. Whether it shows
3 or 4 slaves depends as noted on the setting of job_is_first_task and tries to
reflect what's granted to the job. That the master process is supposed to be
idling (according to your setup) you can't get from the plain output of course.
-- Reuti
That isn't getting to my question though. The man page says "the number of
slots reserved for the job is the number of slots requested with the -pe
switch + 1". But I don't see any evidence of that, unless I don't understand
what "slots reserved" means. If I submit a -pe 8 job, it will run on a host
with 8 slots. As near as I can tell, the parenthetical remarks should just be
removed.
--
Orion Poplawski
Technical Manager 303-415-9701 x222
NWRA, Boulder/CoRA Office FAX: 303-415-9702
3380 Mitchell Lane [email protected]
Boulder, CO 80301 http://www.nwra.com
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users