Dear all,
I'm facing a strange problem with some parallel programs.
I ran a job in a queue with a 24-hour time limit. For this job,
qacct reports the following (4 cores):
qname all.q
hostname compute-0-3.local
group estudiante
owner xairarg
project NONE
department defaultdepartment
jobname RespBact
jobnumber 1335842
taskid 24
account sge
priority 0
qsub_time Sat May 2 20:29:57 2020
start_time Sat May 2 22:19:00 2020
end_time Sun May 3 19:40:54 2020
granted_pe thread
slots 4
failed 0
exit_status 0
ru_wallclock 76914s
ru_utime 1128016.632s
ru_stime 2191.568s
ru_maxrss 20.811MB
ru_ixrss 0.000B
ru_ismrss 0.000B
ru_idrss 0.000B
ru_isrss 0.000B
ru_minflt 351497264
ru_majflt 0
ru_nswap 0
ru_inblock 71047120
ru_oublock 1087912
ru_msgsnd 0
ru_msgrcv 0
ru_nsignals 0
ru_nvcsw 43908490
ru_nivcsw 80940156
cpu 1130208.200s
mem 13575.491TBs
io 2.022GB
iow 0.000s
maxvmem 20.813GB
arid undefined
ar_sub_time undefined
category -pe thread 4
This job ran for approximately 21h22. The problem is that qacct reports a
CPU time of 1130208 seconds, instead of at most 4*76914 = 307656. That is
roughly 3.7 times more, as if the job had been using about 15 cores instead of 4!
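To make the arithmetic explicit, here is a small sketch in Python that recomputes the numbers from the qacct record above. The values are copied from the output; the function name `cpu_usage_summary` is only an illustration and not part of any Grid Engine tool.

```python
# Values copied from the qacct record above.
ru_wallclock = 76914        # seconds
ru_utime = 1128016.632      # seconds
ru_stime = 2191.568         # seconds
cpu = 1130208.200           # seconds
slots = 4


def cpu_usage_summary(cpu_s, wallclock_s, slots):
    """Hypothetical helper: compare reported CPU time with wallclock * slots."""
    expected_max = wallclock_s * slots       # CPU time if all slots were 100% busy
    ratio = cpu_s / expected_max             # how far the report exceeds that bound
    effective_cores = cpu_s / wallclock_s    # average number of busy cores implied
    return expected_max, ratio, effective_cores


expected_max, ratio, effective_cores = cpu_usage_summary(cpu, ru_wallclock, slots)

# In this record the cpu field equals user time plus system time.
utime_plus_stime = ru_utime + ru_stime

print(f"expected max CPU time: {expected_max} s")
print(f"reported / expected:   {ratio:.2f}")
print(f"implied busy cores:    {effective_cores:.2f}")
print(f"ru_utime + ru_stime:   {utime_plus_stime:.3f} s")
```

So in this record the cpu field is just ru_utime + ru_stime, and it implies noticeably more busy cores than the 4 slots granted.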
I remember someone mentioning this problem on the list.
What's wrong with this accounting?
Regards.
--
-- Jérôme
-Ah, obviously I'm not at the level of the old masters yet, but still,
it's a start.
-Oh, it's a promising start. But you see, if I were in my own home, as you
so kindly put it, well, I'd put it somewhere else.
-What did I tell you, it would be better near the window. Where would
you put it?
-In the cellar.
(Michel Audiard)