On 16/03/2018 at 03:41, Arnau wrote:
> Hi,
>
> is the name resolution working as expected? is xx.yy.zz =
> invitado.uuab.ibt.unam.mx ? is the resolution of
> invitado.uuab.ibt.unam.mx to 10.0.6.50 ? are you using fqdn or
> short names (ignore_fqdn) ?
>
Dear Arnau,
I made several attempts to understand the cause of this issue, but I failed.
So I installed Debian 8.10 on the same machine, installed the SoGE
packages v8.1.9, and ran the qmaster and execd installation
without any problem.
Then I reinstalled this machine with Debian 9.4. When I try to install the
SoGE package v8.1.9, I get this error:
sge depends on db5.3-util; however:
Package db5.3-util is not installed.
sge depends on libhwloc5 (>= 1.10.0); however:
Package libhwloc5 is not installed.
sge depends on libjemalloc1 (>= 2.1.1); however:
Package libjemalloc1 is not installed.
sge depends on libmunge2 (>= 0.5.8); however:
Package libmunge2 is not installed.
sge depends on libssl1.0.0 (>= 1.0.0); however:
Package libssl1.0.0 is not installed.
sge depends on libxm4 (>= 2.3.4); however:
Package libxm4 is not installed.
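
For reference, the missing package names can be pulled out of dpkg output like the above with a small filter. This is only a sketch: the function name `missing_packages` is hypothetical, and it assumes the messages keep the exact "Package X is not installed." wording shown here.

```shell
#!/bin/sh
# Hypothetical helper: print the package names that dpkg reported as
# "not installed". Reads dpkg's error output on stdin and prints one
# package name per line.
missing_packages() {
    sed -n 's/^[[:space:]]*Package \([^[:space:]]*\) is not installed\.$/\1/p'
}

# Example with two of the message pairs quoted above.
printf '%s\n' \
    'sge depends on db5.3-util; however:' \
    'Package db5.3-util is not installed.' \
    'sge depends on libmunge2 (>= 0.5.8); however:' \
    'Package libmunge2 is not installed.' | missing_packages
```

The resulting names could then be passed to `apt-get install`. Whether each of them is available in the Debian 9.4 repositories is not established here.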
So I decided to compile the SoGE source from GitLab, version
"9b99ef86b09516976fdad5cbd39af7066766".
I was able to build the 3 deb packages (sge, sge-common, sge-doc) and install them.
The issue occurs at the same point of the configuration,
"Adding admin and submit hosts",
where I get back:
error: commlib error: got read error (closing
"invitado.uuab.ibt.unam.mx/qmaster/1")
unable to contact qmaster using port 6444 on host
"invitado.uuab.ibt.unam.mx"
If I choose not to add any admin host, the same error occurs
during "Creating the default <all.q> queue and <allhosts> hostgroup":
error: commlib error: got read error (closing
"invitado.uuab.ibt.unam.mx/qmaster/1")
unable to contact qmaster using port 6444 on host
"invitado.uuab.ibt.unam.mx"
Command failed: ./bin/lx-amd64/qconf -Ahgrp /tmp/hostqueue635
Probably a permission problem. Please check file access permissions.
Check root read/write permission. Check if SGE daemons are running.
And the program exits.
On the other hand, I checked with "nc" that the qmaster service is
listening on the machine:
$ nc -zv invitado.uuab.ibt.unam.mx 20-7000
invitado:
Tue Mar 20 17:33:19 2018
invitado.uuab.ibt.unam.mx [10.0.6.50] 6444 (sge-qmaster) open
invitado.uuab.ibt.unam.mx [10.0.6.50] 22 (ssh) open
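
On the name-resolution question quoted at the top, the nc output above already shows the name resolving to 10.0.6.50. As a complementary local check, the addresses that the hosts file maps to the name can be listed. This is a minimal sketch: the function name `hosts_addresses` is hypothetical, it only reads a hosts-format file (default `/etc/hosts`), and it matches names case-sensitively. It does not query DNS.

```shell
#!/bin/sh
# Hypothetical helper: print every address that a hosts-format file maps
# to the given name.
# $1 = host name, $2 = hosts file (defaults to /etc/hosts).
hosts_addresses() {
    awk -v name="$1" '
        { sub(/#.*/, "") }   # drop comments; awk re-splits the fields
        { for (i = 2; i <= NF; i++) if ($i == name) print $1 }
    ' "${2:-/etc/hosts}"
}

# Prints nothing if the name has no entry in /etc/hosts.
hosts_addresses invitado.uuab.ibt.unam.mx
```

More than one line of output, or an address other than 10.0.6.50, would mean the name is mapped differently in the hosts file. `getent hosts invitado.uuab.ibt.unam.mx` gives the resolver-level answer for comparison.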
Next, I followed the Grid Engine troubleshooting guide, tried every
possible dl value (1 to 10), and saved the log file each time while
running the qconf command above. I don't know whether you would like to
look at some of them?
I hope that you can help us with this issue.
Regards
--
-- Jérôme
Violence is the last refuge of the incompetent.
(Foundation, Isaac Asimov)