Le 16/03/2018 à 03:41, Arnau a écrit :
> Hi,
> 
> is the name resolution working as expected? is xx.yy.zz = 
> invitado.uuab.ibt.unam.mx <http://invitado.uuab.ibt.unam.mx/> ? is the
> resolution of  invitado.uuab.ibt.unam.mx
> <http://invitado.uuab.ibt.unam.mx/> to 10.0.6.50 ? are you using fqdn or
> short names (ignore_fqdn) ?
> 

Dear Arnau

I've made some try to understand why this issue. But i failed ...

So, i instaled in te same machine a debian 8.10 version, install the
soge paquages v 8.1.9 and run into the qmaster and execd instalation
without any problem.

So i install this machine with debian 9.4. When i try to install the
soge package v 8.1.9, i've got this error:

sge depends on db5.3-util; however:
  Package db5.3-util is not installed.
 sge depends on libhwloc5 (>= 1.10.0); however:
  Package libhwloc5 is not installed.
 sge depends on libjemalloc1 (>= 2.1.1); however:
  Package libjemalloc1 is not installed.
 sge depends on libmunge2 (>= 0.5.8); however:
  Package libmunge2 is not installed.
 sge depends on libssl1.0.0 (>= 1.0.0); however:
  Package libssl1.0.0 is not installed.
 sge depends on libxm4 (>= 2.3.4); however:
  Package libxm4 is not installed.


So i decide to compile the SoGE source from gitlab, version
"9b99ef86b09516976fdad5cbd39af7066766" .

I could obtain the 3 deb packages sge sge-common sge-doc and install then.

The issue occurs at the same point of the configuration:

"Adding admin and submit hosts"

i get back:

error: commlib error: got read error (closing
"invitado.uuab.ibt.unam.mx/qmaster/1")
unable to contact qmaster using port 6444 on host
"invitado.uuab.ibt.unam.mx"

If i decide to not add some admin host, the same arror occurs
during "Creating the default <all.q> queue and <allhosts> hostgroup"

error: commlib error: got read error (closing
"invitado.uuab.ibt.unam.mx/qmaster/1")
unable to contact qmaster using port 6444 on host
"invitado.uuab.ibt.unam.mx"

Command failed: ./bin/lx-amd64/qconf -Ahgrp /tmp/hostqueue635
Probably a permission problem. Please check file access permissions.
Check root read/write permission. Check if SGE daemons are running.

And the program exit.

In other hand, i've check that the service of qmaster is on on the
machine, with "nc":

$ nc -zv invitado.uuab.ibt.unam.mx 20-7000

                                                              invitado:
Tue Mar 20 17:33:19 2018

invitado.uuab.ibt.unam.mx [10.0.6.50] 6444 (sge-qmaster) open
invitado.uuab.ibt.unam.mx [10.0.6.50] 22 (ssh) open

Next i I folow the  grid engine TroubleShooting, and select all of the
posible dl value (1 ->10), and save the log file, when i was trying to
run the above qconf command. I don't know if you want to check out some
of then?

Hope tat you could help us in this issue.

Regards


-- 
-- Jérôme
La violencia es el último recurso del incompetente.
        (Fundación, Isaac Azimov)
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to