On my DNS servers, same answers : nslookup compute-0-4 > > Server: 127.0.0.1 > > Address: 127.0.0.1#53 > > >> Name: compute-0-4.local > > Address: 10.1.255.249 > >
Same result with ".local" at the end. > nslookup 10.1.255.249 > Server: 127.0.0.1 Address: 127.0.0.1#53 > 249.255.1.10.in-addr.arpa name = compute-0-4.local. Same thing directly from compute-0-4 2012/5/3 Simon Matthews <[email protected]> > > > On Thu, May 3, 2012 at 2:03 AM, Rémy Dernat <[email protected]> wrote: > >> Hi, >> >> Thanks for your answers. >> >> I tried with ".local" at the end. It did not change anything for qping on >> execd. >> The order for name resolution is hosts and then, bind, according to >> /etc/host.conf >> In nsswitch.conf, I have (for host resolution) : >> >>> hosts: files dns >> >> >> >> I have checked my named log file and SGE log files... No errors. I have >> another master node, and if I try the qping command to an exec node, it >> works !!!!? My /etc/hosts files are the same, and /etc/resolv.conf is the >> same too... I compared my bind configuration on both DNS masters without >> success. The bind versions are differents but configuration files seems to >> be the same. >> >> My first master is in the list of administrative host. >> >> How can I check more precisely what happens ? >> > > nslookup compute-0-4.local > > Then, get the IP address that this returns and run: > > nslookup <IP address> > > What does it return? > > Simon > > >> -Regards, >> >> Remy. >> >> 2012/5/3 Simon Matthews <[email protected]> >> >>> >>> >>> On Wed, May 2, 2012 at 9:06 AM, Rémy Dernat <[email protected]> wrote: >>> >>>> Hi, >>>> >>>> I have a very strange hostname/IP resolve problem : >>>> >>>> >>>> From qmaster, if I qping a compute node : >>>> >>>> qping compute-0-4 537 execd 1 >>>> >>>> >>>> Answer of compute-0-4 : >>>> >>>> endpoint compute-0-4.local/execd/1 at port 537: can't find connection >>>> access denied: client IP resolved to host name "". This is not >>>> identical to clients host name "" >>>> endpoint compute-0-4.local/execd/1 at port 537: can't find connection >>>> >>>> I have the same problem for all my nodes but this problem does not >>>> occur with qsub or qstat... ! So SGE is working fine (except that little >>>> error). >>>> >>>> I know that I have changed some informations in my /var/named/... files >>>> and in my /etc/hosts , but I can not find my error. >>>> >>>> The qping in the other direction, execd to qmaster, is working fine. >>>> >>>> The result of >>>> hostname -s >>>> on qmaster give the content of $SGE_ROOT/$SGE_CELL/common/act_qmaster >>>> on all nodes and on qmaster. >>>> >>> >>> >>> It looks like the mapping of IP address -> hostname does not match the >>> mapping of hostname -> IP address. Depending on how your network resolves >>> the IP address, this could be in: >>> /etc/hosts >>> NIS "hosts" map >>> DNS >>> etc.. >>> >>> You need to check the order specified in /etc/nsswitch.conf and then >>> examine each mechanism (files, NIS, DNS) to see what name the IP address >>> of compute-0-4 resolves to in the first mechanism that resolves it. >>> >>> Simon >>> >> >> >> _______________________________________________ >> users mailing list >> [email protected] >> https://gridengine.org/mailman/listinfo/users >> >> >
_______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
