I have no idea if this is the solution, but we had an issue with Rocks and the 
head node where the daemon wouldn't start properly due to the private interface 
being on eth0. It would spit out a message similar to what you posted.

The solution was to create the host_aliases file under default/common:

echo "$(/bin/hostname -s).local $(/bin/hostname --fqdn) $(/bin/hostname -s)" > $SGE_ROOT/default/common/host_aliases

Perhaps something similar needs to be done for the login node since it's 
multihomed.
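
As a rough sketch of what that might look like for the login node, using the 
names from your message: as far as I understand host_aliases(5), the first 
name on each line is the one Grid Engine treats as the host's unique name, and 
the rest are aliases for it. The alias_line helper and the ALIASES_FILE 
variable below are made up for illustration, and the scratch path is an 
assumption so that nothing live gets touched while testing.

```shell
#!/bin/sh
# Sketch only: build a host_aliases line for a multihomed host.
# First name = the unique name Grid Engine should use; the rest are aliases.

# alias_line PRIVATE_SHORT_NAME PUBLIC_FQDN   (hypothetical helper)
alias_line() {
    printf '%s.local %s %s\n' "$1" "$2" "$1"
}

# Hypothetical scratch file; defaults to /tmp so the live config is untouched.
ALIASES_FILE="${ALIASES_FILE:-/tmp/host_aliases.sketch}"

# Names taken from the original message.
alias_line login-1-1 login-node.xxx.uci.edu > "$ALIASES_FILE"
cat "$ALIASES_FILE"
```

If the line looks right, it could be appended to 
$SGE_ROOT/default/common/host_aliases next to the head node entry. Daemons 
that are already running may need a restart before they pick up the change.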

-----Original Message-----
From: [email protected] [mailto:[email protected]] On 
Behalf Of Joseph Farran
Sent: Wednesday, May 09, 2012 4:10 PM
To: [email protected] Users
Subject: [gridengine users] Installing OGE on Rocks Login Node

Hello.

I have a cluster running Rocks 5.4.3 that I originally set up with Torque/Maui. 
I am testing Open Grid Scheduler using the ge2011.11.tar distribution.

I set up OGE on the master head node and was also able to set up 6 compute 
nodes using "start_gui_installer" on the head node. All 6 compute nodes were 
set up without any issues.

Everything works except that I cannot set up our login node. The login node 
has both private and public network interfaces. I want to set up our login 
node "login-node.xxx.uci.edu" as an Execution and Submit node.

When I try to set up our Rocks login node using the private name of login-1-1, 
it complains with:

     The error message was:
        error: commlib error: access denied (client IP resolved to host name "login-1-1.local". This is not identical to clients host name "login-node.xxx.uci.edu")
     ERROR: unable to contact qmaster using port 6444 on host "headnode.local"

So then I tried installing OGE using the public name "login-node.xxx.uci.edu", 
and it also complains. As soon as I enter "login-node.xxx.uci.edu", the state 
column turns red with "Resolvable" and the "Install" GUI button is greyed out, 
so I cannot continue.

It looks like OGE is confused about the actual fully qualified name of our 
login node. The FQDN is "login-node.xxx.uci.edu", but neither name seems to 
work.

What is the correct way to get around this?

Joseph
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
