Compare the contents of $SGE_ROOT/$SGE_CELL/act_qmaster to what you have in /etc/hosts -- the act_qmaster file contains the hostname for what SGE believes is the qmaster. That hostname needs to resolve perfectly in DNS or in your /etc/hosts file.

You can also experiment with the $SGE_ROOT/utilbin/gethostname and gethostbyname etc. commands to see how SGE resolves the local naming environment

And finally make sure that you have ports 6444 and 6445 open on your firewall!



Carlos Scaloni wrote:

Hi friends

I installed the sge6_2u5 but when i try to start the qmaster i see this:

/etc/init.d/sgemaster.p6444
    starting sge_qmaster

sge_qmaster start problem

sge_qmaster didn't start!


and in /tmp/sge_messages.txt :

05/26/2011 20:44:53|  main|proyecto|C|abort qmaster startup due to
communication errors

I don't know what the problem is!


I installed it with: sudo ./install_qmaster The installation finished
without any error!
Options that i used: admin user is sgeadmin, sge_qmaster port 6444,
sge_execd port 6445, classic pooling, gid range 20000-21000
the rest options by default!

I try to start it with: /etc/init.d/sgemaster.p6444

The file /tmp/sge_message contains this:
05/26/2011 20:42:39| main|proyecto|C|abort qmaster startup due to
communication errors
05/26/2011 20:44:53| main|proyecto|C|abort qmaster startup due to
communication errors

My hostname is:
hostname
proyecto.local

And i have /etc/host so:

cat /etc/hosts
127.0.0.1 localhost
::1 localhost
192.168.56.101 proyecto.local
10.0.2.15 proyecto.local


Can anyone help me, please??

thanks in advance


_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to