HB Version 2.14 (yes sorry, cant upgrade) on Sparc Solaris10

Im having an issue that crm is respawning:

heartbeat[10117]: 2010/06/29_13:03:17 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/attrd":
heartbeat[10117]: 2010/06/29_13:03:26 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/crmd":
heartbeat[10117]: 2010/06/29_13:04:57 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/crmd":
heartbeat[10117]: 2010/06/29_13:06:28 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/crmd":
heartbeat[10117]: 2010/06/29_13:07:59 ERROR: Respawning client

its caused by

caused by:

crmd[11344]: 2010/06/29_13:33:38 ERROR: register_with_ha: Cannot sign on with
heartbeat: hb_api_signon: Can't initiate connection  to heartbeat
crmd[11344]: 2010/06/29_13:33:38 ERROR: config_query_callback: Local CIB query
resulted in an error: not connected

The Permissions seem correct tho:

egrep '(65|17)' /etc/passwd /etc/group
/etc/passwd:hacluster:x:17:65::/var/lib/heartbeat/cores/hacluster:/bin/sh
/etc/group:haclient::65:

the GID is not used for other names.

The issue is fixed with adding this to ha.cf

apiauth ipfail uid=hacluster
apiauth ccm uid=hacluster
apiauth ping gid=haclient uid=root
apiauth default gid=haclient

so basically i read another thread that there could be a gid-> name mapping 
problem. Im not certain which of the above lines are enough to fix it (i guess 
the last one?!). I dont know any other way to verify that the gid is correct 
here.

Any ideas?

Thank you

for reference, my ha.cf:

autojoin any
keepalive 3
deadtime 20
ucast lo 127.0.0.1
# normally crm on but debugging the issue here
crm respawn

apiauth ipfail uid=hacluster
apiauth ccm uid=hacluster
apiauth ping gid=haclient uid=root
apiauth default gid=haclient

logfile /var/log/ha-log
debugfile /var/log/ha-debug


      
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to