HB Version 2.14 (yes sorry, cant upgrade) on Sparc Solaris10
Im having an issue that crm is respawning:
heartbeat[10117]: 2010/06/29_13:03:17 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/attrd":
heartbeat[10117]: 2010/06/29_13:03:26 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/crmd":
heartbeat[10117]: 2010/06/29_13:04:57 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/crmd":
heartbeat[10117]: 2010/06/29_13:06:28 ERROR: Respawning client
"/opt/heartbeat/lib/heartbeat/crmd":
heartbeat[10117]: 2010/06/29_13:07:59 ERROR: Respawning client
its caused by
caused by:
crmd[11344]: 2010/06/29_13:33:38 ERROR: register_with_ha: Cannot sign on with
heartbeat: hb_api_signon: Can't initiate connection to heartbeat
crmd[11344]: 2010/06/29_13:33:38 ERROR: config_query_callback: Local CIB query
resulted in an error: not connected
The Permissions seem correct tho:
egrep '(65|17)' /etc/passwd /etc/group
/etc/passwd:hacluster:x:17:65::/var/lib/heartbeat/cores/hacluster:/bin/sh
/etc/group:haclient::65:
the GID is not used for other names.
The issue is fixed with adding this to ha.cf
apiauth ipfail uid=hacluster
apiauth ccm uid=hacluster
apiauth ping gid=haclient uid=root
apiauth default gid=haclient
so basically i read another thread that there could be a gid-> name mapping
problem. Im not certain which of the above lines are enough to fix it (i guess
the last one?!). I dont know any other way to verify that the gid is correct
here.
Any ideas?
Thank you
for reference, my ha.cf:
autojoin any
keepalive 3
deadtime 20
ucast lo 127.0.0.1
# normally crm on but debugging the issue here
crm respawn
apiauth ipfail uid=hacluster
apiauth ccm uid=hacluster
apiauth ping gid=haclient uid=root
apiauth default gid=haclient
logfile /var/log/ha-log
debugfile /var/log/ha-debug
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems