Hello everyone,
I have a cluster with the following configuration


node 1: pc-1
node 2: pc-2

primitive asterisk asterisk \
params user=root group=root maxfiles=65536 monitor_sipuri="sip:10.2.31.240"
primitive pingGW PingOnFailOver
primitive tftp lsb:tftpd-hpa \
        op monitor interval=30s \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=120s
primitive virtual-ip IPaddr2 \
        params ip=10.2.31.240 cidr_netmask=20
colocation et-cluster-dependency inf: virtual-ip asterisk pingGW tftp
order et-cluster-order inf: virtual-ip asterisk pingGW tftp


I have installed sipsak to use the ocf:heartbeat:asterisk RA
Asterisk has a binding on the virtual ip (bindnetaddr)
Resource asterisk doesn't start and I get the following errors with crm_mon


Online: [ pc-1 pc-2 ]

virtual-ip      (ocf::heartbeat:IPaddr2):       Started pc-2

Failed actions:
asterisk_start_0 on pc-1 'unknown error' (1): call=20, status=complete, last-rc-change='Fri Jul 24 18:0
9:05 2015', queued=0ms, exec=2131ms
asterisk_start_0 on pc-2 'unknown error' (1): call=25, status=complete, last-rc-change='Fri Jul 24 18:0
9:21 2015', queued=0ms, exec=2123ms


I tried to debug the RA as described here http://clusterlabs.org/wiki/Debugging_Resource_Failures, configured the cluster only with the virtual ip (10.2.31.240)

root@pc-1:~# echo $OCF_ROOT
/usr/lib/ocf
root@pc-1:~# export OCF_RESKEY_user=root
root@pc-1:~# export OCF_RESKEY_group=root
root@pc-1:~# export OCF_RESKEY_maxfiles=65536
root@pc-1:~# export OCF_RESKEY_monitor_sipuri=sip:10.2.31.240



root@pc-1:~# /usr/lib/ocf/resource.d/heartbeat/asterisk start ; echo $?
ERROR: /usr/lib/ocf/resource.d/heartbeat/asterisk: 1: kill: No such process
INFO: Asterisk PBX not running: removing old PID file
ERROR: Unable to connect to remote asterisk (does /var/run/asterisk/asterisk.ctl exist?)
INFO: Asterisk PBX not running yet
INFO: 0 active channels 0 active calls 0 calls processed
ERROR: command failed: sipsak -s sip:10.2.31.240
ERROR: Asterisk PBX start failed
1

root@pc-1:~# /usr/lib/ocf/resource.d/heartbeat/asterisk start ; echo $? INFO: Asterisk PBX already running
0


Running the script from shell I get those errors the first time, but asterisk starts, infact if I run the script again it results online. It happened that twice the resource started correctly with the others, but after a failover the resource didn't start on the other node.

What am I missing?

Thanks


P.s.

My conf is
Debian Jessie 8.1
Pacemaker 1.1.12
Corosync 2.3.4
crmsh 2.1.0
asterisk 13.4.0

------------------------------------------------------------------------
_______________________________________________
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Reply via email to