[Linux-HA] Error while running OCF scripts
Hi. Heartbeat version is heartbeat-3.0.4-1.fc15.1.i686. While testing ocf script IPaddr2, it returned an error message "./IPaddr2: line 68: /lib/heartbeat/ocf-shellfuncs: No such file or directory" [root@node1 heartbeat]# ./IPaddr2 status ./IPaddr2: line 68: /lib/heartbeat/ocf-shellfuncs: No such file or directory When I referred the IPaddr2 code, I found that ${OCF_ROOT} is not getting initialized, resulting an error. # Initialization: : ${OCF_FUNCTIONS_DIR=${OCF_ROOT}/lib/heartbeat} . ${OCF_FUNCTIONS_DIR}/ocf-shellfuncs So, I replaced the above 2 lines with the below one in IPaddr2 and tested it, it was working fine. . /usr/lib/ocf/lib/heartbeat/ocf-shellfuncs And also tested the script by replacing with the below one also and was working fine. . /usr/lib/ocf/resource.d/heartbeat/.ocf-shellfuncs So, is this a bug in ocf scripts or I have missed some step? Please suggest me. Regards, Padmaja. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] configuring monitor action in OCF script needed??
Hi All, Can we have an OCF script without configuring monitor action. If yes, then what action will heartbeat calls first. Please help. Thanks, lakshmi ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] logging errors
Hi I have run the heartbeat with resource agent as the resource and it was running well. But now, when I am testing the heartbeat with same resource agent with CTS. It is raising the error. I conducted the CTS testing with httpd as my resource, and CTS testing was successfull. So, I am confused, why I am getting these log messages, while conducting CTS. The error messages are Aug 11 11:57:35 Random seed is: 1250006255 Aug 11 11:57:35 BEGINNING 60 TESTS Aug 11 11:57:35 HA configuration directory: /etc/ha.d Aug 11 11:57:35 System log files: /var/log/ha-log-local7 Aug 11 11:57:35 Enable Stonith: 1 Aug 11 11:57:35 Enable Fencing: 1 Aug 11 11:57:35 Enable Standby: 1 Aug 11 11:57:35 Cluster nodes: Aug 11 11:57:36 * node1: e3037ebc-763a-4fca-9ed6-b8144e3e68f0 Aug 11 11:57:36 * node2: 9658d60a-bc2c-4e03-b23a-990f38b374c8 Aug 11 11:57:37 Stopping Cluster Manager on all nodes Aug 11 11:57:37 Starting Cluster Manager on all nodes. Aug 11 12:00:45 BadNews: Aug 11 11:55:35 node1 crmd: [31109]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node2 crmd: [13497]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: native_add_running: Resource ocf::RA_auth:RA_auth_2 appears to be active on 2 nodes. Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: See http://linux-ha.org/v2/faq/resource_too_active for more information. Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: native_create_actions: Attempting recovery of resource RA_auth_2 Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: process_pe_message: Transition 4: ERRORs found during PE processing. PEngine Input stored in: /var/lib/heartbeat/pengine/pe-error-49.bz2 Aug 11 12:00:49 Running test SpecialTest1 (node2) [1] Aug 11 12:03:49 BadNews: Aug 11 11:59:07 node2 crmd: [14052]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Aug 11 12:03:49 BadNews: Aug 11 11:59:26 node1 crmd: [31932]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error My ha.cf file is logfacility local7 keepalive 2 deadtime 30 warntime 10 initdead 60 udpport 694 ucast eth0 172.25.149.254 auto_failback on node node1 node2 use_logd yes crm on So please help me out with this ! Regards, Padmaja. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] logging errors
Hi I have run the heartbeat with resource agent as the resource and it was running well. But now, when I am testing the heartbeat with same resource agent with CTS. It is raising the error. I conducted the CTS testing with httpd as my resource, and CTS testing was successfull. So, I am confused, why I am getting these log messages, while conducting CTS ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] logging erros
___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] Logging errors
Hi All, I have set the CTS configurations for running the test cases. And I am getting these errors. My ha.cf is logfacility local7 keepalive 2 deadtime 30 warntime 10 initdead 60 udpport 694 ucast eth0 172.25.149.254 auto_failback on node node1 node2 use_logd yes crm on Aug 11 11:57:35 Random seed is: 1250006255 Aug 11 11:57:35 BEGINNING 60 TESTS Aug 11 11:57:35 HA configuration directory: /etc/ha.d Aug 11 11:57:35 System log files: /var/log/ha-log-local7 Aug 11 11:57:35 Enable Stonith: 1 Aug 11 11:57:35 Enable Fencing: 1 Aug 11 11:57:35 Enable Standby: 1 Aug 11 11:57:35 Cluster nodes: Aug 11 11:57:36 * node1: e3037ebc-763a-4fca-9ed6-b8144e3e68f0 Aug 11 11:57:36 * node2: 9658d60a-bc2c-4e03-b23a-990f38b374c8 Aug 11 11:57:37 Stopping Cluster Manager on all nodes Aug 11 11:57:37 Starting Cluster Manager on all nodes. Aug 11 12:00:45 BadNews: Aug 11 11:55:35 node1 crmd: [31109]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node2 crmd: [13497]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: native_add_running: Resource ocf::RA_auth:RA_auth_2 appears to be active on 2 nodes. Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: See http://linux-ha.org/v2/faq/resource_too_active for more information. Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: native_create_actions: Attempting recovery of resource RA_auth_2 Aug 11 12:00:45 BadNews: Aug 11 11:55:55 node1 pengine: [31118]: ERROR: process_pe_message: Transition 4: ERRORs found during PE processing. PEngine Input stored in: /var/lib/heartbeat/pengine/pe-error-49.bz2 Aug 11 12:00:49 Running test SpecialTest1 (node2) [1] Aug 11 12:03:49 BadNews: Aug 11 11:59:07 node2 crmd: [14052]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Aug 11 12:03:49 BadNews: Aug 11 11:59:26 node1 crmd: [31932]: ERROR: process_lrm_event: LRM operation RA_auth_2_monitor_0 (call=3, rc=1) Error unknown error Please help me out. Awaiting your help!!! Regards, Padmaja. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] OCF_ERROR_GENERIC
Hi all, I have a strange issue, that ocf_error_generic is being ingored at times. For example, suppose in the start function of the ocf script if I explicity return ocf_error_generic, the services should shift to the secondary node as of my knowledge, but that's not happening. Instead, heartbeat is calling monitor function again and again. But if I explicity mention exit(1) in the start function, then the heartbeat shifts its services to the secondary node. Same is the case with any function in the ocf script. So, why this is happening!! Please help me out. Regards, padmaja ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] which action first
Hi All, I want to know that when heartbeat is started, which function does it calls first? Is it "monitor" or "start"? Waiting for reply, Regards, Padmaja ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] monitor OCF script
Hi All, If the return value of monitor action is other than 0 and 7, then does it 1. Tries to start the resource in the primary node itself or 2. Does it shift to secondary node Awaiting your help!! Padmaja. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] monitor
Hi all, ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] OCF_RESKEY_monitor_scripts
Hi all, I have gone through monitor_script in Xen RA. monitor_script in Xen RA is used in the monitor section of the RA.It calls the script to check if the resources inside the Virtual Machine are still running. Could any one help me in implementing script for OCF_RESKEY_monitor_script parameter in the monitor section . Thanks in Advance, Maddali ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] OCF Resource Agents ( heartbeat )
Hi all, I have seen parameter monitor_scripts in the Xen RA. i didn't found such hooks for other virtualization techniques.I was trying to write script manually to get the status of resources present in virtual machine.can anyone help me how to proceed to find the status of resources present in VM. Thanks in Advance. ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
[Linux-HA] OCF Resource Agents ( heartbeat )
Hi all, How can we monitor the status of resources that are present in virtual machine . will it be possible. Thanks in advance ___ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems