Im facing issues with my cluster setup. "N+1" Pacemaker Hosting Oracle 11g Instances. Node name "azteca"
I cannot get "oralsnr" to start my DB listener, it refuses on both nodes. "Oracle" RA is starting first, after all File systems and VIP starts. But no way to get Listener UP. When I do a manual start from "/oracle/11.2.0/db_1/bin/lsnrctl start" it works just fine. (Using oracle "user" shell prompt) CRM Config "Oracle RA" primitive p_oracle1 ocf:heartbeat:oracle \ params sid="xib11" home="/oracle/11.2.0/db_1" user="oracle" ipcrm="orauser" \ op start interval="0" timeout="120s" \ op stop interval="0" timeout="120s" \ op monitor interval="15s" primitive p_oralsnr ocf:heartbeat:oralsnr \ params sid="xib11" listener="LISTENER" user="oracle" home="/oracle/11.2.0/db_1" \ op start interval="0" timeout="30s" \ op stop interval="0" timeout="30s" \ op monitor interval="15s" group oracle_grp p_oracle1 p_oralsnr \ meta target-role="Started" order o_fs_before_listener inf: oracle_fs oracle_grp colocation ora_on_fs inf: oracle_grp oracle_fs >>>>>>>> ERROR LOG: azteca:/var/log # cat messages | grep p_oralsnr Jun 5 17:02:24 azteca crmd: [24262]: info: do_lrm_rsc_op: Performing key=20:900:7:8bf8ffb9-cc40-42c5-9dfa-cdb84ec20d97 op=p_oralsnr_monitor_0 ) Jun 5 17:02:24 azteca lrmd: [24259]: info: rsc:p_oralsnr probe[401] (pid 9369) Jun 5 17:02:24 azteca lrmd: [24259]: info: operation monitor[401] on p_oralsnr for client 24262: pid 9369 exited with return code 7 Jun 5 17:02:24 azteca crmd: [24262]: info: process_lrm_event: LRM operation p_oralsnr_monitor_0 (call=401, rc=7, cib-update=812, confirmed=true) not running Jun 5 17:02:34 azteca crmd: [24262]: info: do_lrm_rsc_op: Performing key=64:900:0:8bf8ffb9-cc40-42c5-9dfa-cdb84ec20d97 op=p_oralsnr_start_0 ) Jun 5 17:02:34 azteca lrmd: [24259]: info: rsc:p_oralsnr start[404] (pid 11102) Jun 5 17:02:34 azteca lrmd: [24259]: info: operation start[404] on p_oralsnr for client 24262: pid 11102 exited with return code 1 Jun 5 17:02:34 azteca crmd: [24262]: info: process_lrm_event: LRM operation p_oralsnr_start_0 (call=404, rc=1, cib-update=814, confirmed=true) unknown error Jun 5 17:02:34 azteca attrd: [24260]: info: find_hash_entry: Creating hash entry for fail-count-p_oralsnr Jun 5 17:02:34 azteca attrd: [24260]: info: attrd_trigger_update: Sending flush op to all hosts for: fail-count-p_oralsnr (INFINITY) Jun 5 17:02:34 azteca attrd: [24260]: info: attrd_perform_update: Sent update 4121: fail-count-p_oralsnr=INFINITY Jun 5 17:02:34 azteca attrd: [24260]: info: find_hash_entry: Creating hash entry for last-failure-p_oralsnr Jun 5 17:02:34 azteca attrd: [24260]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-p_oralsnr (1338933754) Jun 5 17:02:34 azteca attrd: [24260]: info: attrd_perform_update: Sent update 4124: last-failure-p_oralsnr=1338933754 Jun 5 17:02:34 azteca crmd: [24262]: info: do_lrm_rsc_op: Performing key=17:901:0:8bf8ffb9-cc40-42c5-9dfa-cdb84ec20d97 op=p_oralsnr_stop_0 ) Jun 5 17:02:34 azteca lrmd: [24259]: info: rsc:p_oralsnr stop[405] (pid 11210) Jun 5 17:02:34 azteca lrmd: [24259]: info: operation stop[405] on p_oralsnr for client 24262: pid 11210 exited with return code 0 Jun 5 17:02:34 azteca crmd: [24262]: info: process_lrm_event: LRM operation p_oralsnr_stop_0 (call=405, rc=0, cib-update=816, confirmed=true) ok Jun 5 17:06:41 azteca attrd: [24260]: info: attrd_trigger_update: Sending flush op to all hosts for: fail-count-p_oralsnr (<null>) Jun 5 17:06:41 azteca attrd: [24260]: info: attrd_perform_update: Sent delete 4130: node=azteca, attr=fail-count-p_oralsnr, id=<n/a>, set=(null), section=status Jun 5 17:06:41 azteca attrd: [24260]: info: attrd_perform_update: Sent delete 4132: node=azteca, attr=fail-count-p_oralsnr, id=<n/a>, set=(null), section=status Jun 5 17:06:41 azteca attrd: [24260]: info: attrd_perform_update: Sent delete 4134: node=azteca, attr=fail-count-p_oralsnr, id=<n/a>, set=(null), section=status Jun 5 17:06:42 azteca attrd: [24260]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-p_oralsnr (1338933754) Jun 5 17:06:42 azteca crmd: [24262]: info: do_lrm_invoke: Removing resource p_oralsnr from the LRM Jun 5 17:06:42 azteca crmd: [24262]: info: do_lrm_invoke: Resource 'p_oralsnr' deleted for 3880_crm_resource on tlg0025 Jun 5 17:06:42 azteca crmd: [24262]: info: notify_deleted: Notifying 3880_crm_resource on tlg0025 that p_oralsnr was deleted Jun 5 17:06:42 azteca crmd: [24262]: info: send_direct_ack: ACK'ing resource op p_oralsnr_delete_60000 from 0:0:crm-resource-3880: lrm_invoke-lrmd-1338934002- 184 Jun 5 17:06:42 azteca attrd: [24260]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-p_oralsnr (1338933754) Jun 5 17:06:42 azteca attrd: [24260]: info: attrd_trigger_update: Sending flush op to all hosts for: last-failure-p_oralsnr (1338933754) Jun 5 17:06:42 azteca crmd: [24262]: info: do_lrm_rsc_op: Performing key=19:906:7:8bf8ffb9-cc40-42c5-9dfa-cdb84ec20d97 op=p_oralsnr_monitor_0 ) Jun 5 17:06:42 azteca lrmd: [24259]: info: rsc:p_oralsnr probe[421] (pid 25778) Jun 5 17:06:42 azteca lrmd: [24259]: info: operation monitor[421] on p_oralsnr for client 24262: pid 25778 exited with return code 7 Jun 5 17:06:42 azteca crmd: [24262]: info: process_lrm_event: LRM operation p_oralsnr_monitor_0 (call=421, rc=7, cib-update=841, confirmed=true) not running >>>>>>>>>>>>END LOG VERSION: ============ Last updated: Tue Jun 5 17:08:48 2012 Stack: openais Current DC: azteca - partition with quorum 2 Nodes configured, 2 expected votes 19 Resources configured. ============ Thanks. Jorge.
_______________________________________________ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org