Re: [Pacemaker] HA KVM over DRBD primary/secondary configuration
Hi Rasto, Note that on RHEL 6/CentOS 6, you should run the Pacemaker through CMAN and not a Corosync plugin Not glad to hear that... We are using Pacemaker+Corosync everywhere (SuSe, CentOS, OracleLinux servers). Is there any way to use LCMC in this setup? 19.04.2013 09:30, Rasto Levrinc : We used the amazing LCMC tool for the hosts, drbd and pacemaker and Corosync setup http://lcmc.sourceforge.net/ LCMC does lots of the setup automatic - a huge timesaver. I'm glad to hear that. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
Re: [Pacemaker] HA KVM over DRBD primary/secondary configuration
On Fri, Apr 19, 2013 at 9:11 AM, Alexandr A. Alexandrov shurr...@gmail.com wrote: Hi Rasto, Note that on RHEL 6/CentOS 6, you should run the Pacemaker through CMAN and not a Corosync plugin I wonder if that's still true, but better be safe than sorry. Not glad to hear that... We are using Pacemaker+Corosync everywhere (SuSe, CentOS, OracleLinux servers). Is there any way to use LCMC in this setup? there's no problem using LCMC in this setup. It will not help to create the cman config, though. That's a feature with very low priority for me at the moment. Rasto 19.04.2013 09:30, Rasto Levrinc пишет: We used the amazing LCMC tool for the hosts, drbd and pacemaker and Corosync setup http://lcmc.sourceforge.net/ LCMC does lots of the setup automatic - a huge timesaver. I'm glad to hear that. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
Re: [Pacemaker] pcs equivalent of crm configure erase
Hi Chris, No, you're definitely not missing anything. The 'pcs cluster cib' output isn't pretty. why there is an approach to build a new config-tool at all? Why not use the crmsh? Why it was dropped in the RHEL-build? Does it have some relevant disadvantages? -- To Answer please replace invalid with de ! Zum Antworten bitte invalid durch de ersetzen ! Chau y hasta luego, Thorolf ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
Re: [Pacemaker] Routing-Ressources on a 2-Node-Cluster
Hi, Because the nodes are located in different networks, each node needs it's own route-ressource, that is only valid if the node is passive and will be removed when this node goes active and gets the default route for the cluster-ip. I did not found any solution for this, so I'm going the other way. I added static routes with a high metric to each server: route add 1.2.3.4 via 10.20.11.1 metric 20 and route add 1.2.3.4 via 10.20.11.1 metric 20 When a server gets active, it will get the cluster-ip 10.20.10.70 and the default route to 10.20.10.1. In addition, the active node will get a new route via the cluster-ip with a lower metric, that overwrites the above static route: route add 1.2.3.4 via 10.20.10.1 metric 10 I tested this manually and it works as expected. There is just a small Problem: the heartbeat-route resource does not allow defining the metric :-( So should I change the /usr/lib/ocf/resource.d/heartbeat/Route, should I create a new start-script or is there any other solution? Thanks a lot for your help, -- To Answer please replace invalid with de ! Zum Antworten bitte invalid durch de ersetzen ! Chau y hasta luego, Thorolf ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
Re: [Pacemaker] pcs equivalent of crm configure erase
I don't know why redhat doesn't give to the users the alternatives to use what they want sorry for my ugly english :-) Thanks 2013/4/19 T. nos...@godawa.de Hi Chris, No, you're definitely not missing anything. The 'pcs cluster cib' output isn't pretty. why there is an approach to build a new config-tool at all? Why not use the crmsh? Why it was dropped in the RHEL-build? Does it have some relevant disadvantages? -- To Answer please replace invalid with de ! Zum Antworten bitte invalid durch de ersetzen ! Chau y hasta luego, Thorolf ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org -- esta es mi vida e me la vivo hasta que dios quiera ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
Re: [Pacemaker] HA KVM over DRBD primary/secondary configuration
On Fri, Apr 19, 2013 at 11:15 AM, Alexandr A. Alexandrov shurr...@gmail.com wrote: Hi! I do not need CMAN config to be created, since I do not use CMAN... It's possible, you would have to on RHEL6 system. I tried LCMC, it seems to recognize cluster/drbd settings. However, when I was finishing looking at it, I got corosync dead! :-( Did Corosync actually sefgault? Looks like Corosync problem, LCMC doesn't interact with it directly. Rasto Apr 19 12:45:30 kennedy corosync[20271]: [QB] IPC credentials authenticated (20271-19215-31) Apr 19 12:45:30 kennedy cib[17082]:error: pcmk_cpg_dispatch: Connection to the CPG API failed: 2 Apr 19 12:45:30 kennedy cib[17082]:error: cib_ais_destroy: Corosync connection lost! Exiting. Apr 19 12:45:30 kennedy crmd[16172]:error: pcmk_cpg_dispatch: Connection to the CPG API failed: 2 Apr 19 12:45:30 kennedy cib[17082]:error: cib_ipc_closed: Client crm_mon not found in the hashtable Apr 19 12:45:30 kennedy crmd[16172]:error: crm_ipc_read: Connection to cib_shm failed Apr 19 12:45:30 kennedy crmd[16172]:error: mainloop_gio_callback: Connection to cib_shm[0x6503b0] closed (I/O condition=17) Apr 19 12:45:30 kennedy crmd[16172]:error: crmd_cib_connection_destroy: Connection to the CIB terminated... Apr 19 12:45:31 kennedy pacemakerd[17080]:error: cfg_connection_destroy: Connection destroyed Apr 19 12:45:31 kennedy pacemakerd[17080]:error: cpg_connection_destroy: Connection destroyed Apr 19 12:45:31 kennedy pacemakerd[17080]: notice: pcmk_shutdown_worker: Shuting down Pacemaker Apr 19 12:45:31 kennedy pacemakerd[17080]: notice: stop_child: Stopping crmd: Sent -15 to process 16172 Apr 19 12:45:31 kennedy stonith-ng[17083]:error: pcmk_cpg_dispatch: Connection to the CPG API failed: 2 Apr 19 12:45:31 kennedy stonith-ng[17083]:error: stonith_peer_ais_destroy: AIS connection terminated Apr 19 12:45:31 kennedy crmd[16172]:error: do_log: FSA: Input I_ERROR from crmd_cib_connection_destroy() received in state S_NOT_DC Apr 19 12:45:31 kennedy crmd[16172]: notice: do_state_transition: State transition S_NOT_DC - S_RECOVERY [ input=I_ERROR cause=C_FSA_INTERNAL origin=crmd_ cib_connection_destroy ] Apr 19 12:45:31 kennedy crmd[16172]:error: do_recover: Action A_RECOVER (0100) not supported Apr 19 12:45:31 kennedy crmd[16172]:error: do_log: FSA: Input I_TERMINATE from do_recover() received in state S_RECOVERY Apr 19 12:45:31 kennedy attrd[17085]:error: pcmk_cpg_dispatch: Connection to the CPG API failed: 2 Apr 19 12:45:31 kennedy attrd[17085]: crit: attrd_ais_destroy: Lost connection to Corosync service! Apr 19 12:45:31 kennedy attrd[17085]: notice: main: Exiting... Apr 19 12:45:31 kennedy attrd[17085]: notice: main: Disconnecting client 0x61ec10, pid=16172... Apr 19 12:45:31 kennedy attrd[17085]:error: attrd_cib_connection_destroy: Connection to the CIB terminated... Apr 19 12:45:31 kennedy pacemakerd[17080]:error: pcmk_child_exit: Child process attrd exited (pid=17085, rc=1) Apr 19 12:45:31 kennedy pacemakerd[17080]:error: send_cpg_message: Sending message via cpg FAILED: (rc=9) Bad handle Apr 19 12:45:31 kennedy pacemakerd[17080]:error: send_cpg_message: Sending message via cpg FAILED: (rc=9) Bad handle 19.04.2013 11:44, Rasto Levrinc пишет: On Fri, Apr 19, 2013 at 9:11 AM, Alexandr A. Alexandrov shurr...@gmail.com wrote: Hi Rasto, Note that on RHEL 6/CentOS 6, you should run the Pacemaker through CMAN and not a Corosync plugin I wonder if that's still true, but better be safe than sorry. Not glad to hear that... We are using Pacemaker+Corosync everywhere (SuSe, CentOS, OracleLinux servers). Is there any way to use LCMC in this setup? there's no problem using LCMC in this setup. It will not help to create the cman config, though. That's a feature with very low priority for me at the moment. Rasto 19.04.2013 09:30, Rasto Levrinc пишет: We used the amazing LCMC tool for the hosts, drbd and pacemaker and Corosync setup http://lcmc.sourceforge.net/ LCMC does lots of the setup automatic - a huge timesaver. I'm glad to hear that. ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org -- Dipl.-Ing. Rastislav Levrinc rasto.levr...@gmail.com Linux Cluster Management Console http://lcmc.sf.net/
Re: [Pacemaker] Routing-Ressources on a 2-Node-Cluster
Hi David, Why can't both your cluster nodes have 10.20.10.1 as their default route all the time? because the different locations have different networks, routers, IP-addresses etc. But to be always reachable from the outside, they need one common IP-address, that is valid on the active node. And only for the active node the 10.20.10.1 is the way to go out, the passive node does even not have any connectivity to the outside. There is just a little hole for getting updates and for the pingd-resource needed for the HA-functionality. This is done by a proxy and/or by routes and it is working as expected. Your configuration seems to have way too many moving parts and since you are making routing changes when the nodes become primary it is difficult Well, beside the servers that have to run on the active node, the only moving part is the cluster-ip, it's default route and an additional route. So I do not see too much complexity? Not 100% sure what you're trying to accomplish, but when a config is this complex it's usually not ideal. That might be correct, but this is not the problem I have. So any other hint what I should do to get the metric working for the route-resources? Thanks a lot, -- Chau y hasta luego, Thorolf ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
[Pacemaker] Kernel WARN unpack_status in syslog
Guys, I can't get rid of following warnings: Apr 19 19:00:37 node2 crmd: [32230]: WARN: start_subsystem: Client pengine already running as pid 32240 Apr 19 19:00:44 node2 pengine: [32240]: WARN: unpack_status: Node node1 in status section no longer exists Apr 19 19:00:44 node2 pengine: [32240]: WARN: unpack_status: Node node2 in status section no longer exists Apr 19 19:00:44 node2 pengine: [32240]: notice: process_pe_message: Configuration WARNINGs found during PE processing. Please run crm_verify -L to identify issues. root@node2:~# crm_verify -LV crm_verify[13317]: 2013/04/19_19:03:04 WARN: unpack_status: Node node1 in status section no longer exists crm_verify[13317]: 2013/04/19_19:03:04 WARN: unpack_status: Node node2 in status section no longer exists Warnings found during check: config may not be valid Since I have nagios running through syslog emailing warnings and errors, this is pretty annoying. And disabling warn checks isn't an option. Any clues? I do have /etc/hosts entries. Ivor Prebeg smime.p7s Description: S/MIME cryptographic signature ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org
[Pacemaker] crm_attribute not returning node attribute
Given: host1# crm node attribute host1 show foo scope=nodes name=foo value=bar Why doesn't this return anything: host1# crm_attribute --node host1 --name foo --query host1# echo $? 0 cibadmin -Q confirms the presence of the attribute: node id=host1 uname=host1 instance_attributes id=nodes-host1 nvpair id=nodes-host1-foo name=foo value=bar/ /instance_attributes /node This is on pacemaker 1.1.8 on EL6.4 and crmsh. Thoughts? b. signature.asc Description: OpenPGP digital signature ___ Pacemaker mailing list: Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org