Re: [ClusterLabs] Is there a Trick to Making Corosync Work on Microsoft Azure?
I got it.

From: Eric Robinson [mailto:eric.robin...@psmnv.com]
Sent: Wednesday, August 23, 2017 6:51 PM
To: Cluster Labs - All topics related to open-source clustering welcomed
Subject: Re: [ClusterLabs] Is there a Trick to Making Corosync Work on Microsoft Azure?

I figured out the cause. CMAN got installed by yum, and so none of my changes to corosync.conf had any effect, including the udpu directive. Now I just have to figure out how to enable unicast in cman.

--
Eric Robinson

___
Users mailing list: Users@clusterlabs.org
http://lists.clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org
Re: [ClusterLabs] Is there a Trick to Making Corosync Work on Microsoft Azure?
I figured out the cause. CMAN got installed by yum, and so none of my changes to corosync.conf had any effect, including the udpu directive. Now I just have to figure out how to enable unicast in cman.

--
Eric Robinson

From: Eric Robinson [mailto:eric.robin...@psmnv.com]
Sent: Wednesday, August 23, 2017 3:16 PM
To: Cluster Labs - All topics related to open-source clustering welcomed
Subject: [ClusterLabs] Is there a Trick to Making Corosync Work on Microsoft Azure?

I created two nodes on Microsoft Azure, but I can't get them to join a cluster. Any thoughts?
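On the open question of enabling unicast under cman: the thread does not record which change was made. A minimal sketch, assuming the RHEL 6 convention that cman reads /etc/cluster/cluster.conf and that UDP unicast is selected with a transport="udpu" attribute on the <cman> element. The cluster.conf text below is a hypothetical example built from the node names in this thread, not the poster's actual file, and cman_transport is an illustrative helper.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# Hypothetical cluster.conf for the two nodes named in this thread.
# Assumption: transport="udpu" on <cman> selects UDP unicast.
CLUSTER_CONF = """\
<cluster name="ha001" config_version="2">
  <cman transport="udpu"/>
  <clusternodes>
    <clusternode name="ha001a" nodeid="1"/>
    <clusternode name="ha001b" nodeid="2"/>
  </clusternodes>
</cluster>
"""


def cman_transport(conf_xml: str) -> Optional[str]:
    """Return the transport attribute of <cman>, or None if it is not set."""
    root = ET.fromstring(conf_xml)
    cman = root.find("cman")
    if cman is None:
        return None
    return cman.get("transport")


if __name__ == "__main__":
    print("cman transport:", cman_transport(CLUSTER_CONF))
```

A check like this only confirms what the file says; whether the running cluster picked it up still has to be verified on the nodes themselves.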
Re: [ClusterLabs] ClusterLabs.Org Documentation Problem?
I have a BIG correction. If you follow the instructions titled, "Pacemaker 1.1 for Corosync 2.x," and NOT the ones entitled, "Pacemaker 1.1 for CMAN or Corosync 1.x," guess what? It installs cman anyway, and you spend a couple of days wondering why none of your changes to corosync.conf seem to be working.

--
Eric Robinson

-----Original Message-----
From: Jan Friesse [mailto:jfrie...@redhat.com]
Sent: Tuesday, August 22, 2017 11:52 PM
To: Cluster Labs - All topics related to open-source clustering welcomed; kgail...@redhat.com
Subject: Re: [ClusterLabs] ClusterLabs.Org Documentation Problem?

> Thanks for the reply. Yes, it's a bit confusing. I did end up using the
> documentation for Corosync 2.X since that seemed newer, but it also assumed
> CentOS/RHEL7 and systemd-based commands. It also incorporates cman, pcsd,
> psmisc, and policycoreutils-python, which are all new to me. If there is
> anything I can do to assist with getting the documentation cleaned up, I'd be
> more than glad to help.

Just a small correction. Documentation shouldn't incorporate cman. Cman was used with corosync 1.x as a configuration layer and (more important) quorum provider. With Corosync 2.x quorum provider is already in corosync so no need for cman.
[ClusterLabs] Is there a Trick to Making Corosync Work on Microsoft Azure?
I created two nodes on Microsoft Azure, but I can't get them to join a cluster. Any thoughts?

OS: RHEL 6.9
Corosync version: 1.4.7-5.el6.x86_64
Node names: ha001a (172.28.0.4/23), ha001b (172.28.0.5/23)

The nodes are on the same subnet and can ping and ssh to each other just fine by either host name or IP address.

I have configured corosync to use unicast.

corosync-cfgtool looks fine...

[root@ha001b corosync]# corosync-cfgtool -s
Printing ring status.
Local node ID 2
RING ID 0
        id      = 172.28.0.5
        status  = ring 0 active with no faults

...but corosync-objctl only shows the local node...

[root@ha001b corosync]# corosync-objctl |grep join
totem.join=60
runtime.totem.pg.mrp.srp.memb_join_tx=1
runtime.totem.pg.mrp.srp.memb_join_rx=1
runtime.totem.pg.mrp.srp.members.2.join_count=1
runtime.totem.pg.mrp.srp.members.2.status=joined

...pcs status shows...

Cluster name: ha001
Stack: cman
Current DC: ha001b (version 1.1.15-5.el6-e174ec8) - partition with quorum
Last updated: Wed Aug 23 18:04:33 2017 Last change: Wed Aug 23 17:51:07 2017 by root via cibadmin on ha001b

2 nodes and 0 resources configured

Online: [ ha001b ]
OFFLINE: [ ha001a ]

No resources

Daemon Status:
  cman: active/disabled
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/disabled

...it shows the opposite on the other node...

[root@ha001a ~]# corosync-objctl |grep join
totem.join=60
runtime.totem.pg.mrp.srp.memb_join_tx=1
runtime.totem.pg.mrp.srp.memb_join_rx=1
runtime.totem.pg.mrp.srp.members.1.join_count=1
runtime.totem.pg.mrp.srp.members.1.status=joined

[root@ha001a ~]# pcs status
Cluster name: ha001
Stack: cman
Current DC: ha001a (version 1.1.15-5.el6-e174ec8) - partition with quorum
Last updated: Wed Aug 23 18:06:04 2017 Last change: Wed Aug 23 17:51:03 2017 by root via cibadmin on ha001a

2 nodes and 0 resources configured

Online: [ ha001a ]
OFFLINE: [ ha001b ]

No resources

Daemon Status:
  cman: active/disabled
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/disabled

...here is my corosync.conf...

compatibility: whitetank

totem {
        version: 2
        secauth: off
        interface {
                member {
                        memberaddr: 172.28.0.4
                }
                member {
                        memberaddr: 172.28.0.5
                }
                ringnumber: 0
                bindnetaddr: 172.28.0.0
                mcastport: 5405
                ttl: 1
        }
        transport: udpu
}

logging {
        fileline: off
        to_logfile: yes
        to_syslog: yes
        logfile: /var/log/cluster/corosync.log
        debug: off
        timestamp: on
        logger_subsys {
                subsys: AMF
                debug: off
        }
}

I used tcpdump and I see a lot of traffic between them on port 2224, but nothing else.

Is there an issue because the bindnetaddr is 172.28.0.0 but the members have a /23 mask?

--
Eric Robinson
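On the bindnetaddr question, the arithmetic can be checked directly. A minimal sketch using Python's standard ipaddress module, assuming (as the corosync.conf man page describes for IPv4) that bindnetaddr is expected to be the network address of the interface, i.e. the node address ANDed with its mask. The helper name network_of is illustrative only.

```python
import ipaddress

# Node addresses and prefix length exactly as given in the post.
NODES = ["172.28.0.4/23", "172.28.0.5/23"]
BINDNETADDR = ipaddress.ip_address("172.28.0.0")


def network_of(cidr: str):
    """Return the network an interface address such as 172.28.0.4/23 belongs to."""
    return ipaddress.ip_interface(cidr).network


if __name__ == "__main__":
    for cidr in NODES:
        net = network_of(cidr)
        matches = net.network_address == BINDNETADDR
        print(f"{cidr}: network {net}, bindnetaddr matches: {matches}")
```

Both addresses fall in 172.28.0.0/23, so under that assumption the /23 mask is consistent with bindnetaddr: 172.28.0.0 and would not by itself explain the nodes failing to join.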
Re: [ClusterLabs] ClusterLabs.Org Documentation Problem?
> Thanks for the reply. Yes, it's a bit confusing. I did end up using the
> documentation for Corosync 2.X since that seemed newer, but it also assumed
> CentOS/RHEL7 and systemd-based commands. It also incorporates cman, pcsd,
> psmisc, and policycoreutils-python, which are all new to me. If there is
> anything I can do to assist with getting the documentation cleaned up, I'd be
> more than glad to help.

Just a small correction. Documentation shouldn't incorporate cman. Cman was used with corosync 1.x as a configuration layer and (more important) quorum provider. With Corosync 2.x quorum provider is already in corosync so no need for cman.

> --
> Eric Robinson
>
> -----Original Message-----
> From: Ken Gaillot [mailto:kgail...@redhat.com]
> Sent: Tuesday, August 22, 2017 2:08 PM
> To: Cluster Labs - All topics related to open-source clustering welcomed
> Subject: Re: [ClusterLabs] ClusterLabs.Org Documentation Problem?
>
> On Tue, 2017-08-22 at 19:40 +, Eric Robinson wrote:
>> The documentation located here…
>>
>> http://clusterlabs.org/doc/
>>
>> …is confusing because it offers two combinations:
>>
>> Pacemaker 1.0 for Corosync 1.x
>> Pacemaker 1.1 for Corosync 2.x
>>
>> According to the documentation, if you use Corosync 1.x you need
>> Pacemaker 1.0, but if you use Corosync 2.x then you need Pacemaker 1.1.
>>
>> However, on my CentOS 6.9 system, when I do ‘yum install pacemaker
>> corosync’ I get the following versions:
>>
>> pacemaker-1.1.15-5.el6.x86_64
>> corosync-1.4.7-5.el6.x86_64
>>
>> What’s the correct answer? Does Pacemaker 1.1.15 work with Corosync
>> 1.4.7? If so, is the documentation at ClusterLabs misleading?
>>
>> --
>> Eric Robinson
>
> The page actually offers a third option ... "Pacemaker 1.1 for CMAN or
> Corosync 1.x". That's the configuration used by CentOS 6.
>
> However, that's still a bit misleading; the documentation set for "Pacemaker
> 1.1 for Corosync 2.x" is the only one that is updated, and it's mostly
> independent of the underlying layer, so you should prefer that set.
>
> I plan to reorganize that page in the coming months, so I'll try to make it
> clearer.
>
> --
> Ken Gaillot
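The practical consequence of the cman point above is which file actually configures the membership layer. A rough sketch of that rule as described in this thread: with Corosync 1.x started through cman, cman acts as the configuration layer, so edits to corosync.conf are not what the cluster uses; with Corosync 2.x, quorum is built in and corosync.conf applies. The function name is hypothetical, the two paths are the conventional default locations, and treating "cman in use" as a simple boolean is a simplification.

```python
def active_config_file(corosync_version: str, cman_in_use: bool) -> str:
    """Guess which file configures cluster membership.

    corosync_version: a version string such as "1.4.7-5.el6.x86_64".
    cman_in_use: True when the stack is cman (e.g. pcs status shows "Stack: cman").
    """
    major = int(corosync_version.split(".")[0])
    if major == 1 and cman_in_use:
        # cman is the configuration layer; corosync.conf changes are not used.
        return "/etc/cluster/cluster.conf"
    # Corosync 2.x (or 1.x without cman) reads its own configuration file.
    return "/etc/corosync/corosync.conf"


if __name__ == "__main__":
    # Versions reported earlier in the thread for CentOS/RHEL 6.9.
    print(active_config_file("1.4.7-5.el6.x86_64", cman_in_use=True))
```

For the CentOS 6.9 packages mentioned in the thread, this points at cluster.conf, which matches the observation that changes to corosync.conf appeared to do nothing.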