Re: [Pacemaker] [corosync] active/active with Radius

2015-02-16 Thread Jan Friesse
This is really a question for the Pacemaker list, so CCing. Regards, Honza Hi, I would like Corosync to manage Radius in an active/active configuration but I don't know how I should add this, so I was wondering if somebody could point me in the right direction. Thanks and kind regards,

Re: [Pacemaker] [Openais] Issues with a squid cluster.

2015-02-10 Thread Jan Friesse
This is really a question for the Pacemaker list, so CCing. Regards, Honza Redeye wrote: I am not certain where I should post this; hopefully someone will point me in the right direction. I have a two node cluster on Ubuntu 12.04, corosync, pacemaker, and squid. Squid is not starting at

Re: [Pacemaker] [Openais] problem to delete resource

2015-02-04 Thread Jan Friesse
This is really a question for the Pacemaker list, so CCing. Regards, Honza Vladimir Berezovski (vberezov) wrote: Hi, I added a new resource like crm(live)configure# primitive p_drbd_ora ocf:linbit:drbd params drbd_resource=clusterdb_res_ora op monitor interval=60s but its status is

Re: [Pacemaker] Corosync fails to start when NIC is absent

2015-01-20 Thread Jan Friesse
Thank you, Kostya On Wed, Jan 14, 2015 at 1:31 PM, Kostiantyn Ponomarenko konstantin.ponomare...@gmail.com wrote: Thank you. Now I am aware of it. Thank you, Kostya On Wed, Jan 14, 2015 at 12:59 PM, Jan Friesse jfrie...@redhat.com wrote: Kostiantyn, Honza, Thank you for helping me

Re: [Pacemaker] [corosync] CoroSync's UDPu transport for public IP addresses?

2015-01-19 Thread Jan Friesse
is, that Pacemaker must then be configured in a way that quorum is not required. Regards, Honza It would help novices to install and launch corosync instantly. On Fri, Jan 16, 2015 at 7:31 PM, Jan Friesse jfrie...@redhat.com wrote: Dmitry Koterov wrote: such messages (for now

Re: [Pacemaker] [corosync] CoroSync's UDPu transport for public IP addresses?

2015-01-16 Thread Jan Friesse
] Members[1]: 1760315215 Jan 14 10:48:28 node1 corosync[15156]: [MAIN ] Completed service synchronization, ready to provide service. On Mon, Jan 5, 2015 at 6:45 PM, Jan Friesse jfrie...@redhat.com wrote: Dmitry, Sure, in logs I see adding new UDPU member {IP_ADDRESS} (so DNS names

Re: [Pacemaker] CoroSync's UDPu transport for public IP addresses?

2015-01-14 Thread Jan Friesse
[15156]: [MAIN ] Completed service synchronization, ready to provide service. On Mon, Jan 5, 2015 at 6:45 PM, Jan Friesse jfrie...@redhat.com wrote: Dmitry, Sure, in logs I see adding new UDPU member {IP_ADDRESS} (so DNS names are definitely resolved), but in practice the cluster does

Re: [Pacemaker] Corosync fails to start when NIC is absent

2015-01-14 Thread Jan Friesse
Kostiantyn, Honza, Thank you for helping me. So, there is no defined behavior in case one of the interfaces is not in the system? You are right. There is no defined behavior. Regards, Honza Thank you, Kostya On Tue, Jan 13, 2015 at 12:01 PM, Jan Friesse jfrie...@redhat.com

Re: [Pacemaker] Corosync fails to start when NIC is absent

2015-01-13 Thread Jan Friesse
Kostiantyn, According to the https://access.redhat.com/solutions/638843 , the interface, that is defined in the corosync.conf, must be present in the system (see at the bottom of the article, section ROOT CAUSE). To confirm that I made a couple of tests. Here is a part of the

Re: [Pacemaker] CoroSync's UDPu transport for public IP addresses?

2015-01-05 Thread Jan Friesse
is resolved, corosync works only with the IP. This means the code path is exactly the same with an IP or with DNS. Do you have logs from corosync? Honza On Fri, Jan 2, 2015 at 2:49 PM, Jan Friesse jfrie...@redhat.com wrote: Dmitry, No, I meant that if you pass a domain name in ring0_addr, there are no errors

Re: [Pacemaker] CoroSync's UDPu transport for public IP addresses?

2015-01-02 Thread Jan Friesse
Dmitry, No, I meant that if you pass a domain name in ring0_addr, there are no errors in logs, corosync even seems to find nodes (based on its logs), and crm_node -l shows them, but in practice nothing really works. A verbose error message would be very helpful in such a case. This sounds

Re: [Pacemaker] CMAN and Pacemaker with IPv6

2014-07-16 Thread Jan Friesse
for me is I'm using the package from OpenSUSE repo. When I turned back to the CentOS repo, which stores a lower version, the dependency problem occurred. Anyway, thank you for your help. Teenigma On Mon, Jul 14, 2014 at 8:51 PM, Jan Friesse jfrie...@redhat.com wrote: Honza, How do I include

Re: [Pacemaker] CMAN and Pacemaker with IPv6

2014-07-14 Thread Jan Friesse
configure the multicast address manually. Could you advise me on a solution? Many thanks in advance. Te On Thu, Jul 10, 2014 at 6:14 PM, Jan Friesse jfrie...@redhat.com wrote: Teerapatr, Hi Honza, As you said, I use the nodename identified by hostname (which can be accessed via IPv6) and the node also has

Re: [Pacemaker] CMAN and Pacemaker with IPv6

2014-07-14 Thread Jan Friesse
Honza, How do I include the patch with my CentOS package? Do I need to compile them manually? Yes. Also, the official CentOS version was never 1.4.5. If you are using CentOS, just use stock 1.4.1-17.1. The patch is included there. Honza TeEniGMa On Mon, Jul 14, 2014 at 3:21 PM, Jan Friesse jfrie

Re: [Pacemaker] CMAN and Pacemaker with IPv6

2014-07-10 Thread Jan Friesse
Teerapatr, OK, some problems are solved. I was using the incorrect hostname. Now a new problem has occurred. Starting cman... Node address family does not match multicast address family Unable to get the configuration Node address family does not match multicast address family

Re: [Pacemaker] CMAN and Pacemaker with IPv6

2014-07-10 Thread Jan Friesse
). If these are VMs, make sure to properly configure the bridge (just disable the firewall) and allow mcast_querier. Honza On node0, crm_mon shows node1 offline. In the same way, node1 shows node0 as down. So the split-brain problem occurs here. Regards, Te On Thu, Jul 10, 2014 at 2:50 PM, Jan

Re: [Pacemaker] [Openais] unmanaged resource failed - how to get back?

2014-06-30 Thread Jan Friesse
Stefan, sending to the Pacemaker list because your question does not seem to be Corosync related. Regards, Honza Senftleben, Stefan (itsc) wrote: Hello, I set the cluster in maintenance mode with: crm configure property maintenance-mode=true . Afterwards I stopped one resource manually,

Re: [Pacemaker] [Openais] Filesystem vs. Master-Slave MySQL resource

2014-06-03 Thread Jan Friesse
Matej, this is really a question for the Pacemaker mailing list. Hello, I have the following setup: 2 nodes: db-01, db-02 Groups of resources: fs-01: iscsi+lvm+fs at db-01 fs-02: iscsi+lvm+fs at db-02 fs-01 is for mounting data files for MySQL at db-01, fs-02 for db-02 MySQL resources:

Re: [Pacemaker] auto_tie_breaker in two node cluster

2014-05-21 Thread Jan Friesse
I do not quite understand how auto_tie_breaker works. Say we have a cluster with 2 nodes and the auto_tie_breaker feature enabled. Each node has 2 NICs. One NIC is used for cluster communication and the other is used for providing some services from the cluster. So the question is how the nodes

Re: [Pacemaker] pacemaker not started by corosync on ubuntu 14.04

2014-05-12 Thread Jan Friesse
Vladimir, Vladimir wrote: Hello everyone, I'm trying to get corosync/pacemaker to run on Ubuntu 14.04. In my Ubuntu 12.04 setups pacemaker was started by corosync. Actually I thought the Yes. 12.04 used corosync 1.x with the pacemaker plugin. service {...} section in the corosync.conf is

Re: [Pacemaker] corosync [TOTEM ] Process pause detected for 577 ms

2014-05-05 Thread Jan Friesse
-30 17:07 GMT+02:00 Jan Friesse jfrie...@redhat.com: Emmanuel, emmanuel segura wrote: Hello Jan, Thanks for the explanation, but I saw this in my log

Re: [Pacemaker] corosync [TOTEM ] Process pause detected for 577 ms

2014-04-30 Thread Jan Friesse
was not triggered :(, but it's enabled 2014-04-25 18:36 GMT+02:00 emmanuel segura emi2f...@gmail.com: Hello Jan, I found this problem in two HP blade systems and the strange thing is the fencing was triggered :( 2014-04-25 9:27 GMT+02:00 Jan Friesse jfrie...@redhat.com: Emanuel, emmanuel

Re: [Pacemaker] corosync [TOTEM ] Process pause detected for 577 ms

2014-04-30 Thread Jan Friesse
(upgrade to 2.3.3 will solve it automatically, because plugins in corosync 2.x are no longer supported). Regards, Honza Thanks 2014-04-30 9:42 GMT+02:00 Jan Friesse jfrie...@redhat.com: Emmanuel, there is no need to trigger fencing on Process pause detected. Also fencing

Re: [Pacemaker] corosync [TOTEM ] Process pause detected for 577 ms

2014-04-25 Thread Jan Friesse
Emanuel, emmanuel segura wrote: Hello List, I have these two lines in my cluster logs; can somebody help me understand what they mean? :: corosync [TOTEM ]

Re: [Pacemaker] corosync does not reflect the node status correctly

2014-03-31 Thread Jan Friesse
Michael, Michael Schwartzkopff wrote: Hi, we just upgraded to corosync-1.4.5-2.5 from the suse build server. On one cluster we have the problem that corosync-objctl does not reflect the status So if I understand it correctly, you have multiple clusters and all of them were upgraded and

Re: [Pacemaker] Errors while compiling

2014-03-19 Thread Jan Friesse
Stephan Buchner wrote: Hm, I tried recompiling all three packages (libqb, corosync and pacemaker), using versions which have been marked stable by the Gentoo project. I used the following versions: libqb = 0.14.4 corosync

Re: [Pacemaker] Pacemaker/corosync freeze

2014-03-14 Thread Jan Friesse
Message- From: Attila Megyeri [mailto:amegy...@minerva-soft.com] Sent: Thursday, March 13, 2014 1:45 PM To: The Pacemaker cluster resource manager; Andrew Beekhof Subject: Re: [Pacemaker] Pacemaker/corosync freeze Hello, -Original Message- From: Jan Friesse [mailto:jfrie

Re: [Pacemaker] Pacemaker/corosync freeze

2014-03-14 Thread Jan Friesse
To: The Pacemaker cluster resource manager; Andrew Beekhof Subject: Re: [Pacemaker] Pacemaker/corosync freeze Hello, -Original Message- From: Jan Friesse [mailto:jfrie...@redhat.com] Sent: Thursday, March 13, 2014 10:03 AM To: The Pacemaker cluster resource manager Subject: Re

Re: [Pacemaker] Pacemaker/corosync freeze

2014-03-13 Thread Jan Friesse
... Also can you please try to set debug: on in corosync.conf and paste full corosync.log then? I set debug to on, and did a few restarts but could not reproduce the issue yet - will post the logs as soon as I manage to reproduce. Perfect. Another option you can try to set is netmtu
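For readers following the thread, a minimal sketch of the logging change requested above, assuming a corosync.conf with a standard logging section. The log file path is an assumption and depends on the distribution:

```
logging {
    # Enable debug-level messages, as requested in the thread
    debug: on
    to_logfile: yes
    # Assumed path; adjust to your distribution's layout
    logfile: /var/log/corosync/corosync.log
}
```

Turning debug off again once the issue has been reproduced keeps the log volume manageable.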

Re: [Pacemaker] Pacemaker/corosync freeze

2014-03-12 Thread Jan Friesse
Attila Megyeri wrote: -Original Message- From: Andrew Beekhof [mailto:and...@beekhof.net] Sent: Tuesday, March 11, 2014 10:27 PM To: The Pacemaker cluster resource manager Subject: Re: [Pacemaker] Pacemaker/corosync freeze On 12 Mar 2014, at 1:54 am, Attila Megyeri

Re: [Pacemaker] Pacemaker/corosync freeze

2014-03-12 Thread Jan Friesse
Attila Megyeri wrote: Hello Jan, Thank you very much for your help so far. -Original Message- From: Jan Friesse [mailto:jfrie...@redhat.com] Sent: Wednesday, March 12, 2014 9:51 AM To: The Pacemaker cluster resource manager Subject: Re: [Pacemaker] Pacemaker/corosync freeze

Re: [Pacemaker] Pacemaker/corosync freeze

2014-03-12 Thread Jan Friesse
Attila Megyeri wrote: -Original Message- From: Jan Friesse [mailto:jfrie...@redhat.com] Sent: Wednesday, March 12, 2014 2:27 PM To: The Pacemaker cluster resource manager Subject: Re: [Pacemaker] Pacemaker/corosync freeze Attila Megyeri wrote: Hello Jan, Thank you very

Re: [Pacemaker] [corosync] corosync Segmentation fault.

2014-02-26 Thread Jan Friesse
Andrey, what version of corosync and libqb are you using? Can you please attach output from valgrind (and gdb backtrace)? Thanks, Honza Andrey Groshev wrote: Hi, ALL. Something I already confused, or after updating any package or myself something broke, but call corosync killed by

Re: [Pacemaker] [corosync] corosync Segmentation fault.

2014-02-26 Thread Jan Friesse
Andrey, can you please give a try to patch [PATCH] votequorum: Properly initialize atb and atb_string which I've sent to ML (it should be there soon)? Thanks, Honza Andrey Groshev wrote: 26.02.2014, 12:11, Jan Friesse jfrie...@redhat.com: Andrey, what version of corosync and libqb

Re: [Pacemaker] [corosync] corosync Segmentation fault.

2014-02-26 Thread Jan Friesse
Andrey Groshev wrote: 26.02.2014, 16:11, Jan Friesse jfrie...@redhat.com: Andrey, can you please give a try to patch [PATCH] votequorum: Properly initialize atb and atb_string which I've sent to ML (it should be there soon)? Yes. Service is running. Thanks. # corosync-quorumtool

Re: [Pacemaker] Multicast pitfalls? corosync [TOTEM ] Retransmit List:

2014-02-14 Thread Jan Friesse
Beo, are you experiencing a cluster split? If the answer is no, then you don't need to do anything. Maybe the network buffer is just filled. But if the answer is yes, try reducing the MTU size (netmtu in the configuration) to a value like 1000. Regards, Honza Beo Banks wrote: Hi, I have a fresh 2 node cluster
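As an illustration of the netmtu suggestion above, a minimal sketch of the relevant corosync.conf fragment. Only netmtu comes from the thread; a real totem section carries further options (interface, transport, and so on) that are omitted here:

```
totem {
    version: 2
    # Lower the MTU corosync assumes for its packets (the default is 1500)
    netmtu: 1000
}
```

The same value would need to be set on every node in the cluster.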

Re: [Pacemaker] error: send_cpg_message: Sending message via cpg FAILED: (rc=6) Try again

2013-12-09 Thread Jan Friesse
Brian J. Murrell (brian) wrote: I seem to have another instance where pacemaker fails to exit at the end of a shutdown. Here's the log from the start of the service pacemaker stop: Dec 3 13:00:39 wtm-60vm8 crmd[14076]: notice: do_state_transition: State transition S_POLICY_ENGINE -

Re: [Pacemaker] Network outage debugging

2013-11-13 Thread Jan Friesse
Andrew Beekhof wrote: On 13 Nov 2013, at 11:49 am, Sean Lutner s...@rentul.net wrote: On Nov 12, 2013, at 7:33 PM, Andrew Beekhof and...@beekhof.net wrote: On 13 Nov 2013, at 11:22 am, Sean Lutner s...@rentul.net wrote: On Nov 12, 2013, at 6:01 PM, Andrew Beekhof

Re: [Pacemaker] Network outage debugging

2013-11-13 Thread Jan Friesse
Sean Lutner wrote: On Nov 13, 2013, at 3:15 AM, Jan Friesse jfrie...@redhat.com wrote: Andrew Beekhof wrote: On 13 Nov 2013, at 11:49 am, Sean Lutner s...@rentul.net wrote: On Nov 12, 2013, at 7:33 PM, Andrew Beekhof and...@beekhof.net wrote: On 13 Nov 2013, at 11:22 am

Re: [Pacemaker] Simple installation Pacemaker + CMAN + fence-agents

2013-11-10 Thread Jan Friesse
Andrew Beekhof wrote: Something seems very wrong with this at the corosync level. Even fenced and the dlm are having issues. Jan: Could this be firewall related? Yes. This can be either a firewall or an mcast issue. I would recommend turning off the firewall completely (for testing). If this

Re: [Pacemaker] Could not initialize corosync configuration API error 2

2013-10-31 Thread Jan Friesse
Andrew, this problem was already discussed on corosync-ml. Andrew Beekhof wrote: Jan: not sure if you're on the pacemaker list On 29 Oct 2013, at 6:43 pm, Bauer, Stefan (IZLBW Extern) stefan.ba...@iz.bwl.de wrote: Dear Developers/Users, we’re using Pacemaker 1.1.7 and Corosync

Re: [Pacemaker] Pacemaker 1.1.8 and corosync's cpg service?

2013-05-22 Thread Jan Friesse
Mike, did you enter the local node in the nodelist? Because this may explain the behavior you were describing. Honza Mike Edwards wrote: On Tue, May 21, 2013 at 11:15:56AM +1000, Andrew Beekhof babbled thus: cpg_join() is returning CS_ERR_TRY_AGAIN here. Jan: Any idea why this might happen? That's

Re: [Pacemaker] Pacemaker 1.1.8 and corosync's cpg service?

2013-05-22 Thread Jan Friesse
Mike Edwards wrote: Which would be the recommended transport? I'm not tied to any particular method. As long as UDP (multicast) works for you, it's the better solution (better tested, faster, ...). UDPU is targeted at deployments where multicast is a problem. Regards, Honza On Wed,
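To make the transport choice above concrete, a hedged sketch of the unicast (UDPU) variant in corosync 2.x syntax, for the case where multicast is not usable. The node addresses are hypothetical documentation addresses, not taken from the thread:

```
totem {
    version: 2
    # udp (multicast) is the default; udpu is for networks
    # where multicast is a problem
    transport: udpu
}

nodelist {
    node {
        # Hypothetical address of the first node
        ring0_addr: 192.0.2.11
        nodeid: 1
    }
    node {
        # Hypothetical address of the second node
        ring0_addr: 192.0.2.12
        nodeid: 2
    }
}
```

With the default multicast transport, the transport line can simply be left out.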

Re: [Pacemaker] Pacemaker 1.1.8 and corosync's cpg service?

2013-05-22 Thread Jan Friesse
internally converted to the recommended version with nodelist (so that's what you've sent). Regards, Honza Mike Edwards wrote: Yep. The config I pasted has the bindnetaddr set to 10.10.23.50, which also happens to be defined as node 1. On Wed, May 22, 2013 at 09:28:13AM +0200, Jan Friesse

Re: [Pacemaker] [Openais] Hawk 0.5.2 Debian packages

2013-02-26 Thread Jan Friesse
Great news! Regards, Honza Charles Williams wrote: Hey all, I recently got a chance to finally build Debian packages for the 0.5.2 version of ClusterLabs Hawk GUI. These are Squeeze packages ATM (Wheezy to come next week dependent upon testing of the current packages) and I am

Re: [Pacemaker] [corosync] Corosync memory usage rising

2013-02-04 Thread Jan Friesse
Andrew Beekhof wrote: On Thu, Jan 31, 2013 at 8:10 AM, Yves Trudeau y.trud...@videotron.ca wrote: Hi, Is there any known memory leak issue in corosync 1.4.1? I have a setup here where corosync eats memory at a few kB a minute: 1.4.1 for sure. But it looks like you are using 1.4.1-7 (EL

Re: [Pacemaker] [corosync] Corosync 2.1.0 dies on both nodes in cluster

2012-11-08 Thread Jan Friesse
out this line in /etc/hosts on all nodes in the cluster. http://burning-midnight.blogspot.com/2012/07/cluster-building-ubuntu-1204-revised.html Thanks, Andrew - Original Message - From: Jan Friesse jfrie...@redhat.com To: Andrew Martin amar...@xes-inc.com Cc

Re: [Pacemaker] [corosync] Corosync 2.1.0 dies on both nodes in cluster

2012-11-08 Thread Jan Friesse
and cluster should work. There are basically two problems: - ipc_shm is leaking memory - if there is no memory, libqb mmaps non-allocated memory and receives SIGBUS Angus is working on both issues. Regards, Honza Jan Friesse wrote: Andrew, thanks for the valgrind report (even if it didn't show

Re: [Pacemaker] [corosync] Corosync 2.1.0 dies on both nodes in cluster

2012-11-07 Thread Jan Friesse
) and post a backtrace if it still happens, that would be great. Thanks Angus Thanks, Andrew - Original Message - From: Jan Friesse jfrie...@redhat.com To: Andrew Martin amar...@xes-inc.com Cc: disc...@corosync.org, The Pacemaker cluster resource manager pacemaker

Re: [Pacemaker] [corosync] Corosync 2.1.0 dies on both nodes in cluster

2012-11-07 Thread Jan Friesse
see much wrong with the log either. If you could run with the latest (libqb-0.14.3) and post a backtrace if it still happens, that would be great. Thanks Angus Thanks, Andrew - Original Message - From: Jan Friesse jfrie...@redhat.com To: Andrew Martin amar...@xes

Re: [Pacemaker] [corosync] Corosync 2.1.0 dies on both nodes in cluster

2012-11-05 Thread Jan Friesse
be great. Thanks Angus Thanks, Andrew - Original Message - From: Jan Friesse jfrie...@redhat.com To: Andrew Martin amar...@xes-inc.com Cc: disc...@corosync.org, The Pacemaker cluster resource manager pacemaker@oss.clusterlabs.org Sent: Thursday, November 1, 2012 7:55:52

Re: [Pacemaker] [corosync] Corosync 2.1.0 dies on both nodes in cluster

2012-11-01 Thread Jan Friesse
Andrew, I was not able to find anything interesting (from the corosync point of view) in configuration/logs (corosync related). What would be helpful: - if corosync died, there should be /var/lib/corosync/fdata-DATETIME-PID of the dead corosync. Can you please xz them and store somewhere (they are