Re: [Openais] totem token timeout increase

2014-08-27 Thread Steven Dake
On 08/26/2014 07:17 AM, Vasil Valchev wrote: Hello all, I have a RHEL 5 (openais) cluster with intermittent issues on the heartbeat network, and was thinking to increase the totem token value to 90s (currently is 30s). Are there any negative effects from this change, apart from the cluster

Re: [Openais] totem token timeout increase

2014-08-27 Thread Steven Dake
on the network that takes it out of service to repair. Regards, -steve BR, Vasil On Thu, Aug 28, 2014 at 12:03 AM, Steven Dake sd...@redhat.com mailto:sd...@redhat.com wrote: On 08/26/2014 07:17 AM, Vasil Valchev wrote: Hello all, I have a RHEL 5 (openais) cluster

Re: [Openais] Request of information about rrp mode passive versus rrp mode active

2013-11-27 Thread Steven Dake
On 11/27/2013 08:20 AM, Moullé Alain wrote: Hi, the man page of corosync.conf gives : Active replication offers slightly lower latency from transmit to delivery in faulty network environments but with less performance. Passive replication may nearly double the speed of the totem protocol if

Re: [Openais] question stack Pacemaker/corosync on SLES11

2012-03-08 Thread Steven Dake
On 03/08/2012 03:10 AM, Tim Serong wrote: On 03/08/2012 07:23 PM, alain.mou...@bull.net wrote: Hi Darren And thanks. I effectively found that the stack is started with the service openais : no more 'corosync' neither 'pacemaker' lsb scripts. But I'am surprised because I think I remind that

[Openais] change to commit policy

2011-09-09 Thread Steven Dake
Russell pointed out a problem with his recent patch for mutexes. It is only applicable to 1.4/1.3 branches. It is not applicable to master. Currently our policy is that all patches go into master, and 1 person is responsible for backports to other branches. This leaves out the important case

Re: [Openais] Installing corosync from source

2011-09-07 Thread Steven Dake
On 09/06/2011 06:05 PM, Nick Khamis wrote: Hello Everyone, We are moving everything over from heartbeat, after the last update brought the cluster to it's knees... What we are interested in is using corosync, pacemaker to LVS mysql, and asterisk. We have not looked into asterisk yet, and we

Re: [Openais] Configuration Hash Table - API proposal

2011-09-07 Thread Steven Dake
On 09/01/2011 05:17 AM, Jan Friesse wrote: Included is API proposal for replacement of objdb/confdb API. It should keep all good things there (triggers, ...), remove hard to use bits (like whole object idea) and improve existing things (like typing) Even I wrote it before, also configuration

[Openais] [PATCH] Allow nss building conditionally with rpmbuild operation

2011-09-02 Thread Steven Dake
Signed-off-by: Steven Dake sd...@redhat.com --- corosync.spec.in |8 1 files changed, 8 insertions(+), 0 deletions(-) diff --git a/corosync.spec.in b/corosync.spec.in index 74ab851..5c651aa 100644 --- a/corosync.spec.in +++ b/corosync.spec.in @@ -11,6 +11,7 @@ %bcond_with snmp

Re: [Openais] cpg behavior on transitional membership change

2011-09-02 Thread Steven Dake
On 09/02/2011 12:59 AM, Vladislav Bogdanov wrote: Hi all, I'm trying to further investigate problem I described at https://www.redhat.com/archives/cluster-devel/2011-August/msg00133.html The main problem for me there is that pacemaker first sees transitional membership with left nodes,

[Openais] [PATCH] Ignore memb_join messages during flush operations

2011-09-01 Thread Steven Dake
a memb_join operation that occurs during flushing can result in an entry into the GATHER state from the RECOVERY state. This results in the regular sort queue being used instead of the recovery sort queue, resulting in segfault. Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemudp.c

[Openais] test two

2011-09-01 Thread Steven Dake
___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman/listinfo/openais

[Openais] test

2011-09-01 Thread Steven Dake
___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman/listinfo/openais

[Openais] test 3

2011-09-01 Thread Steven Dake
test 3 ___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman/listinfo/openais

Re: [Openais] Corosync 2.0 Feature Request: Notification of the status change for each rings via SNMP

2011-09-01 Thread Steven Dake
to match the new model. Regards -steve Regards, Keisuke MORI 2011/7/22 Steven Dake sd...@redhat.com: The Corosync flatiron 1.y series had many more features added then I would have liked, but the development team feels the 1.y series addresses any major gaps users of the software have had

[Openais] Sorry for noise on mailing list

2011-09-01 Thread Steven Dake
The mailing list server had a short outage. Apologies for noise on the mailing list. Regards -steve ___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman/listinfo/openais

Re: [Openais] aisexec crashes with SIGABRT

2011-08-30 Thread Steven Dake
://www.multiservice.com +1.913.663.9483 tel:%2B1.913.663.9483 (direct) +1.816.718.0468 tel:%2B1.816.718.0468 (mobile) +1.913.217.9318 tel:%2B1.913.217.9318 (fax) On Mon, Aug 22, 2011 at 12:13, Steven Dake sd...@redhat.com mailto:sd...@redhat.com wrote: On 08

Re: [Openais] aisexec crashes with SIGABRT

2011-08-30 Thread Steven Dake
Thanks, -- Chris Kirke Director - Systems Architecture Multi Service Corporation www.multiservice.com http://www.multiservice.com +1.913.663.9483 (direct) +1.816.718.0468 (mobile) +1.913.217.9318 (fax) On Mon, Aug 29, 2011 at 22:43, Steven Dake sd...@redhat.com mailto:sd

[Openais] [PATCH] Remove hdb.h header includes from unnecessary files

2011-08-23 Thread Steven Dake
The files in this patch do not use the hdb.h header. Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemrrp.c |1 - exec/totemsrp.c |1 - exec/totemudp.c |1 - exec/totemudp.h |1 - exec/totemudpu.c |1 - exec/totemudpu.h |1 - 6 files changed, 0 insertions

[Openais] [PATCH] Get rid of hdb usage in totempg.h interface

2011-08-23 Thread Steven Dake
hdb has some expense and is not necessary in the totempg.so runtime. This patch removes the dependence on hdb and instead uses a direct pointer. Signed-off-by: Steven Dake sd...@redhat.com --- exec/main.c |2 +- exec/sync.c |2 +- exec/syncv2.c

Re: [Openais] aisexec crashes with SIGABRT

2011-08-22 Thread Steven Dake
On 08/22/2011 09:58 AM, Christopher A. Kirke wrote: currently using the REL5-provided package on two nodes on local Cisco-switched LAN: openais.x86_64 0.80.6-28.el5_6.1 installed with following configuration: # Please read the openais.conf.5 manual

[Openais] [PATCH] Move cs_queue.h from include directory to exec directory

2011-08-22 Thread Steven Dake
This file is only used by totemsrp.c. Move out of general include directory. Signed-off-by: Steven Dake sd...@redhat.com --- exec/Makefile.am|2 +- exec/cs_queue.h | 229 +++ exec/totemsrp.c |2 +- include

Re: [Openais] Problems forming cluster on corosync startup

2011-08-15 Thread Steven Dake
client as locked up and rebooted the units. It definitely rules out #2. I can repeat the test with healthchecking disabled to narrow down if #1 or #3 will occur. Regards, Tim On Thu, Aug 11, 2011 at 4:21 AM, Steven Dake sd...@redhat.com wrote: On 08/09/2011 09:56 PM, Tim Beale wrote: Hi

Re: [Openais] Problems forming cluster on corosync startup

2011-08-10 Thread Steven Dake
) memb_set_and( diff1, diff2, diff ) Thanks, Tim On Mon, Aug 8, 2011 at 9:45 PM, Steven Dake sd...@redhat.com wrote: On 08/08/2011 12:10 AM, Tim Beale wrote: Hi Steve, Thanks for your help. I tried out your patch but the problem still occurs. The problem looks to me due to the ring-IDs

Re: [Openais] [PATCH 1/2] cpg: Handle errors from totem_mcast

2011-08-09 Thread Steven Dake
On second consideration this patch is Reviewed-by: Steven Dake sd...@redhat.com On 08/08/2011 09:11 AM, Steven Dake wrote: On 07/28/2011 07:20 AM, Jan Friesse wrote: totem_mcast function can return -1 if corosync is overloaded. Sadly in many calls of this functions was error code ether

Re: [Openais] [PATCH 2/2] cfg: Handle errors from totem_mcast

2011-08-09 Thread Steven Dake
On second consideration this patch is Reviewed-by: Steven Dake sd...@redhat.com On 08/08/2011 09:15 AM, Steven Dake wrote: Before accepting an IPC message, ipc checks that the totem queue has available room for new messages. As a result this patch is either not necessary or fixes the wrong

[Openais] [PATCH 1/4] Fix problem in totemiba where incorrect define is used (and also not defined)

2011-08-09 Thread Steven Dake
Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemiba.c |4 +++- 1 files changed, 3 insertions(+), 1 deletions(-) diff --git a/exec/totemiba.c b/exec/totemiba.c index 2d8c690..a16f88a 100644 --- a/exec/totemiba.c +++ b/exec/totemiba.c @@ -70,6 +70,8 @@ #include corosync/list.h

[Openais] [PATCH 2/4] Define totemiba_log_printf properly

2011-08-09 Thread Steven Dake
Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemiba.c |8 +--- 1 files changed, 5 insertions(+), 3 deletions(-) diff --git a/exec/totemiba.c b/exec/totemiba.c index a16f88a..008018a 100644 --- a/exec/totemiba.c +++ b/exec/totemiba.c @@ -187,13 +187,15 @@ struct

[Openais] [PATCH 4/4] Remove -lcoroipcc from tools/Makefile.am notifyd

2011-08-09 Thread Steven Dake
Signed-off-by: Steven Dake sd...@redhat.com --- tools/Makefile.am |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/tools/Makefile.am b/tools/Makefile.am index f88e741..2699519 100644 --- a/tools/Makefile.am +++ b/tools/Makefile.am @@ -55,7 +55,7

[Openais] [PATCH 3/4] properly define rec_token_cq_send_event_fn

2011-08-09 Thread Steven Dake
Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemiba.c |5 - 1 files changed, 4 insertions(+), 1 deletions(-) diff --git a/exec/totemiba.c b/exec/totemiba.c index 008018a..ffcfceb 100644 --- a/exec/totemiba.c +++ b/exec/totemiba.c @@ -562,7 +562,10 @@ static int

[Openais] [PATCH] Make joined and left lists deliver correct results

2011-08-08 Thread Steven Dake
Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemsrp.c | 47 ++- 1 files changed, 42 insertions(+), 5 deletions(-) diff --git a/exec/totemsrp.c b/exec/totemsrp.c index 4a299a0..a97ed49 100644 --- a/exec/totemsrp.c +++ b/exec/totemsrp.c

Re: [Openais] Problems forming cluster on corosync startup

2011-08-08 Thread Steven Dake
the transitional list to determine their contents, which is not correct. This results in incorrect data delivered to clm. Try the follow-up patch which should correctly calculate the joined and left lists. Thanks, Tim On Mon, Aug 8, 2011 at 6:08 AM, Steven Dake sd...@redhat.com wrote: On 08

[Openais] feature proposal: take 2 of quorum

2011-08-08 Thread Steven Dake
On 08/08/2011 12:25 AM, Fabio M. Di Nitto wrote: On 8/7/2011 6:57 PM, Steven Dake wrote: Believe many in community are on vacation during our proposal window. As a result, I'm extending until Aug 30th. topic-quorum ? as we discussed recently on IRC, in order to replace cman. can you

Re: [Openais] [PATCH] Add systemd unit files for corosync and corosync-notifyd

2011-08-08 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com Regards -steve On 08/08/2011 04:04 AM, Angus Salkeld wrote: Signed-off-by: Angus Salkeld asalk...@redhat.com --- configure.ac |8 corosync.spec.in | 12 init/.gitignore

Re: [Openais] [PATCH] Revert totemsrp: Remove recv_flush code

2011-08-08 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 07/27/2011 05:49 AM, Jan Friesse wrote: This reverts commit 2167 Reversion is needed to remove overflow of receive buffers and dropping messages. Signed-off-by: Jan Friesse jfrie...@redhat.com --- branches/whitetank/exec/totemnet.c | 45

Re: [Openais] [PATCH] coroipcc: use malloc for path in service_connect

2011-08-08 Thread Steven Dake
Reiewed-by: Steen Dake sd...@redhat.com On 07/27/2011 08:31 AM, Jan Friesse wrote: Coroipcc appropriately uses PATH_MAX sized variables for various data structures handling files in the initialization of the client. Due to the use of 12 of these structures declared as stack variables, the

Re: [Openais] [PATCH 1/2] cpg: Handle errors from totem_mcast

2011-08-08 Thread Steven Dake
On 07/28/2011 07:20 AM, Jan Friesse wrote: totem_mcast function can return -1 if corosync is overloaded. Sadly in many calls of this functions was error code ether not handled at all, or handled by assert. Commit changes behaviour to ether return CS_ERR_TRY_AGAIN or put error code to later

Re: [Openais] [PATCH 2/2] cfg: Handle errors from totem_mcast

2011-08-08 Thread Steven Dake
Before accepting an IPC message, ipc checks that the totem queue has available room for new messages. As a result this patch is either not necessary or fixes the wrong thing. See coroipcs.c:697 send_ok = api-sending_allowed (conn_info-service, header-id,

Re: [Openais] [PATCH] Make realtime scheduling optional not the default.

2011-08-07 Thread Steven Dake
Good work Reviewed-by: Steven Dake sd...@redhat.com On 08/07/2011 05:40 AM, Angus Salkeld wrote: Signed-off-by: Angus Salkeld asalk...@redhat.com --- configure.ac |6 ++ exec/main.c| 21 +++-- man/corosync.8 |7 +-- 3 files changed, 26 insertions

Re: [Openais] [GIT PULL] Changes to example configuration file (corosync.conf.example)

2011-08-07 Thread Steven Dake
Florian This has been processed. Apologies for delay - very busy week. Regards -steve On 08/01/2011 07:17 AM, Florian Haas wrote: Steve, please consider pulling the following changes since commit d4fb83e971b6fa9af0447ce0a70345fb20064dc1: main: let poll really stop before

[Openais] Extendng call for Corosync RFEs until Aug 30th

2011-08-07 Thread Steven Dake
Believe many in community are on vacation during our proposal window. As a result, I'm extending until Aug 30th. Regards -steve ___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman/listinfo/openais

[Openais] Corosync 2.0 Feature Request: Experiment with rdma support without using librdmacm

2011-08-07 Thread Steven Dake
The librdmacm libs assume a connection oriented mechanism whereas totem assumes a connectionless oriented operation. The RDMA technology can be exposed only through ibverbs. The advantage is improved reliability with RDMA networks. In the TODO file this is the topic : topic-rdmaud

[Openais] Corosync 2.0 Feature Request: Use zero-copy operation with RDMA networks

2011-08-07 Thread Steven Dake
Totem currently copies each packet into the network layer. This results in an extra copy in RDMA networks. To reduce cpu utilization and improve performance, allocate these packets from the totem network layer before sending the packet. This removes an extra memory copy operation in RDMA

[Openais] Corosync 2.0 Feature Request: Centralize the encryption/decryption into one file

2011-08-07 Thread Steven Dake
Each network driver has encryption code in it. Centralize that encryption code to one file so that it may be maintained in one file rather then 3 separate drivers. This is the topic-onecrypt topic on the TODO file. Regards -steve ___ Openais mailing

[Openais] [PATCH] Set my_new_memb_list in recovery enter

2011-08-07 Thread Steven Dake
Currently my_new_memb_list is set in commit_enter, resulting in join messages being accepted during commit/recovery phases which are not appropriate to maintain protocol guarantees. Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemsrp.c | 10 +- 1 files changed, 5 insertions

Re: [Openais] Problems forming cluster on corosync startup

2011-08-07 Thread Steven Dake
On 08/03/2011 10:32 PM, Tim Beale wrote: Hi, It looks to me that the way the transition from Recovery to Operational works, we can't guarantee that all nodes in the ring have entered Operational before a node processes another Memb-Join message from a new node. E.g. we can't guarantee the

Re: [Openais] [PATCH 1/6] Remove scheduling

2011-08-05 Thread Steven Dake
I believe a better approach would be to default to standard scheduling and add a new flag --realtime which enables realtime scheduling. Regards -steve On 08/05/2011 12:09 AM, Angus Salkeld wrote: Signed-off-by: Angus Salkeld asalk...@redhat.com --- exec/main.c | 55

Re: [Openais] [PATCH 3/6] Fix some compiler warnings

2011-08-05 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 08/05/2011 12:09 AM, Angus Salkeld wrote: Signed-off-by: Angus Salkeld asalk...@redhat.com --- configure.ac |4 +- exec/crypto.c |2 - exec/main.c |3 -- exec/objdb.c | 76

Re: [Openais] [PATCH 4/6] libqb: Add libqb dependency in the rpm pc file

2011-08-05 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 08/05/2011 12:09 AM, Angus Salkeld wrote: Signed-off-by: Angus Salkeld asalk...@redhat.com --- corosync.spec.in |2 +- pkgconfig/corosync.pc.in |2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git

Re: [Openais] [PATCH 6/6] Update TODOs

2011-08-05 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 08/05/2011 12:09 AM, Angus Salkeld wrote: Signed-off-by: Angus Salkeld asalk...@redhat.com --- TODO | 73 + 1 files changed, 19 insertions(+), 54 deletions(-) diff --git a/TODO b

[Openais] Live demo of Pacemaker Cloud on Fedora: Friday August 5th at 8am PST

2011-08-03 Thread Steven Dake
Extending a general invitation to the high availability communities and other cloud community contributors to participate in a live demo I am giving on Friday August 5th 8am PST (GMT-7). Demo portion of session is 15 minutes and will be provided first followed by more details of our approach to

Re: [Openais] Corosync fails to start under cman

2011-08-03 Thread Steven Dake
On 08/03/2011 04:06 PM, David wrote: I have a 3 node RHCS cluster and prior to an VLAN change (moved the cluster communications into its own VLAN) all three nodes were working. Post VLAN migration 2 of the 3 nodes joined the cluster but a third is failing when I start cman: Starting

Re: [Openais] Corosync (version 1.23 on rhel6) crashes when packets are dropped

2011-08-02 Thread Steven Dake
On 08/02/2011 04:47 PM, Stanley, Ephrim wrote: Hi, I’m evaluating the Qpid messaging broker which uses Corosync for clustering. As part of my cluster break tests, I ran into a problem where Corosync dies without producing any core files or error messages. Is this expected ? Also, what

Re: [Openais] [PATCH] corosync.conf.example: add note about host addresses in bindnetaddr

2011-08-01 Thread Steven Dake
These patches look good. Reviewed-by: Steven Dake sd...@redhat.com Regards -steve On 07/31/2011 11:56 PM, Florian Haas wrote: https://lists.linux-foundation.org/pipermail/openais/2011-July/016563.html Jan Friesse pointed out that bindnetaddr should be set to a host address (as opposed

Re: [Openais] corosync didn't do what I expected

2011-07-29 Thread Steven Dake
On 07/29/2011 12:36 PM, Keith Stevens wrote: I have the following configuration on two servers netbox1 and netbox2: crm(live)configure# show node netbox1 \ attributes standby=off node netbox2 primitive failover-ip ocf:heartbeat:IPaddr \ params ip=216.105.20.43 \

Re: [Openais] Corosync Compatability

2011-07-29 Thread Steven Dake
to restart your cluster. Regards -steve On Tue, July 26, 2011 7:12 pm, Steven Dake wrote: On 07/26/2011 01:52 AM, manish.gu...@ionidea.com wrote: Hi, I am facing problem with redundent Communication Channel. I am using Coroync 1.2 In this auto failback of redundent channel is not Supported

Re: [Openais] Corosync Compatability

2011-07-26 Thread Steven Dake
On 07/26/2011 01:52 AM, manish.gu...@ionidea.com wrote: Hi, I am facing problem with redundent Communication Channel. I am using Coroync 1.2 In this auto failback of redundent channel is not Supported. But 1.4 provide support. Corosync-1.4 id compatiable with which version of

Re: [Openais] vsftype - which one?

2011-07-26 Thread Steven Dake
On 07/26/2011 04:07 AM, Proskurin Kirill wrote: Hello all. I not fully understand that vsftype is really is. Could someone explain it? I plan to make a ~50 nodes cluster with about ~50 resources via pacemaker. All nodes are in out local network with 1Gbis\s NIC What type should I

Re: [Openais] [PATCH] main: let poll really stop before totempg_finalize

2011-07-25 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 07/25/2011 06:23 AM, Jan Friesse wrote: Signed-off-by: Jan Friesse jfrie...@redhat.com --- exec/main.c | 24 +++- 1 files changed, 15 insertions(+), 9 deletions(-) diff --git a/exec/main.c b/exec/main.c index be9e118

Re: [Openais] [PATCH] totemsrp: fix buffer overflows for large clusters ( 100 nodes)

2011-07-24 Thread Steven Dake
Thanks for the submission. Reviewed-by; Steven Dake sd...@redhat.com On 07/24/2011 02:58 AM, MORITA Kazutaka wrote: Signed-off-by: MORITA Kazutaka morita.kazut...@lab.ntt.co.jp --- exec/totemsrp.c |6 +++--- 1 files changed, 3 insertions(+), 3 deletions(-) diff --git a/exec

Re: [Openais] Corosync sends unicast but should multycast

2011-07-22 Thread Steven Dake
Tokens are always sent unicast - this is how the protocol works. thanks -steve On 07/22/2011 07:22 AM, Proskurin Kirill wrote: Hi all. Found odd thing - some of my node send unicast while other send muiltycast and other unicast and multycast... with same configuration and they all work.

Re: [Openais] Corosync sends unicast but should multycast

2011-07-22 Thread Steven Dake
On 07/22/2011 08:01 AM, Proskurin Kirill wrote: On 07/22/2011 06:46 PM, Steven Dake wrote: Tokens are always sent unicast - this is how the protocol works. Thanks for reply. One more thing - then and for what multycast is send? We make some test with network team and try to understand all

Re: [Openais] Ip addr auto detection

2011-07-21 Thread Steven Dake
On 07/21/2011 02:42 AM, Proskurin Kirill wrote: Hello all. In man for corosync.conf suggest to add not current IP addr of a node but her network: For example, if the local interface is 192.168.5.92 with netmask 255.255.255.0, set bindnetaddr to 192.168.5.0. Ok - that`s cool. But If i

Re: [Openais] Multycast unicast as fall back

2011-07-21 Thread Steven Dake
On 07/21/2011 02:30 AM, Proskurin Kirill wrote: Hello all. Is this possible to use multycast as primary way to communication in cluster but fall back to unicast transports if multycast is fail? Different rings with different transports? We have some problems in network switches and

Re: [Openais] About TODO file

2011-07-21 Thread Steven Dake
On 07/21/2011 04:15 AM, Yingliang Yang wrote: Hi, I have downloaded corosync-1.4.0 package. There is a TODO file in the release.But it's updated in October 2010 I would like to know is there any plan in the future. And also, there is an option(enable_watchdog) in the configure file. Will

Re: [Openais] Multycast unicast as fall back

2011-07-21 Thread Steven Dake
On 07/21/2011 06:27 AM, Proskurin Kirill wrote: On 07/21/2011 05:11 PM, Steven Dake wrote: On 07/21/2011 02:30 AM, Proskurin Kirill wrote: Hello all. Is this possible to use multycast as primary way to communication in cluster but fall back to unicast transports if multycast is fail

[Openais] New bugzilla method

2011-07-21 Thread Steven Dake
Hi, We have new bugzilla tracking in place via bugzilla.redhat.com. When filing bugs, please file under Community-Corosync Cluster Engine rather then rawhide or a specific fedora version. If the issue is fedora specific, continue to file under fedora. For other distro specific problems (such

[Openais] Corosync 2.0 (needle) Call for RFEs

2011-07-21 Thread Steven Dake
The Corosync flatiron 1.y series had many more features added then I would have liked, but the development team feels the 1.y series addresses any major gaps users of the software have had. As a result, we are freezing any future feature development of the flatiron branch permanently. We will

Re: [Openais] FAILED TO RECEIVE followed by cluster failure

2011-07-21 Thread Steven Dake
On 07/21/2011 12:19 PM, Jed Smith wrote: Steve, Thank you again for all of the information. I labbed an in-place upgrade and the Corosync 1.4.0 compile brought down the 1.2.1-4ubuntu1 box. All I did was deploy from scratch, create a cluster with 1.2.1-4ubuntu1 and Pacemaker

Re: [Openais] [PATCH 2/3] specfile: use _datadir as var expansion not exec

2011-07-20 Thread Steven Dake
On 07/20/2011 12:48 AM, Jan Friesse wrote: Steven Dake wrote: On 07/19/2011 08:01 AM, Jan Friesse wrote: Signed-off-by: Jan Friesse jfrie...@redhat.com --- corosync.spec.in |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/corosync.spec.in b/corosync.spec.in index

Re: [Openais] Multi State active resource each instance start\Stop

2011-07-19 Thread Steven Dake
On 07/19/2011 03:21 AM, manish.gu...@ionidea.com wrote: Hi, I have configured a multi-state(clone)resource float IP(IP). It is running on all the configure Nodes. I am trying to stop it using crm_resource command crm_resource -r IP:0 -p target-role -v stopped I am

Re: [Openais] [PATCH 1/3] specfile: Correct URL and source0

2011-07-19 Thread Steven Dake
# Runtime bits Requires: corosynclib = %{version}-%{release} Reviewed-by: Steven Dake sd...@redhat.com ___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman/listinfo/openais

Re: [Openais] [PATCH 2/3] specfile: use _datadir as var expansion not exec

2011-07-19 Thread Steven Dake
On 07/19/2011 08:01 AM, Jan Friesse wrote: Signed-off-by: Jan Friesse jfrie...@redhat.com --- corosync.spec.in |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/corosync.spec.in b/corosync.spec.in index 37e53ed..823ad3d 100644 --- a/corosync.spec.in +++

Re: [Openais] [PATCH 3/3] specfile: Install corosync-signals.conf for dbus

2011-07-19 Thread Steven Dake
/corosync-signals.conf +%endif + ## tree fixup # drop static libs rm -f %{buildroot}%{_libdir}/*.a Reviewed-by: Steven Dake sd...@redhat.com ___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org/mailman

Re: [Openais] Some messages still leaked in recovery code

2011-07-19 Thread Steven Dake
On 07/18/2011 07:55 PM, Tim Beale wrote: Hi, I think there is still a slight memory-leak when recovery is entered repeatedly. The recovery messages usually get freed when the operational state is entered. However if recovery is entered several times, without entering the operational state,

Re: [Openais] Add a few more stats for debugging

2011-07-19 Thread Steven Dake
On 07/18/2011 09:14 PM, Tim Beale wrote: Hi, Attached is a patch that adds a few more more stats (the code was actually written by Angus). We find these stats useful - hopefully others will too. Cheers, Tim Great work Reviewed-by: Steven Dake sd...@redhat.com

Re: [Openais] Announcing Corosync 1.4.0

2011-07-18 Thread Steven Dake
On 07/18/2011 08:29 AM, Digimer wrote: On 07/18/2011 10:37 AM, Jan Friesse wrote: Corosync 1.4.0 is available for immediate download from our website. This version brings many enhancements to the software but most visible change is redundant ring auto recovery functionality. Please retrieve

Re: [Openais] Announcing Corosync 1.4.0

2011-07-18 Thread Steven Dake
On 07/18/2011 07:37 AM, Jan Friesse wrote: Corosync 1.4.0 is available for immediate download from our website. This version brings many enhancements to the software but most visible change is redundant ring auto recovery functionality. Please retrieve the latest sources from our website:

Re: [Openais] FAILED TO RECEIVE followed by cluster failure

2011-07-18 Thread Steven Dake
On 07/18/2011 10:38 AM, Jed Smith wrote: Thank you for your reply. On Mon, Jul 18, 2011 at 1:18 PM, Digimer li...@alteeve.com wrote: Is it possible that the switch dropped the multicast group, and didn't reform it fast enough to prevent the cluster from partitioning? Our network guy says

Re: [Openais] FAILED TO RECEIVE followed by cluster failure

2011-07-18 Thread Steven Dake
On 07/18/2011 07:55 PM, Keisuke MORI wrote: Hi, 2011/7/19 Steven Dake sd...@redhat.com: On 07/18/2011 10:38 AM, Jed Smith wrote: Thank you for your reply. On Mon, Jul 18, 2011 at 1:18 PM, Digimer li...@alteeve.com wrote: Is it possible that the switch dropped the multicast group

Re: [Openais] [PATCH 2/2] rrp: Handle rollower in passive rrp properly

2011-07-15 Thread Steven Dake
Great work Reviewed-by: Steven Dake sd...@redhat.com On 07/15/2011 06:31 AM, Jan Friesse wrote: Signed-off-by: Jan Friesse jfrie...@redhat.com --- exec/totemrrp.c | 175 +++ 1 files changed, 112 insertions(+), 63 deletions(-) diff

Re: [Openais] [PATCH] totemconfig: Change default FAIL_TO_RECV_CONST

2011-07-15 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 07/15/2011 09:21 AM, Jan Friesse wrote: Previous default (50) was too low for most modern switch hardware. This may trigger abort because the aru doesn't increase for 50 token rotations combined with a defect in how failed to recv conditions

Re: [Openais] [PATCH] rrp: handle rollover in active rrp properly

2011-07-15 Thread Steven Dake
Reviewed-by: Steven Dake sd...@redhat.com On 07/15/2011 09:31 AM, Jan Friesse wrote: Signed-off-by: Jan Friesse jfrie...@redhat.com --- exec/totemrrp.c | 24 +++- 1 files changed, 23 insertions(+), 1 deletions(-) diff --git a/exec/totemrrp.c b/exec/totemrrp.c index

Re: [Openais] [TOTEM ] Process pause detected for XXX ms, flushing membership messages.

2011-07-08 Thread Steven Dake
On 07/08/2011 02:03 AM, Vladislav Bogdanov wrote: I checked the archives and found a patch from some time ago that was never merged. It wasn't verified to resolve the pause timeout problem but t could indeed solve the problem. It wasn't merged because we lacked verification it resolved the

Re: [Openais] [PATCH] totemiba: free send_buf on ibv_reg_mr failure

2011-07-07 Thread Steven Dake
); + free (send_buf); return (NULL); } list_init (send_buf-list_all); Reviewed-by: Steven Dake sd...@redhat.com Thanks! -steve ___ Openais mailing list Openais@lists.linux-foundation.org https://lists.linux-foundation.org

[Openais] [PATCH] Speculatory patch that may correct tlbe...@gmail.com's reported problem

2011-07-07 Thread Steven Dake
May not work at all or correct problem - would appreciate feedback Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemsrp.c |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/exec/totemsrp.c b/exec/totemsrp.c index 3dcc05e..5a3bfaa 100644 --- a/exec/totemsrp.c +++ b

[Openais] [PATCH] take 2 Speculatory patch that may correct tlbe...@gmail.com's reported problem

2011-07-07 Thread Steven Dake
May not work at all or correct problem - would appreciate feedback Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemsrp.c |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/exec/totemsrp.c b/exec/totemsrp.c index 3dcc05e..5a3bfaa 100644 --- a/exec/totemsrp.c +++ b

Re: [Openais] Question about recovery code

2011-07-07 Thread Steven Dake
specific to my system. Let me know if there's any debug or anything you want me to try out. Thanks, Tim On Thu, Jul 7, 2011 at 3:47 PM, Steven Dake sd...@redhat.com wrote: On 07/06/2011 05:24 PM, Tim Beale wrote: Hi, We've hit a problem in the recovery code and I'm struggling

[Openais] [PATCH] take 3 Speculatory patch that may correct tlbe...@gmail.com's reported problem

2011-07-07 Thread Steven Dake
May not work at all or correct problem - would appreciate feedback Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemsrp.c |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/exec/totemsrp.c b/exec/totemsrp.c index 3dcc05e..16de74d 100644 --- a/exec/totemsrp.c +++ b

Re: [Openais] [PATCH] Add more pause timeout resets

2011-07-06 Thread Steven Dake
On 07/05/2011 04:51 PM, Russell Bryant wrote: On Tue, Jul 5, 2011 at 2:14 PM, Steven Dake sd...@redhat.com wrote: Signed-off-by: Steven Dake sd...@redhat.com --- exec/totemsrp.c | 14 ++ 1 files changed, 14 insertions(+), 0 deletions(-) diff --git a/exec/totemsrp.c b/exec

Re: [Openais] startup error - getpwnam_r() returns ERANGE for some systems

2011-07-06 Thread Steven Dake
On 07/05/2011 07:20 PM, Tim Beale wrote: Hi, We've just upgraded to corosync v1.3.1 and struck a problem with corosync failing to startup. The problem is the getpwnam_r()/getgrnam_r() calls return ERANGE on our system, meaning insufficient buffer space was supplied (the expected buffer

Re: [Openais] Fix compile/runtime issues for _POSIX_THREAD_PROCESS_SHARED 1

2011-07-06 Thread Steven Dake
is optional, but does say: 'For greater portability it is best to always call semctl() with four arguments'. The attached patch does this. Cheers, Tim Tim, Thanks for the work. Is this only a uclibc problem on linux? Reviewed-by: Steven Dake sd...@redhat.com Thanks -steve

[Openais] [PATCH 2/4] build: make RDMA support an RPM build conditional

2011-07-06 Thread Steven Dake
From: Florian Haas florian.h...@linbit.com Enable RDMA in RPM builds by default to maintain the previous behavior (which always included --enable-rdma in the %configure invocation). --- corosync.spec.in |5 + 1 files changed, 5 insertions(+), 0 deletions(-) diff --git a/corosync.spec.in

[Openais] [PATCH 3/4] build: set RDMA related _LIBS and _CFLAGS only if building with RDMA support

2011-07-06 Thread Steven Dake
From: Florian Haas florian.h...@linbit.com Having to force {ibverbs,rdmacm}_{LIBS,CFLAGS} looks positively odd; so this may warrant further review. However, they are definitely not needed if building without RDMA support. --- corosync.spec.in |2 ++ 1 files changed, 2 insertions(+), 0

[Openais] [PATCH 4/4] build: disable RDMA support in RPMs by default

2011-07-06 Thread Steven Dake
From: Florian Haas florian.h...@linbit.com Rather than curiously disable RDMA support by default in configure and enable it by default in RPM builds, streamline the default configuration to always turn RDMA support off. It can be enabled in RPM builds with --with rdma. --- corosync.spec.in |

Re: [Openais] [PATCH 4/4] build: disable RDMA support in RPMs by default

2011-07-06 Thread Steven Dake
On 07/06/2011 06:52 AM, Steven Dake wrote: From: Florian Haas florian.h...@linbit.com Rather than curiously disable RDMA support by default in configure and enable it by default in RPM builds, streamline the default configuration to always turn RDMA support off. It can be enabled in RPM

Re: [Openais] [GIT PULL] Minor fixes for RPM builds

2011-07-06 Thread Steven Dake
On 07/06/2011 07:08 AM, Florian Haas wrote: On 2011-07-06 15:59, Steven Dake wrote: On 07/06/2011 06:56 AM, Florian Haas wrote: On 2011-07-06 15:49, Steven Dake wrote: Florian, I'll take improvements however I can get them, but sending patches to the list is preferred that way multiple

Re: [Openais] [PATCH 1/4] build: force LC_ALL=C correctly for dates

2011-07-06 Thread Steven Dake
Thanks for the patch Reviewed-by: Steven Dake sd...@redhat.com On 07/06/2011 06:52 AM, Steven Dake wrote: From: Florian Haas florian.h...@linbit.com Failure to force C dates will have RPM et al. complain about invalid dates and timestamps. --- Makefile.am |4 ++-- 1 files changed, 2

Re: [Openais] [PATCH 2/4] build: make RDMA support an RPM build conditional

2011-07-06 Thread Steven Dake
On 07/06/2011 01:02 PM, Florian Haas wrote: On 07/06/2011 03:52 PM, Steven Dake wrote: From: Florian Haas florian.h...@linbit.com Enable RDMA in RPM builds by default to maintain the previous behavior (which always included --enable-rdma in the %configure invocation). Steve, seeing

Re: [Openais] Question about recovery code

2011-07-06 Thread Steven Dake
On 07/06/2011 05:24 PM, Tim Beale wrote: Hi, We've hit a problem in the recovery code and I'm struggling to understand why we do the following: /* * The recovery sort queue now becomes the regular * sort queue. It is necessary to copy the state * into the

Re: [Openais] [TOTEM ] Process pause detected for XXX ms, flushing membership messages.

2011-07-05 Thread Steven Dake
On 07/05/2011 07:26 AM, Vladislav Bogdanov wrote: Hi all, Last days I see following messages in logs: [TOTEM ] Process pause detected for XXX ms, flushing membership messages. After that ring is quickly re-established. DLM/clvmd notifies this and switches to kern_stop waiting for fencing

  1   2   3   4   5   6   7   8   9   10   >