>> currently doing another conntrackd project and therefore using the
>> Found a minor issue:
>>
>> When the active host is fenced and returns to the cluster, it does not
>> request the current connection tracking states. Therefore state
>> information might be lost. This patch fixes that. Any comm
Hi people
currently doing another conntrackd project and therefore using the
code once again (jippie :)). Found a minor issue:
When the active host is fenced and returns to the cluster, it does not
request the current connection tracking states. Therefore state
information might be lost. This pat
>> There did not have to be a negative location constraint up to now,
>> because the cluster took care of that.
>
> Only because it didn't work correctly.
Okay.
>> Actually, this is a wanted setup. It happened that VMs configs were
>> changed in ways that lead to a VM not being startable any mor
>> With the agent before the mentioned patch, during probe of a newly
>> configured resource, the cluster would have learned that the VM is not
>> available on one of the nodes (ERR_INSTALLED), so it would never start
>> the resource there.
>
> This is exactly the problem with shared storage setup
On 06/27/2011 11:09 AM, Dejan Muhamedagic wrote:
> Hi Dominik,
>
> On Fri, Jun 24, 2011 at 03:50:40PM +0200, Dominik Klein wrote:
>> Hi Dejan,
>>
>> this way, the cluster never learns that it can't start a resource on
>> that node.
>
> This resourc
I'm not sure my fix is correct.
According to
https://github.com/ClusterLabs/resource-agents/commit/96ff8e9ad3d4beca7e063beef156f3b838a798e1#heartbeat/VirtualDomain
this is a regression which was introduced in April '11.
So the fix should be the other way around: Introduce a parameter that
lets
Hi Dejan,
this way, the cluster never learns that it can't start a resource on
that node.
I don't consider this a solution.
Regards
Dominik
___
Linux-HA-Dev: Linux-HA-Dev@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha-d
This fixes the issue described yesterday.
Comments?
Regards
Dominik
exporting patch:
# HG changeset patch
# User Dominik Klein
# Date 1308909599 -7200
# Node ID 2b1615aaca2c90f2f4ab93eb443e5902906fb28a
# Parent 7a11934b142d1daf42a04fbaa0391a3ac47cee4c
RA VirtualDomain: Fix probe if config is
Hi
code snippet from
http://hg.linux-ha.org/agents/raw-file/7a11934b142d/heartbeat/VirtualDomain
(which I believe is the current version)
VirtualDomain_Validate_All() {
if [ ! -r $OCF_RESKEY_config ]; then
if ocf_is_probe; then
ocf_log info "Configuration file $OCF_RESKE
> Am I too paranoid?
I don't think you are. Some non-root user practically being able to remove any
file is certainly a valid concern.
Thing is: I needed an RA that configured a cronjob. Florian suggested
writing "the" symlink RA instead, that could manage symlinks. Apparently
there was an IRC discussio
th
Dominik
#!/bin/bash
#
#
# An OCF RA that manages a symlink
#
# Copyright (c) 2011 Dominik Klein
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of version 2 of the GNU General Public License as
# published by the Free Software Foundation.
#
# This
> This is what I found in pacemaker.spec.in in the repository:
>
> Requires(pre): cluster-glue >= 1.0.6
The 1.0.10 rpm from clusterlabs for opensuse 11.2 just says
"cluster-glue" afaict:
rpm -qR pacemaker
cluster-glue
resource-agents
python >= 2.4
libpacemaker3 = 1.0.10-1.4
libesmtp
net-snmp
rp
Mornin Dejan,
The reason was that libglue2 and cluster-glue were not installed from
the clusterlabs repository, as the rest of the packages were, but
instead they were pulled from the original opensuse repository in an
older version.
>>>
>>> This is what I found in pacemaker.sp
Hi Dejan
>> The reason was that libglue2 and cluster-glue were not installed from
>> the clusterlabs repository, as the rest of the packages were, but
>> instead they were pulled from the original opensuse repository in an
>> older version.
>
> This is what I found in pacemaker.spec.in in the rep
Hi
as some of you might have seen on the pacemaker list, I tried to install
a 3 node cluster and there were ipc issues reported by the cib and
therefore the cluster could not start correctly.
The reason was that libglue2 and cluster-glue were not installed from
the clusterlabs repository, as the
Thanks for inclusion.
While looking through the pushed changes, I spotted two meta-data typos.
See trivial patch.
Regards
Dominik
> Applied and pushed with two minor edits. Thanks a lot!
>
> Cheers,
> Florian
--- conntrackd.orig 2011-02-14 11:43:22.0 +0100
+++ conntrackd 2011-02-14 11:4
Maybe you applied the s/100/$slavescore patch someone sent a couple
weeks ago. I used the last version from thread "New stateful RA:
conntrackd" dated October 27th 3:29pm.
Anyway, here's my version.
Regards
Dominik
On 02/11/2011 01:36 PM, Florian Haas wrote:
> On 2011-02-11 09
> info".
done
Regards
Dominik
--- conntrackd 2011-02-10 12:23:37.054678924 +0100
+++ conntrackd.fghaas 2011-02-11 09:45:39.721300359 +0100
@@ -4,7 +4,7 @@
# An OCF RA for conntrackd
# http://conntrack-tools.netfilter.org/
#
-# Copyright (c) 2010 Dominik Klein
+# Copyright
Not yet. That's why I wrote soon_-ish_ ;)
Any release coming up you want to include this in?
> any news on this?
>
> Cheers,
> Florian
>> Or, put differently: is us tracking the supposed state really necessary,
>> or can we inquire it from the service somehow?
>
> From the submitted RA:
>
>> # You can't query conntrackd whether it is master or slave. It can
>> be both at the same time.
>> # This RA creates a st
order to achieve this with pacemaker we needed to find a resource
>>> agent for conntrackd. Looking at the mailing list we found a couple of
>>> options although we only fully evaluated the RA produced by Dominik
>>> Klein as it appears to be more feature complete than t
Hi
thanks for testing and feedback.
On 01/27/2011 01:37 PM, "Marjan, BlatnikČŠŽ" wrote:
> Conntrackd RA from Dominik Klein works. We can now successfully
> migrate/fail from one node to another one.
>
> At the beginning, we have problems with failing. After reboot/fail
> Maybe I just don't understand it, but isn't that broken?
Jon and I just talked on IRC and my misunderstanding was that he
intended to _start_ conntrackd via an init script and have the cluster
only do the cache committing/flushing and monitoring work.
So from a cluster configuration point of vie
Please don't hate me, here's another update. Some more -ne instead of !=
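For readers unsure why -ne is preferred over != when comparing numbers in a shell script, here is a minimal sketch. The variable names and the value "07" are made up for illustration and are not taken from the RA.

```shell
#!/bin/sh
# Hypothetical value standing in for a number read from a command's output.
count="07"

# != compares strings: "07" and "7" are different strings.
if [ "$count" != "7" ]; then string_differs=yes; else string_differs=no; fi

# -ne compares integers: 07 and 7 are the same number.
if [ "$count" -ne 7 ]; then number_differs=yes; else number_differs=no; fi

echo "string comparison says different: $string_differs"
echo "integer comparison says different: $number_differs"
```

With numeric values such as return codes, the integer operators avoid surprises from leading zeros or padding.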
Dominik
#!/bin/bash
#
#
# An OCF RA for conntrackd
# http://conntrack-tools.netfilter.org/
#
# Copyright (c) 2010 Dominik Klein
#
# This program is free software; you can redistribute it and/or modify
> can you please consult with Dominik Klein, who (unaware of your effort)
> incidentally also wrote an RA for conntrackd and posted it on this list
> a few days ago? It would be nice if the two of you could consolidate
> your efforts and come up with an updated patch. Maybe you gu
conntrackd
# http://conntrack-tools.netfilter.org/
#
# Copyright (c) 2010 Dominik Klein
#
# This program is free software; you can redistribute it and/or modify
# it under the terms of version 2 of the GNU General Public License as
# published by the Free Software Foundation.
#
# This program is dist
#!/bin/bash
#
#
# An OCF RA for conntrackd
# http://conntrack-tools.netfilter.org/
#
# Copyright (c) 2010 Dominik Klein
#
# This program is free software; you can redistribute
Andrew Beekhof wrote:
> On Tue, Jan 12, 2010 at 10:43 AM, Raoul Bhatia [IPAX]
> wrote:
>> On 01/12/2010 10:39 AM, Florian Haas wrote:
> Why not simply set that for root at boot? (it rhymes too :)
because i do not like the idea that each and every process gets
elevated limits by defa
I'd suggest an approach like Florian's from the VirtualDomain RA. Here's
a quote, guess you get the idea.
shutdown_timeout=$((($OCF_RESKEY_CRM_meta_timeout/1000)-5))
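As a rough worked example of that expression, assume a hypothetical operation timeout of 90000 ms. The value is made up for illustration; in a running agent the cluster exports OCF_RESKEY_CRM_meta_timeout itself.

```shell
#!/bin/sh
# Hypothetical value for illustration; normally exported by the cluster (milliseconds).
OCF_RESKEY_CRM_meta_timeout=90000

# Convert milliseconds to seconds and keep 5 seconds of headroom, so the
# agent can still react before the operation itself times out.
shutdown_timeout=$(( (OCF_RESKEY_CRM_meta_timeout / 1000) - 5 ))

echo "shutdown_timeout=${shutdown_timeout}s"
```

The point of deriving the value from the operation timeout is that it follows whatever the administrator configured, instead of being hard-coded in the agent.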
Regards
Dominik
Dejan Muhamedagic wrote:
> Hi Hideo-san,
>
> On Mon, Nov 30, 2009 at 11:00:05AM +0900, renayama19661...@ybb.ne.jp
Dejan Muhamedagic wrote:
> Hi Florian,
>
> On Wed, Sep 16, 2009 at 08:25:30AM +0200, Florian Haas wrote:
>> Lars, Dejan,
>>
>> as discussed on #linux-ha yesterday, I've pushed a small changeset to
>> the Filesystem RA that implements a monitor operation which checks
>> whether I/O on the mounted f
Trivial. See attached patch.
Regards
Dominik
exporting patch:
# HG changeset patch
# User Dominik Klein
# Date 1240578752 -7200
# Node ID 2d97904c385cc9b4779286001611bd748f48589d
# Parent 60cc2d6eee88ff6c2dedf7b539b9ee018efda6da
Low: RA mysql: Correctly remove eventually remaining socket
diff
his RA. I agree the "cmd=" lines and pid file creation are very very
ugly, but I could not yet find a better way. Not that much of a shell
genius I guess :( Please share if you can improve things.
Regards
Dominik
exporting patch:
# HG changeset patch
# User Dominik Klein
# Date 1234350
See changeset description.
exporting patch:
# HG changeset patch
# User Dominik Klein
# Date 1229092041 -3600
# Node ID ba0edd6afc4489ba438a17a246848e4d1e9854ea
# Parent 1a5685e8f1ed6c230ae3892856892e6a3a57d208
RA mysql: Add check for mysqld binary. Pretty hard to debug if the binary is not in
Andrew Beekhof wrote:
> On Wed, Dec 10, 2008 at 16:51, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote:
>>> diff -r 057a73385865 -r 1a5685e8f1ed resources/OCF/drbd
>>> --- a/resources/OCF/drbd Tue Dec 02 20:29:32 2008 +0100
>>> +++ b/resources/OCF/drbd Tue Dec 09 16:10:12 2008 +0100
>>> @@ -3
patch calls drbd_update_prefs if the master role is detected during
the probe. Within that function, crm_master is called and sets an
appropriate promotion score. That keeps the master instance running in
its current location.
Regards
Dominik
exporting patch:
# HG changeset patch
# User Dominik
Knight, Doug wrote:
All,
I am setting up a Filesystem resource to maintain an NFS mount on a
client system. I've configured a monitor function that checks every 15
minutes, timeout 1 minute. When an error occurs with the NFS mount (the
server is down for any length of time, etc), I'd like heartbe
geset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date 1228138736 -3600
# Node ID e87ff08b20ca22647944a56fd772d2673c951457
# Parent 4a47778b2ca99a6b5985baf170bc3897fba2dfee
Medium: RA pingd
diff -r 4a47778b2ca9 -r e87ff08b20ca resources/OCF/pingd
--- a/resources/OCF/pingd Wed Oct 29 13:24:34
Hi
I tried to use this morning's heartbeat dev tip and run hb_report. It
reports
/usr/sbin/hb_report: line 161: syntax error near unexpected token `('
/usr/sbin/hb_report: line 161: `perl -e "use POSIX; print
strftime('%x %X',localtime($1));"'
I can perfectly run that command on the
Dominik Klein wrote:
See user list. Thread "[Linux-HA] Apache failover / renaming the binary"
Regards
Dominik
Again. Second issue though.
exporting patch:
# HG changeset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date 1215073256 -7200
# Node ID db487301a953408ab59a2fc5
See user list. Thread "[Linux-HA] Apache failover / renaming the binary"
Regards
Dominik
exporting patch:
# HG changeset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date 1215066469 -7200
# Node ID 94c262e9af4978ffe6be49f3bcb079750e3ec1a6
# Parent 412d1b01469463510ad3b61e
I'll review the RA and include it in the repository.
I had a look, too as I would have needed something like this soon anyway.
My thoughts are attached in form of a patch file.
Regards
Dominik
exporting patch:
# HG changeset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date
s not
found on a node? err_installed I guess?
[15:58] sounds about right
Regards
Dominik
Exporting patch:
# HG changeset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date 1210687383 -7200
# Node ID 42ce605e3da516db5e0a69b92d6e27433537ab53
# Parent 49b142475fa9925bb440d359816aacc1fe6c44
There was more wrong stuff in the meta-data function. See patch.
Exporting patch:
# HG changeset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date 1210167737 -7200
# Node ID 3e89d5a6edacc2a5d742fdb8eb62f8a56b16be1e
# Parent 0fe9dfacad504390fc1c1c80e6e6cf28c440bd76
More mysql meta-data
meta-data displayed a wrong default value for "socket".
Regards
Dominik
Exporting patch:
# HG changeset patch
# User Dominik Klein <[EMAIL PROTECTED]>
# Date 1210167404 -7200
# Node ID 0fe9dfacad504390fc1c1c80e6e6cf28c440bd76
# Parent ed0972c7aa43699ae2ec31f68cac3b9cbcc5d5c5
M
OCF resource agents should be able to run meta-data, monitor, and
stop even when there is no required software installed:
- meta-data should run ok
- monitor should return 7 (resource stopped)
- stop should return 0 (success)
Can you please fix that?
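A minimal sketch of the behaviour requested above, not the actual fix for any particular agent. The binary path, the function names and the meta-data stub are hypothetical; only the exit codes (0, 5, 7) are the standard OCF values.

```shell
#!/bin/sh
# Standard OCF exit codes.
OCF_SUCCESS=0
OCF_ERR_INSTALLED=5
OCF_NOT_RUNNING=7

# Hypothetical path, for illustration only.
BINARY="${BINARY:-/usr/sbin/exampled}"

have_software() {
    [ -x "$BINARY" ]
}

# Placeholders for the real work; a real agent would talk to the service here.
real_monitor() { return $OCF_SUCCESS; }
real_stop()    { return $OCF_SUCCESS; }
real_start()   { return $OCF_SUCCESS; }

agent_action() {
    case "$1" in
        meta-data)
            # Must work without the software installed.
            echo '<resource-agent name="example"/>'
            return $OCF_SUCCESS
            ;;
        monitor)
            # Nothing installed means nothing can be running here.
            have_software || return $OCF_NOT_RUNNING
            real_monitor
            ;;
        stop)
            # Nothing installed means it is already stopped.
            have_software || return $OCF_SUCCESS
            real_stop
            ;;
        *)
            # Every other action (treated as start in this sketch) needs the software.
            have_software || return $OCF_ERR_INSTALLED
            real_start
            ;;
    esac
}

# Print the exit code each action would produce on this machine.
for action in meta-data monitor stop start; do
    rc=0
    agent_action "$action" >/dev/null || rc=$?
    echo "$action -> $rc"
done
```

The installation check is made per action rather than once at the top of the script, which is what allows meta-data, monitor and stop to succeed on a node without the software.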
I think this also needs fixing in the pgsql