Thanks, I'll try that.

 *** Thomas
This communication is confidential and intended solely for the addressee(s). 
Any unauthorized review, use, disclosure or distribution is prohibited. If you 
believe this message has been sent to you in error, please notify the sender by 
replying to this transmission and delete the message without disclosing it. 
Thank you.
E-mail including attachments is susceptible to data corruption, interruption, 
unauthorized amendment, tampering and viruses, and we only send and receive 
e-mails on the basis that we are not liable for any such corruption, 
interception, amendment, tampering or viruses or any consequences thereof.


-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Andrew Beekhof
Sent: 10 May 2007 16:45
To: General Linux-HA mailing list
Subject: Re: [Linux-HA] nodes stays offline after communication is restored

On 5/10/07, Thomas Åkerblom (HF/EBC) <[EMAIL PROTECTED]> wrote:
> Hi.
> I'm still having this problem.
> If it is the same as bug 1546:
> What will changeset:10555:9addb46282eb mean in terms of heartbeat version?
> Can I download this?

you can always download the latest development version at this link:
   http://hg.linux-ha.org/dev/archive/tip.tar.bz2

or a specific version at:
   http://hg.linux-ha.org/dev/archive/{version}.tar.bz2

or in this case:
   http://hg.linux-ha.org/dev/archive/9addb46282eb.tar.bz2
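To make the URL pattern above concrete, here is a minimal sketch that turns a changeset string such as "10555:9addb46282eb" into the matching archive URL. The function name archive_url and the constant ARCHIVE_BASE are made up for illustration. The sketch assumes the part after the last colon is the hash that goes into the URL, as in the example above.

```python
# Base of the archive URLs quoted above.
ARCHIVE_BASE = "http://hg.linux-ha.org/dev/archive"


def archive_url(changeset: str = "tip") -> str:
    """Return the tarball URL for a changeset (hypothetical helper).

    Accepts "tip", a bare hash such as "9addb46282eb", or the
    "revision:hash" form such as "10555:9addb46282eb".
    """
    # Assumption: only the hash after the last colon is used in the URL,
    # matching the example link for changeset 10555:9addb46282eb.
    node = changeset.strip().rpartition(":")[2]
    if not node:
        raise ValueError(f"no changeset hash found in {changeset!r}")
    return f"{ARCHIVE_BASE}/{node}.tar.bz2"


if __name__ == "__main__":
    print(archive_url("10555:9addb46282eb"))
    print(archive_url())
```

Called with no argument it returns the link to the latest development version.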


> /Thomas
>
>  *** Thomas
>
> -----Original Message-----
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Andrew Beekhof
> Sent: 25 April 2007 14:40
> To: General Linux-HA mailing list
> Subject: Re: [Linux-HA] nodes stays offline after communication is restored
>
> On 4/23/07, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote:
> > On Mon, Apr 23, 2007 at 10:27:55AM +0200, Thomas Åkerblom (HF/EBC) wrote:
> > > Hi.
> > >
> > > OS:           SLES10
> > > Linux-HA      2.0.8
> > >
> > > I have a system with two nodes (HA-1 & HA-2) and one standby (HA-3).
> > > To illustrate my problem I have set up HA to define two alias addresses 
> > > each on the two hosts HA-1 & HA-2.
> > > After initiation all is OK and crm_mon on all three nodes shows that all 
> > > nodes are online.
> > > When I unplug the network cable to HA-2, HA-3 takes over, and at that 
> > > point everything still seems OK.
> > > HA-1 and HA-3 are online and HA-2 is offline.
> > > HA-2 is still running but considers HA-1 & HA-3 to be offline, as they are.
> > > The problem starts when I plug the network cable back into HA-2.
> > > The situation persists: HA-2 is offline to HA-1 & HA-3 and vice versa.
> > > I have a persistent split brain situation.
> > > In the syslog I can see that they recognize each other to be alive but it 
> > > just doesn't appear to be good enough.
> > > Is my configuration faulty?
> > > I'm attaching the syslogs from the time when I plugged the cable back and 
> > > for a short time after.
> >
> > Looks like a problem somewhere in CCM, but I couldn't see anything obvious.
>
> I'd bet this is another instance of bug 1546
>
>     http://old.linux-foundation.org/developer_bugzilla/show_bug.cgi?id=1546
>
> >
> > BTW, you're running a version of heartbeat recently pulled from
> > the dev branch (or you downloaded a compiled package from
> > somewhere) which has lame logging, i.e. all messages are tagged
> > with "logd" which is not very useful. It was me that broke it, but
> > then fixed it on Thursday, so you should either get the newer code
> > or, since this bug is exercised only if ha_logd logs through
> > syslog, change your logging config accordingly.
> >
> > Thanks.
> >
> > Dejan
> >
> > > I also attach the cib.xml and the ha.cf
> > >
> > >
> > >  <<messages_HA-1>>   <<messages_HA-2>>   <<messages_HA-3>>
> > >
> > >  <<ha.cf>>  <<cib.xml>>
> > >
> > > Regards
> > >  *** Thomas
> > >
> > >
> >
> >
> >
> >
> >
> > Content-Description: cib.xml
> > >  <cib admin_epoch="0" have_quorum="true" ignore_dtd="false" num_peers="3" 
> > > ccm_transition="4" cib_feature_revision="1.3" generated="true" 
> > > dc_uuid="7e126899-9c1f-477c-9e0e-7d28620f89da" epoch="1" num_updates="32" 
> > > cib-last-written="Mon Apr 23 09:39:36 2007">
> > >    <configuration>
> > >      <crm_config>
> > >        <cluster_property_set id="cl_pr_set_clust1">
> > >          <attributes>
> > >            <nvpair id="transition_idle_timeout" 
> > > name="transition_idle_timeout" value="120s"/>
> > >            <nvpair id="symmetric_cluster" name="symmetric_cluster" 
> > > value="false"/>
> > >            <nvpair id="no_quorum_policy" name="no_quorum_policy" 
> > > value="ignore"/>
> > >            <nvpair id="suppress_cib_writes" name="suppress_cib_writes" 
> > > value="true"/>
> > >            <nvpair id="default_resource_failure_stickiness" 
> > > name="default_resource_failure_stickiness" value="-INFINITY"/>
> > >          </attributes>
> > >        </cluster_property_set>
> > >      </crm_config>
> > >      <nodes>
> > >        <node id="7e126899-9c1f-477c-9e0e-7d28620f89da" uname="ha-3" 
> > > type="normal"/>
> > >        <node id="a4b83df2-1dce-4b4a-b5fa-033a31ebb95a" uname="ha-2" 
> > > type="normal"/>
> > >        <node id="aeb823fa-8533-4814-a2cd-1073fbfe11c3" uname="ha-1" 
> > > type="normal"/>
> > >      </nodes>
> > >      <resources>
> > >        <group id="group_lim1">
> > >          <primitive id="rsc_lim1_AliasIp1" class="ocf" type="IPaddr" 
> > > provider="heartbeat">
> > >            <operations>
> > >              <op id="op_stop_lim1_AliasIp1" name="stop" timeout="20s" 
> > > on_fail="stop"/>
> > >              <op id="op_start_lim1_AliasIp1" name="start" timeout="20s" 
> > > on_fail="restart"/>
> > >              <op id="op_monitor_lim1_AliasIp1" name="monitor" 
> > > interval="30s" timeout="10s" on_fail="restart"/>
> > >            </operations>
> > >            <instance_attributes id="ia_lim1_AliasIp1">
> > >              <attributes>
> > >                <nvpair name="ip" value="192.168.10.31" 
> > > id="lim1_AliasIp1"/>
> > >                <nvpair name="netmask" value="24" id="lim1_AliasMask1"/>
> > >                <nvpair name="nic" value="eth0" id="lim1_AliasIf1"/>
> > >              </attributes>
> > >            </instance_attributes>
> > >          </primitive>
> > >          <primitive id="rsc_lim1_AliasIp2" class="ocf" type="IPaddr" 
> > > provider="heartbeat">
> > >            <operations>
> > >              <op id="op_stop_lim1_AliasIp2" name="stop" timeout="20s" 
> > > on_fail="stop"/>
> > >              <op id="op_start_lim1_AliasIp2" name="start" timeout="20s" 
> > > on_fail="restart"/>
> > >              <op id="op_monitor_lim1_AliasIp2" name="monitor" 
> > > interval="30s" timeout="10s" on_fail="restart"/>
> > >            </operations>
> > >            <instance_attributes id="ia_lim1_AliasIp2">
> > >              <attributes>
> > >                <nvpair name="ip" value="192.168.11.31" 
> > > id="lim1_AliasIp2"/>
> > >                <nvpair name="netmask" value="24" id="lim1_AliasMask2"/>
> > >                <nvpair name="nic" value="eth1" id="lim1_AliasIf2"/>
> > >              </attributes>
> > >            </instance_attributes>
> > >          </primitive>
> > >        </group>
> > >        <group id="group_lim2">
> > >          <primitive id="rsc_lim2_AliasIp1" class="ocf" type="IPaddr" 
> > > provider="heartbeat">
> > >            <operations>
> > >              <op id="op_stop_lim2_AliasIp1" name="stop" timeout="20s" 
> > > on_fail="stop"/>
> > >              <op id="op_start_lim2_AliasIp1" name="start" timeout="20s" 
> > > on_fail="restart"/>
> > >              <op id="op_monitor_lim2_AliasIp1" name="monitor" 
> > > interval="30s" timeout="10s" on_fail="restart"/>
> > >            </operations>
> > >            <instance_attributes id="ia_lim2_AliasIp1">
> > >              <attributes>
> > >                <nvpair name="ip" value="192.168.10.32" 
> > > id="lim2_AliasIp1"/>
> > >                <nvpair name="netmask" value="24" id="lim2_AliasMask1"/>
> > >                <nvpair name="nic" value="eth0" id="lim2_AliasIf1"/>
> > >              </attributes>
> > >            </instance_attributes>
> > >          </primitive>
> > >          <primitive id="rsc_lim2_AliasIp2" class="ocf" type="IPaddr" 
> > > provider="heartbeat">
> > >            <operations>
> > >              <op id="op_stop_lim2_AliasIp2" name="stop" timeout="20s" 
> > > on_fail="stop"/>
> > >              <op id="op_start_lim2_AliasIp2" name="start" timeout="20s" 
> > > on_fail="restart"/>
> > >              <op id="op_monitor_lim2_AliasIp2" name="monitor" 
> > > interval="30s" timeout="10s" on_fail="restart"/>
> > >            </operations>
> > >            <instance_attributes id="ia_lim2_AliasIp2">
> > >              <attributes>
> > >                <nvpair name="ip" value="192.168.11.32" 
> > > id="lim2_AliasIp2"/>
> > >                <nvpair name="netmask" value="24" id="lim2_AliasMask2"/>
> > >                <nvpair name="nic" value="eth1" id="lim2_AliasIf2"/>
> > >              </attributes>
> > >            </instance_attributes>
> > >          </primitive>
> > >        </group>
> > >      </resources>
> > >      <constraints>
> > >        <rsc_location id="run_group_lim1" rsc="group_lim1">
> > >          <rule id="pref_run_group_lim1" score="100" boolean_op="and">
> > >            <expression id="exp_pref_run_group_lim1" attribute="#uname" 
> > > operation="eq" value="ha-1"/>
> > >          </rule>
> > >        </rsc_location>
> > >        <rsc_location id="run_group_lim1_Sby1" rsc="group_lim1">
> > >          <rule id="pref_run_group_lim1_Sby1" score="50" boolean_op="and">
> > >            <expression id="exp_pref_run_group_lim1_Sby1" 
> > > attribute="#uname" operation="eq" value="ha-3"/>
> > >          </rule>
> > >        </rsc_location>
> > >        <rsc_location id="run_group_lim2" rsc="group_lim2">
> > >          <rule id="pref_run_group_lim2" score="100" boolean_op="and">
> > >            <expression id="exp_pref_run_group_lim2" attribute="#uname" 
> > > operation="eq" value="ha-2"/>
> > >          </rule>
> > >        </rsc_location>
> > >        <rsc_location id="run_group_lim2_Sby1" rsc="group_lim2">
> > >          <rule id="pref_run_group_lim2_Sby1" score="50" boolean_op="and">
> > >            <expression id="exp_pref_run_group_lim2_Sby1" 
> > > attribute="#uname" operation="eq" value="ha-3"/>
> > >          </rule>
> > >        </rsc_location>
> > >        <rsc_colocation id="notSame1_2" from="group_lim1" to="group_lim2" 
> > > score="-INFINITY"/>
> > >      </constraints>
> > >    </configuration>
> > >  </cib>
> >
> > > _______________________________________________
> > > Linux-HA mailing list
> > > Linux-HA@lists.linux-ha.org
> > > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > > See also: http://linux-ha.org/ReportingProblems
> >
> > --
> > Dejan
> >
>
