Hello again, you're right, i do have DRBD 8.2.1 installed.
Well, you mean downgrading on 0.7.x would be better? This is only a test-cluster so this shouldn't be a problem. But I'll try re-installing my current DRBD-version first and then (if this doesn't help) downgrading to 0.7.x-DRBD. I will report the results ;) Thanks in advance Florian > -----Ursprüngliche Nachricht----- > Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > ha.org] Im Auftrag von Chun Tian (binghe) > Gesendet: Montag, 10. März 2008 14:57 > An: General Linux-HA mailing list > Betreff: Re: AW: AW: [Linux-HA] Switchover problem with DRBD > > Hi, Florian > > I compard my HA config, can almost say, your Heartbeat configure just > can work, but DRBD has something wrong. See this: > > crmd[17381]: 2008/03/05_11:44:34 ERROR: process_lrm_event: LRM > operation DRBD_AFD:1_promote_0 (17) Timed Out (timeout=20000ms) > drbd[18348]: 2008/03/05_11:44:34 DEBUG: r0 notify: post for stop - > counts: active 0 - starting 1 - stopping 1 > drbd[18348]: 2008/03/05_11:44:34 DEBUG: r0: Calling drbdadm -c /etc/ > drbd.conf state r0 > drbd[18348]: 2008/03/05_11:44:44 DEBUG: r0: Exit code 0 > drbd[18348]: 2008/03/05_11:44:44 DEBUG: r0: Command output: Child > process does not terminate! Exiting. No response from the DRBD driver! > Is the module loaded? Unknown/TOO_LARGE > drbd[18348]: 2008/03/05_11:44:44 DEBUG: r0: Calling drbdadm -c /etc/ > drbd.conf cstate r0 > lrmd[17378]: 2008/03/05_11:44:54 WARN: DRBD_AFD:1:notify process (PID > 18348) timed out (try 1). Killing with signal SIGTERM (15). > lrmd[17378]: 2008/03/05_11:44:54 WARN: operation notify[18] on > ocf::drbd::DRBD_AFD:1 for client 17381, its parameters: > CRM_meta_role=[Master] CRM_meta_notify_stop_resource=[DRBD_AFD:0 ] > CRM_meta_notify_operation=[stop] > CRM_meta_notify_start_resource=[DRBD_AFD:1 ] > CRM_meta_notify_stop_uname=[noderz ] > CRM_meta_notify_promote_resource=[DRBD_AFD:1 ] drbd_resource=[r0] > CRM_meta_notify_master_uname=[noderz ] > CRM_meta_notify_demote_uname=[noderz ] CRM_meta_master_max=[1] > CRM_meta_notify_master_resource=[DRBD_AFD:0 ] CRM_meta_timeout=[20000] > CRM_meta_s: pid [18348] timed out > > There's something wrong when HA running drbdadm command, it hangs. By > seeing you drbd.conf, I think you may be using the DRBD 8.x but not > 7.x, am I right? I must say for your case, the more stable DRBD 7.x is > enough: you never want Two-Primary DRBD node. > > Regards, > > Chun Tian (binghe) > > > Hi, > > > > thanks for your reply. > > > > The attachements were already added to the first mail to the list > > but here they are again :) > > > > Thanks > > Florian > > > >> -----Ursprüngliche Nachricht----- > >> Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > >> ha.org] Im Auftrag von Chun Tian (binghe) > >> Gesendet: Montag, 10. März 2008 13:26 > >> An: General Linux-HA mailing list > >> Betreff: Re: AW: [Linux-HA] Switchover problem with DRBD > >> > >> Hi, there > >> > >> If you're using HA 2.x, maybe you should show some parts of your > >> cib.xml. I have running DRBD HA clusters, and I think the key is in > >> the cib.xml > >> > >> Regards, > >> > >> Chun Tian (binghe) > >> > >>> Isn't anybody able to give a hint why promoting the DRBD-instance > >>> fails? :( > >>> > >>>> Von: [EMAIL PROTECTED] [mailto:linux-ha- > [EMAIL PROTECTED] > >>>> ha.org] Im Auftrag von Schmidt, Florian > >>>> Gesendet: Mittwoch, 5. März 2008 15:22 > >>>> An: General Linux-HA mailing list; [EMAIL PROTECTED] > >>>> Betreff: [Linux-HA] Switchover problem with DRBD > >>>> > >>>> Hi everybody, > >>>> > >>>> Testing my 2-node-cluster i got a strange behaviour when stopping > >>>> heartbeat on my primary node. I don't know if it is caused by > >>>> heartbeat > >>>> or DRBD or both, so I post this in both lists. > >>>> > >>>> Starting with this: > >>>> > >>>> ============ > >>>> Last updated: Wed Mar 5 15:01:10 2008 > >>>> Current DC: noderz (91d062c3-ad0a-4c24-b759-acada7f19101) > >>>> 2 Nodes configured. > >>>> 3 Resources configured. > >>>> ============ > >>>> > >>>> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): online > >>>> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online > >>>> > >>>> Master/Slave Set: DRBD > >>>> DRBD_AFD:0 (heartbeat::ocf:drbd): Master noderz > >>>> DRBD_AFD:1 (heartbeat::ocf:drbd): Started nodekrz Resource > >>>> Group: > >>>> Group1 > >>>> Filesystem (heartbeat::ocf:Filesystem): Started noderz > >>>> AFD (lsb:afdha): Started noderz > >>>> Cluster_IP (heartbeat::ocf:IPaddr): Started noderz > >>>> > >>>> > >>>> > >>>> I said /etc/init.d/heartbeat stop on primary node (noderz) and > >>>> expected > >>>> this: > >>>> > >>>> ============ > >>>> Last updated: Wed Mar 5 15:01:10 2008 > >>>> Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d) > >>>> 2 Nodes configured. > >>>> 3 Resources configured. > >>>> ============ > >>>> > >>>> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE > >>>> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online > >>>> > >>>> Master/Slave Set: DRBD > >>>> DRBD_AFD:0 (heartbeat::ocf:drbd): stopped > >>>> DRBD_AFD:1 (heartbeat::ocf:drbd): Master nodekrz > >>>> Resource Group: Group1 > >>>> Filesystem (heartbeat::ocf:Filesystem): Started nodekrz > >>>> AFD (lsb:afdha): Started nodekrz > >>>> Cluster_IP (heartbeat::ocf:IPaddr): Started nodekrz > >>>> > >>>> > >>>> But I got this: > >>>> ============ > >>>> Last updated: Wed Mar 5 14:52:06 2008 > >>>> Current DC: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d) > >>>> 2 Nodes configured. > >>>> 3 Resources configured. > >>>> ============ > >>>> > >>>> Node: noderz (91d062c3-ad0a-4c24-b759-acada7f19101): OFFLINE > >>>> Node: nodekrz (44425bd9-2cba-4d6a-ac62-82a8bb81a23d): online > >>>> > >>>> Master/Slave Set: DRBD > >>>> DRBD_AFD:0 (heartbeat::ocf:drbd): Stopped > >>>> DRBD_AFD:1 (heartbeat::ocf:drbd): Started nodekrz > >>>> > >>>> Failed actions: > >>>> DRBD_AFD:1_promote_0 (node=nodekrz, call=17, rc=-2): Timed Out > >>>> > >>>> > >>>> I added the /var/log/ha-debug of the node, a cibadmin -Q, my ha.cf > >>>> and > >>>> my drbd.conf (if needed) > >>>> > >>>> Would be nice if someone could give me a hint why the switchover > >>>> fails. > >>>> > >>>> Thanks a lot for any help. > >>>> Florian > >>> _______________________________________________ > >>> Linux-HA mailing list > >>> Linux-HA@lists.linux-ha.org > >>> http://lists.linux-ha.org/mailman/listinfo/linux-ha > >>> See also: http://linux-ha.org/ReportingProblems > >> > >> _______________________________________________ > >> Linux-HA mailing list > >> Linux-HA@lists.linux-ha.org > >> http://lists.linux-ha.org/mailman/listinfo/linux-ha > >> See also: http://linux-ha.org/ReportingProblems > > <ha.cf><drbd.conf><ha- > > debug><cib.xml>_______________________________________________ > > Linux-HA mailing list > > Linux-HA@lists.linux-ha.org > > http://lists.linux-ha.org/mailman/listinfo/linux-ha > > See also: http://linux-ha.org/ReportingProblems > > _______________________________________________ > Linux-HA mailing list > Linux-HA@lists.linux-ha.org > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems