Hi!

I think it still depend on HOW corosync failed: If just some process hung or 
died, or the cluster network got unreachable, the node can still be fenced from 
remote (e.g. via SBD through FC).

Regards,
Ulrich

>>> Digimer <li...@alteeve.ca> schrieb am 03.12.2012 um 13:13 in Nachricht
<50bc976e.7040...@alteeve.ca>:
> As I keep saying; If corosync dies, the cluster stack on the node fails.
> So if corosync is dead, that node will not do anything; It will not
> fence, it will not restart services, nothing. If you somehow lose
> corosync on all nodes at the same time, then your cluster is dead.
> 
> Clusters protect against single points of failure. Rarely can multiple
> simultaneous failures be protected against.
> 
> On 12/03/2012 07:07 AM, Hermes Flying wrote:
> > I see, I will look into this.
> > There is one thing I am confused about.
> > If I have Node-A/Node-B/Node-C and let's say Node-A has the VIP.
> > What happens if corosync fails in all nodes? Will all 3 try to kill
> > eachother?
> > 
> > 
> > ------------------------------------------------------------------------
> > *From:* Digimer <li...@alteeve.ca>
> > *To:* Hermes Flying <flyingher...@yahoo.com>
> > *Cc:* General Linux-HA mailing list <linux-ha@lists.linux-ha.org>
> > *Sent:* Monday, December 3, 2012 1:20 PM
> > *Subject:* Re: [Linux-HA] Corosync on cluster with 3+ nodes
> > 
> > Most Fujitsu servers work with fence_ipmilan. I suspect the same is true
> > with the IBM servers. Ask the hardware people if they have out-of-band
> > management on the servers (which is the generic/marketing way of saying
> > IPMI or the like).
> > 
> > On 12/03/2012 05:00 AM, Hermes Flying wrote:
> >> Some are IBM servers mentioning VMK (optional) and others are Fujitsu
> >> mentioning iRMC.
> >> Not sure what can I do in the IBM case
> >>
> >>
> >> ------------------------------------------------------------------------
> >> *From:* Digimer <li...@alteeve.ca <mailto:li...@alteeve.ca>>
> >> *To:* Hermes Flying <flyingher...@yahoo.com 
> > <mailto:flyingher...@yahoo.com>>
> >> *Cc:* General Linux-HA mailing list <linux-ha@lists.linux-ha.org 
> > <mailto:linux-ha@lists.linux-ha.org>>
> >> *Sent:* Sunday, December 2, 2012 10:36 PM
> >> *Subject:* Re: [Linux-HA] Corosync on cluster with 3+ nodes
> >>
> >> I don't know anything about your setup or office situation, so I can't
> >> speak to your specific issues.
> >>
> >> What I can say is that almost all actual servers have remote management;
> >> Fujitsu servers have IPMI, HP servers have iLO, Dell servers have DRAC,
> >> IBM servers have RSA and so on. Most generic servers have at least basic
> >> IPMI. So it's very likely that you will already have what you need to
> >> implement fencing. If you don't, then something like the APC AP7900 is
> >> usually about $500 canadian and will work as a fence device. I don't
> >> know what country you are in, so I can't speak more on availability,
> >> applicability or cost.
> >>
> >> You might seriously want to consider hiring someone to help you with
> >> this setup. Clustering is not a straight-forward topic. The cost to hire
> >> a consultant might save you enough time and headache that it's cheaper
> >> in the end than trying to do it yourself.
> >>
> >> On 12/02/2012 03:31 PM, Hermes Flying wrote:
> >>> I am an application developer and so far I didn't care about HW specs
> >>> (not my responsibility).
> >>> Now it seems I must care and so since I am not sure what HW is being
> >>> recommended for servers I will verify tomorrow and let you know.
> >>> My concern is that by going to pacemaker (although seems to be exactly
> >>> what I need) is that we would need to request for new HW on migrate to
> >>> new deployment. I don't even know if these devices are expensive.
> >>> Did you ever had an issue with this? If I have problem with this, should
> >>> I be looking into different solution? Any suggestion for alternative?
> >>>
> >>>
> >>> ------------------------------------------------------------------------
> >>> *From:* Digimer <li...@alteeve.ca <mailto:li...@alteeve.ca>
> > <mailto:li...@alteeve.ca <mailto:li...@alteeve.ca>>>
> >>> *To:* Hermes Flying <flyingher...@yahoo.com 
> > <mailto:flyingher...@yahoo.com>
> >> <mailto:flyingher...@yahoo.com <mailto:flyingher...@yahoo.com>>>
> >>> *Cc:* General Linux-HA mailing list <linux-ha@lists.linux-ha.org 
> > <mailto:linux-ha@lists.linux-ha.org>
> >> <mailto:linux-ha@lists.linux-ha.org <mailto:linux-ha@lists.linux-ha.org>>>
> >>> *Sent:* Sunday, December 2, 2012 10:19 PM
> >>> *Subject:* Re: [Linux-HA] Corosync on cluster with 3+ nodes
> >>>
> >>> It must be an external device. If, for example, the kernel crashes hard,
> >>> or if you get a spinlock, the system may not respond to anything or a
> >>> service may never stop, blocking a reboot. You can not trust that the
> >>> system is accessible or functioning in any way.
> >>>
> >>> Fencing *must* be an external device. Period.
> >>>
> >>> What kind of servers do you have?
> >>>
> >>> On 12/02/2012 03:15 PM, Hermes Flying wrote:
> >>>> If I have a requirement not to include external HW, is there any other
> >>>> way? I mean, I am not -by far- a linux expert, but how come it doesn't
> >>>> do a restart or halt?
> >>>>
> >>>>
> >>>> ------------------------------------------------------------------------
> >>>> *From:* Digimer <li...@alteeve.ca <mailto:li...@alteeve.ca>
> > <mailto:li...@alteeve.ca <mailto:li...@alteeve.ca>>
> >> <mailto:li...@alteeve.ca <mailto:li...@alteeve.ca>
> > <mailto:li...@alteeve.ca <mailto:li...@alteeve.ca>>>>
> >>>> *To:* Hermes Flying <flyingher...@yahoo.com 
> > <mailto:flyingher...@yahoo.com>
> >> <mailto:flyingher...@yahoo.com <mailto:flyingher...@yahoo.com>>
> >>> <mailto:flyingher...@yahoo.com <mailto:flyingher...@yahoo.com>
> > <mailto:flyingher...@yahoo.com <mailto:flyingher...@yahoo.com>>>>
> >>>> *Cc:* General Linux-HA mailing list <linux-ha@lists.linux-ha.org 
> > <mailto:linux-ha@lists.linux-ha.org>
> >> <mailto:linux-ha@lists.linux-ha.org <mailto:linux-ha@lists.linux-ha.org>>
> >>> <mailto:linux-ha@lists.linux-ha.org 
> > <mailto:linux-ha@lists.linux-ha.org> <mailto:linux-ha@lists.linux-ha.org 
> > <mailto:linux-ha@lists.linux-ha.org>>>>
> >>>> *Sent:* Sunday, December 2, 2012 10:01 PM
> >>>> *Subject:* Re: [Linux-HA] Corosync on cluster with 3+ nodes
> >>>>
> >>>> On 12/02/2012 02:56 PM, Hermes Flying wrote:
> >>>>> Clear! In order to do fence and crash a node, is there a specific HW
> >>>>> requirement to do this?
> >>>>
> >>>> Yes, there must be external hardware (out-of-band management counts as
> >>>> "external", despite physically being on the server's mainboard). The
> >>>> most common fence device is IPMI, iLO, RSA, iDRAC and the like.
> >>>> Alternatives are switched PDUs, like APC's AP7900.
> >>>>
> >>>> --
> >>>> Digimer
> >>>> Papers and Projects: https://alteeve.ca/w/ 
> >>>> What if the cure for cancer is trapped in the mind of a person without
> >>>> access to education?
> >>>>
> >>>>
> >>>
> >>>
> >>> --
> >>> Digimer
> >>> Papers and Projects: https://alteeve.ca/w/ 
> >>> What if the cure for cancer is trapped in the mind of a person without
> >>> access to education?
> >>>
> >>>
> >>
> >>
> >> --
> >> Digimer
> >> Papers and Projects: https://alteeve.ca/w/ 
> >> What if the cure for cancer is trapped in the mind of a person without
> >> access to education?
> >>
> >>
> > 
> > 
> > -- 
> > Digimer
> > Papers and Projects: https://alteeve.ca/w/ 
> > What if the cure for cancer is trapped in the mind of a person without
> > access to education?
> > 
> > 
> 



 
_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to