Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-02 Thread minh chau
ge- >>> From: Gary Lee [mailto:gary@dektech.com.au] >>> Sent: Wednesday, March 02, 2016 4:30 PM >>> To: Mathivanan Naickan Palanivelu >>> Cc: minh.c...@dektech.com.au; opensaf-devel@lists.sourceforge.net; >>> Nagendra Kumar; Praveen Malviya; hans.n

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-02 Thread Anders Widell
h.com.au; opensaf-devel@lists.sourceforge.net; >> Nagendra Kumar; Praveen Malviya; hans.nordeb...@ericsson.com >> Subject: Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience >> at node director [#1620] >> >> Hi Mathi >> >> I think Minh ha

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-02 Thread Mathivanan Naickan Palanivelu
vel@lists.sourceforge.net; >Nagendra Kumar; Praveen Malviya; hans.nordeb...@ericsson.com >Subject: Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience >at node director [#1620] > >Hi Mathi > >I think Minh has previously said "delayed failover" isn

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-02 Thread Gary Lee
Hi Mathi I think Minh has previously said "delayed failover" isn't the best description of what patch 6 is doing. Minh has previously described it better as "adjust HA assignment"; moving transient states to states that realign() can work with. The transient states aren't necessarily caused

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-02 Thread Mathivanan Naickan Palanivelu
Hi All, What is 'delayed failover'? That sounds against the principles of 'software fault isolation'!? Thanks, Mathi. - minh.c...@dektech.com.au wrote: > Hi Praveen, > > Please see comments in line [Minh] > > Thanks, > Minh > > On 02/03/16 18:12, praveen malviya wrote: > > > > > > On 02

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-01 Thread minh chau
Hi Praveen, Please see comments in line [Minh] Thanks, Minh On 02/03/16 18:12, praveen malviya wrote: > > > On 02-Mar-16 12:26 PM, minh chau wrote: >> Hi Praveen, >> >> If node_up of amfnd comes after node sync timer expires, amfd will send >> reboot message to that amfnd, regardless of susi sta

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-01 Thread praveen malviya
On 02-Mar-16 12:26 PM, minh chau wrote: > Hi Praveen, > > If node_up of amfnd comes after node sync timer expires, amfd will send > reboot message to that amfnd, regardless of susi states. > Sending reboot message in avd_comp_pres_state_set() if comp is > inst/term-failed has already been in code

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-01 Thread minh chau
Hi Praveen, If node_up of amfnd comes after node sync timer expires, amfd will send reboot message to that amfnd, regardless of susi states. Sending reboot message in avd_comp_pres_state_set() if comp is inst/term-failed has already been in code base of #1620. The change in #1620 that marks *nod

Re: [devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-03-01 Thread praveen malviya
Hi Minh, One query on patch 03. From headless state when first controller joins, in avd_cluster_tmr_init_evh(), SG is being realigned. During realignment AMF will take care of new assignments but not of those SUSIs whose FSMs are in transition state. Is AMF rebooting the node which hosts SUs

[devel] [PATCH 04 of 15] amfnd: Add support for cloud resilience at node director [#1620]

2016-02-25 Thread Minh Hon Chau
osaf/services/saf/amf/amfnd/clc.cc | 100 +++-- osaf/services/saf/amf/amfnd/clm.cc | 11 +- osaf/services/saf/amf/amfnd/comp.cc | 42 ++- osaf/services/saf/amf/amfnd/compdb.cc | 45 ++- osaf/services/saf/amf/amfnd/di.cc | 419 ++