Hi Isuru, On Tue, Oct 28, 2014 at 3:46 PM, Isuru Haththotuwa <isu...@apache.org> wrote:
> Hi, > > On Tue, Oct 28, 2014 at 2:43 PM, Reka Thirunavukkarasu <r...@wso2.com> > wrote: > >> Hi >> >> On Tue, Oct 28, 2014 at 2:22 PM, Isuru Haththotuwa <isu...@apache.org> >> wrote: >> >>> >>> >>> On Tue, Oct 28, 2014 at 1:48 PM, Reka Thirunavukkarasu <r...@wso2.com> >>> wrote: >>> >>>> Hi >>>> >>>> On Tue, Oct 28, 2014 at 1:16 PM, Isuru Haththotuwa <isu...@apache.org> >>>> wrote: >>>> >>>>> Thanks Reka for starting this Thread. >>>>> >>>>> Found two issues related to undeploying an Application: >>>>> 1. https://issues.apache.org/jira/browse/STRATOS-918 - Fixed now. >>>>> >>>>> 2. Undeploying an Application doesn't remove it properly until the >>>>> Member is activated. Looking in to this now. >>>>> >>>> >>>> >>>> We will need this fix for the member fault as well. If cluster monitor >>>> starts a member upon member fault before the whole cluster termination, >>>> then that cluster monitor is becoming active. Hence not going to terminated >>>> state. Looking into that now.. >>>> >>> What is the State Transition in this case? Is it Terminating to Active? >>> If so we might be able to generically handle this, since its a invalid >>> state Transfer and mark the cluster as Invalid, and then terminate. For >>> this, we need to introduce a new error state to cluster statuses. WDYT? >>> >> >> +1 to introduce error states. So that those which are in error state can >> be terminated by relevant monitors. >> >> But in this case the cluster should go through active --> inActive --> >> terminating --> terminated. But due to network delay in receiving inActive >> when member fault receives, cluster monitor tries to satisfy the min rule >> by bringing one new member instead of the one got terminated. Then when >> cluster monitor receives inActive, it tries to notify parent and etc. But >> the newly spawned member got activated. then cluster monitor becomes >> activated. After that, parent monitors send terminating notification. But >> somehow this active monitor skips the terminating event. >> > > Not sure if this is a silly suggestion since I might not have understood > the scenario fully here. As I understand, the problem is that Cluster > Monitor's mincheck getting triggered before the Cluster Monitor is marked > as inactive. Since member fault is a case where we need to give the control > to the parent (if dependency flag is set), can we pause the Cluster Monitor > till the decision is taken from the parent? The Cluster Monitor can be > resumed after parent gives back the control to the child. > +1. This will be good point. I will set a flag in cluster monitor which identified as dependent upon the member fault event. So that ClusterMonitor will be paused from that point onwards... Thanks, Reka > >>>> Thanks, >>>> Reka >>>> >>>> >>>> >>>>> >>>>> On Tue, Oct 28, 2014 at 1:11 PM, Reka Thirunavukkarasu <r...@wso2.com> >>>>> wrote: >>>>> >>>>>> Hi all, >>>>>> >>>>>> This is to update the testing Developer Preview-3 for the end to end >>>>>> work flow. Since we have introduced the termination behaviour, we are >>>>>> executing the following steps to verify flow. >>>>>> >>>>>> * Deploy an composite application with nested groups >>>>>> * Autoscaler wil bring them using defined startup order >>>>>> * Application will become Active >>>>>> >>>>>> Case 1: >>>>>> >>>>>> * Terminate one cluster's VM from IaaS (where this cluster is >>>>>> *independent* from all other siblings) >>>>>> * Nothing will happen to parents >>>>>> * Cluster eventually become active. >>>>>> >>>>>> This is working fine. >>>>>> >>>>>> Case 2: >>>>>> >>>>>> * Terminate one cluster's VM from IaaS (where this cluster is >>>>>> *dependent* on some siblings) >>>>>> * It will notify the parent about inActive state >>>>>> * Parent will behave according its specified termination behaviour >>>>>> and notify its parent >>>>>> * When this notification stops where a parent has *kill-none or at >>>>>> application level, *that parent will push all the children to be >>>>>> terminated. >>>>>> * Once all the children are terminated from the sub section, that >>>>>> parent will bring them in parallel. >>>>>> >>>>>> Finalising this by identifying issues. >>>>>> >>>>>> Case 3: >>>>>> >>>>>> * Unsubscribing from application >>>>>> - all the cluster will be marked as terminated and they will >>>>>> gradually terminated.. >>>>>> - once all the clusters are terminated, parent will be terminated >>>>>> - Eventually application will be terminated and send the >>>>>> application terminated event >>>>>> - all others act upon application terminated event and remove the >>>>>> application related information from their side. >>>>>> >>>>> >>>>>> The above is working fine now.. >>>>>> >>>>>> - Metadata service will also remove app details (We are testing >>>>>> this) >>>>>> >>>>>> FYI: >>>>>> All the identified sibling to be terminated, will be terminated in >>>>>> parallel as of now. We are not maintaining any order when terminating as >>>>>> i >>>>>> explained in the earlier mail. >>>>>> >>>>>> Isuruh/Udara, can you also add, if i miss any testing steps? >>>>>> >>>>>> Thanks, >>>>>> Reka >>>>>> >>>>>> -- >>>>>> Reka Thirunavukkarasu >>>>>> Senior Software Engineer, >>>>>> WSO2, Inc.:http://wso2.com, >>>>>> Mobile: >>>>>> +94776442007 >>>>>> >>>>>> -- >>>>>> <%2B94776442007> >>>>>> Thanks and Regards, >>>>>> >>>>>> Isuru H. >>>>>> <%2B94776442007> >>>>>> +94 716 358 048 <%2B94776442007>* <http://wso2.com/>* >>>>>> >>>>>> >>>>>> * <http://wso2.com/>* >>>>>> >>>>>> >>>>>> >>>> >>>> >>>> -- >>>> Reka Thirunavukkarasu >>>> Senior Software Engineer, >>>> WSO2, Inc.:http://wso2.com, >>>> Mobile: +94776442007 >>>> >>>> -- >>>> <%2B94776442007> >>>> Thanks and Regards, >>>> >>>> Isuru H. >>>> <%2B94776442007> >>>> +94 716 358 048 <%2B94776442007>* <http://wso2.com/>* >>>> >>>> >>>> * <http://wso2.com/>* >>>> >>>> >>>> >> >> >> -- >> Reka Thirunavukkarasu >> Senior Software Engineer, >> WSO2, Inc.:http://wso2.com, >> Mobile: +94776442007 >> >> -- >> <%2B94776442007> >> Thanks and Regards, >> >> Isuru H. >> <%2B94776442007> >> +94 716 358 048 <%2B94776442007>* <http://wso2.com/>* >> >> >> * <http://wso2.com/>* >> >> >> -- Reka Thirunavukkarasu Senior Software Engineer, WSO2, Inc.:http://wso2.com, Mobile: +94776442007