Re: [Architecture] Horizontal autoscaling of containers in AppCloud

Deependra Ariyadewa Wed, 02 Nov 2016 09:52:56 -0700

On Wed, Nov 2, 2016 at 9:48 PM, Dmitry Sotnikov <dmi...@wso2.com> wrote:


> Wouldn't the minimum be set to 0 in a true serverless experience?
>

Yes, in the initial stage the replica count should be 0, but in the running
state we need two copies of each instance to tolerate server failures.

>
> On Wed, Nov 2, 2016 at 9:16 AM, Deependra Ariyadewa <d...@wso2.com> wrote:
>
>>
>>
>> On Wed, Nov 2, 2016 at 1:04 PM, Kasun De Silva <kas...@wso2.com> wrote:
>>
>>> Hi All,
>>>
>>> For the past couple of days I have been working on a POC for the $subject.
>>>
>>> The following is the current model that AppCloud uses to create
>>> applications / services. For each application / service, AppCloud creates
>>> a *Deployment >> ReplicaSet >> Pod* in the Kubernetes cluster. At this
>>> point each ReplicaSet created has only one replica, which is a Pod. This
>>> deployment does not scale up or down under any circumstances.
>>>
>>> This effort is to bring up a new feature to autoscale the application /
>>> service deployed in AppCloud according to certain metrics.
>>>
>>> As before, AppCloud will create a *Deployment >> RS >> Pod* in
>>> Kubernetes for each application / service, as in the scenario above.
>>>
>>> With the new feature, the user can set autoscaling parameters at the
>>> application settings level. We are using Kubernetes v1.3.4, which by
>>> default accepts the following parameters for the *Horizontal Pod
>>> Autoscaler* (HPA).
>>>
>>>    1. Minimum # of pods
>>>    2. Maximum # of pods
>>>    3. Target CPU Utilization
>>>
>>>
>>> The autoscaling logic is as follows:
>>>
>>> TargetNumOfPods = ceil(sum(CurrentPodsCPUUtilization) / TargetCPUUtilization)
>>>
>>> The target number of pods is bounded by the values provided for MinPods
>>> and MaxPods:
>>>
>>> MinPods <= TargetNumOfPods <= MaxPods
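
To make the rule quoted above concrete, here is a minimal Python sketch of it.
It only restates the formula and the MinPods / MaxPods bound from the mail;
the function and argument names are made up for illustration, and it is not
the actual Kubernetes controller code.

```python
import math


def target_num_of_pods(current_pods_cpu_utilization, target_cpu_utilization,
                       min_pods, max_pods):
    """Restate the quoted HPA rule (hypothetical helper, not Kubernetes code).

    current_pods_cpu_utilization: per-pod CPU utilization, in percent.
    target_cpu_utilization: target CPU utilization per pod, in percent.
    """
    if target_cpu_utilization <= 0:
        raise ValueError("target CPU utilization must be positive")
    if min_pods > max_pods:
        raise ValueError("min_pods must not exceed max_pods")

    # TargetNumOfPods = ceil(sum(CurrentPodsCPUUtilization) / TargetCPUUtilization)
    desired = math.ceil(sum(current_pods_cpu_utilization) / target_cpu_utilization)

    # Enforce MinPods <= TargetNumOfPods <= MaxPods
    return max(min_pods, min(desired, max_pods))


if __name__ == "__main__":
    # Three pods at 90%, 80% and 70% with a 50% target: ceil(240 / 50) = 5
    print(target_num_of_pods([90, 80, 70], 50, min_pods=1, max_pods=10))
```

With a minimum of 2, as suggested later in this thread, a single idle pod
would still resolve to 2 replicas because the lower bound wins.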
>>>
>>> When the user sets the autoscaling parameters, AppCloud will create a
>>> Kubernetes HPA kind for that particular application / service version.
>>> (This is on demand; we do not create an HPA kind for every application /
>>> service version that the user creates.) The HPA for that version then
>>> takes responsibility for scaling the pods up and down according to the
>>> given threshold. Following will be the model that we are using in
>>> Kubernetes with the autoscaling feature in AppCloud.
>>>
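
For illustration, a rough Python sketch of what the per-version HPA object
described above could look like as a manifest. The helper name, the "-hpa"
naming scheme and the namespace default are assumptions made for this
example, not AppCloud code. The field names follow the autoscaling/v1
HorizontalPodAutoscaler schema as I understand it and should be checked
against the API version of the cluster in use.

```python
import json


def build_hpa_manifest(deployment_name, min_pods, max_pods,
                       target_cpu_percent, namespace="default"):
    """Build an HPA manifest as a plain dict (hypothetical helper)."""
    if min_pods > max_pods:
        raise ValueError("min_pods must not exceed max_pods")

    return {
        "apiVersion": "autoscaling/v1",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {
            # Assumed naming scheme: one HPA per application version.
            "name": deployment_name + "-hpa",
            "namespace": namespace,
        },
        "spec": {
            # Points the HPA at the Deployment created for this version.
            # The Deployment apiVersion here is an assumption for a
            # v1.3-era cluster.
            "scaleTargetRef": {
                "apiVersion": "extensions/v1beta1",
                "kind": "Deployment",
                "name": deployment_name,
            },
            "minReplicas": min_pods,                               # 1. Minimum # of pods
            "maxReplicas": max_pods,                               # 2. Maximum # of pods
            "targetCPUUtilizationPercentage": target_cpu_percent,  # 3. Target CPU
        },
    }


if __name__ == "__main__":
    # "my-app-v1" is a made-up deployment name.
    print(json.dumps(build_hpa_manifest("my-app-v1", 2, 10, 50), indent=2))
```

The resulting JSON could then be submitted to the cluster by whatever client
AppCloud already uses; that step is left out here on purpose.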
>>
>> In the serverless paradigm the user should get a simple application
>> creation experience, so making autoscaling the default behavior is a
>> good option. We also have to set the minimum pod count to 2 to survive
>> node failures.
>>
>>>
>>> Your thoughts and comments are welcome on this.
>>>
>>> Thanks,
>>> Kasun
>>> 
>>> --
>>> *Kasun de Silva*
>>> Senior Software Engineer | Cloud TG
>>>
>>> WSO2 Inc <http://wso2.com> | Email: kas...@wso2.com | Mobile: +94 77 794 4260
>>>
>>>
>>
>>
>> --
>> Deependra Ariyadewa
>> WSO2, Inc. http://wso2.com/ http://wso2.org
>>
>> email d...@wso2.com; cell +94 71 403 5996 ;
>> Blog http://risenfall.wordpress.com/
>> PGP info: KeyID: 'DC627E6F'
>>
>> *WSO2 - Lean . Enterprise . Middleware*
>>
>
>
>
> --
> Dmitry Sotnikov
> VP of Cloud; WSO2, Inc.;  http://wso2.com/
> email: dmi...@wso2.com; cell: +1.949.303.9653; Skype: DSotnikov
> Lean . Enterprise . Middleware
>
> <http://wso2.com/signature>
>



-- 
Deependra Ariyadewa
WSO2, Inc. http://wso2.com/ http://wso2.org

email d...@wso2.com; cell +94 71 403 5996 ;
Blog http://risenfall.wordpress.com/
PGP info: KeyID: 'DC627E6F'

*WSO2 - Lean . Enterprise . Middleware*

_______________________________________________
Architecture mailing list
Architecture@wso2.org
https://mail.wso2.org/cgi-bin/mailman/listinfo/architecture
