Hi Martin,

The job of the CC is to spawn/terminate instances. AS is the one who
decides when/what to start and when to terminate. So as Nirmal said have a
look at the AS logs in order to find the reason for termination/spawning.


On Thu, Jul 17, 2014 at 5:09 AM, Nirmal Fernando (JIRA) <j...@apache.org>
wrote:

>
>     [
> https://issues.apache.org/jira/browse/STRATOS-706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14064337#comment-14064337
> ]
>
> Nirmal Fernando commented on STRATOS-706:
> -----------------------------------------
>
> On Thu, Jul 17, 2014 at 1:11 AM, Martin Eppel (JIRA) <j...@apache.org>
>
>
> All the log file you quoted is from Cloud Controller. And what CC does is
> providing an API to terminate instances. The caller of this API, i.e.
> auto-scaler is the one who logs the reason for calling CC to terminate
> instances. Did you check auto-scaler logs?
>
>
>
>
> --
> Best Regards,
> Nirmal
>
> Nirmal Fernando.
> PPMC Member & Committer of Apache Stratos,
> Senior Software Engineer, WSO2 Inc.
>
> Blog: http://nirmalfdo.blogspot.com/
>
>
> > member terminate event should log reason
> > ----------------------------------------
> >
> >                 Key: STRATOS-706
> >                 URL: https://issues.apache.org/jira/browse/STRATOS-706
> >             Project: Stratos
> >          Issue Type: Bug
> >          Components: Autoscaler
> >    Affects Versions: 4.0.0
> >            Reporter: Martin Eppel
> >             Fix For: 4.0.1
> >
> >
> > When Stratos terminates a member it must log the reason for it. Ideally
> the logging should be systematic enough so that one can grep for different
> severity, or by member, or by event type or some other useful
> categorization.
> > The justification for this defect is that it will improve greatly
> debugging and troubleshooting capabilities. Without logging it is very
> difficult to debug terminations of members.
> >
> > For example, consider this sequence in the stratos log file:
> >
> > ===================
> > TID: [0] [STRATOS] [2014-07-15 09:58:48,654] DEBUG
> {org.apache.stratos.cloud.controller.impl.CloudControllerServiceImpl} -
>  Received an instance spawn request : MemberContext [memberId=null,
> nodeId=null, clusterId=cisco-gilan-appmgr-1.cisco-gil, cartridgeType=null,
> privateIpAddress=null, publicIpAddress=null, allocatedIpAddress=null,
> initTime=1405418328649, lbClusterId=null, networkPartitionId=OAM1]
> {org.apache.stratos.cloud.controller.impl.CloudControllerServiceImpl}
> > TID: [0] [STRATOS] [2014-07-15 09:58:48,654] DEBUG
> {org.apache.stratos.cloud.controller.impl.CloudControllerServiceImpl} -
>  Payload: SERVICE_NAME=cisco-gilan-appmgr,HOST_NAME=
> cisco-gilan-appmgr-1.qmog.cisco.com
> ,MULTITENANT=false,TENANT_ID=-1234,TENANT_RANGE=-1234,CARTRIDGE_ALIAS=cisco-gilan-appmgr-1,CLUSTER_ID=cisco-gilan-appmgr-1.cisco-gil,CARTRIDGE_KEY=o1jbiPPmPWBgyNVM,DEPLOYMENT=default,REPO_URL=null,PORTS=9482,PUPPET_IP=PUPPET_IP,PUPPET_HOSTNAME=PUPPET_HOSTNAME,PUPPET_ENV=PUPPET_ENV,HEARTBEAT_AUTHKEY=20c9629a87f53ecdb5278d2ddb5a9d42,TRUSTSTORE_PASSWORD=wso2carbon,CEP_PORT=7611,MONITORING_SERVER_SECURE_PORT=0,MB_PORT=61616,OPENSTACK_COMPUTE_DNS=10.58.10.82,MB_IP=
> octl-01.qmog.cisco.com,QSB_PUPPET_ENVIR=,CEP_IP=octl-01.qmog.cisco.com
> ,VSM_USER=admin,VEM_IP=192.168.66.43,ENABLE_DATA_PUBLISHER=false,MONITORING_SERVER_ADMIN_PASSWORD=xxxx,MONITORING_SERVER_IP=
> octl-01.qmog.cisco.com
> ,VEM_USER=ubuntu,VEM_PWD=ubuntu,COMMIT_ENABLED=false,MONITORING_SERVER_ADMIN_USERNAME=xxxx,CERT_TRUSTSTORE=/opt/apache-stratos-cartridge-agent/security/client-truststore.jks,VSM_PWD=Starent123!,VSM_IP=192.168.66.2,MONITORING_SERVER_PORT=0,APPMGR_GITREPO=ssh://
> jenapper@10.58.10.189/home/jenapper/code/eccentrica.git,MEMBER_ID=cisco-gilan-appmgr-1.cisco-gil7ef7327f-2bb2-4768-820f-d064de29aa59,LB_CLUSTER_ID=null,NETWORK_PARTITION_ID=OAM1,PARTITION_ID=RegionOne-AZ-1
> {org.apache.stratos.cloud.controller.impl.CloudControllerServiceImpl}
> > TID: [0] [STRATOS] [2014-07-15 09:58:55,888]  INFO
> {org.apache.stratos.cloud.controller.impl.CloudControllerServiceImpl} -
>  Member is terminated: MemberContext
> [memberId=cisco-gilan-appmgr-1.cisco-gil407f5bdc-aad2-4234-80fc-6cdf17be6192,
> nodeId=RegionOne/89433818-21ed-48d4-bd8f-c396ab30f6d2,
> clusterId=cisco-gilan-appmgr-1.cisco-gil, cartridgeType=cisco-gilan-appmgr,
> privateIpAddress=192.168.66.1, publicIpAddress=null,
> allocatedIpAddress=null, initTime=1405417410736, lbClusterId=null,
> networkPartitionId=OAM1]
> {org.apache.stratos.cloud.controller.impl.CloudControllerServiceImpl}
> > ===================
> >
> > The problem is that Stratos gives no indication of why it is doing this
> [1]. Stratos should be enhanced so that the above message gives some
> indication of *why* the member is being terminated (loss of heartbeats,
> timeout on port knocking etc. etc.). This is needed as apache stratos
> expands it's user base.
> > This issue has high priority as it affects the efficiency of
> troubleshooting and system stability.
> >
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>



-- 

Udara Liyanage
Software Engineer
WSO2, Inc.: http://wso2.com
lean. enterprise. middleware

web: http://udaraliyanage.wordpress.com
phone: +94 71 443 6897

Reply via email to