[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-9590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15656573#comment-15656573
 ] 

Wei Zhou commented on CLOUDSTACK-9590:
--------------------------------------

key logs
{code}
2016-11-10 13:23:06,568 DEBUG [c.c.a.t.Request] (AgentManager-Handler-15:null) 
(logid:) Seq -1-4: Scheduling the first command  { Cmd , MgmtId: -1, via: -1, 
Ver: v1, Flags: 1, 
[{"com.cloud.agent.api.StartupRoutingCommand":{"cpuSockets":2,"cpus":8,"speed":2261,"memory":100092813312,"dom0MinMemory":1073741824,"poolSync":false,"supportsClonedVolumes":false,"caps":"hvm,snapshot","pool":"/root","hypervisorType":"KVM","hostDetails":{"Host.OS.Kernel.Version":"3.10.0-327.36.3.el7.x86_64","com.cloud.network.Networks.RouterPrivateIpStrategy":"HostLocal","Host.OS.Version":"7.2.1511","Host.OS":"CentOS"},"hostTags":[],"groupDetails":{},"type":"Routing","dataCenter":"3","pod":"3","cluster":"9","guid":"71773d35-5679-34fe-9b15-7e2736d0dd28-LibvirtComputingResource","name":"kvm02.oscloud.local","version":"4.9.0","iqn":"iqn.1994-05.com.redhat:eedee56bd952","privateIpAddress":"192.168.85.14","privateMacAddress":"52:80:f7:fc:af:42","privateNetmask":"255.255.255.0","storageIpAddress":"192.168.85.14","storageNetmask":"255.255.255.0","storageMacAddress":"52:80:f7:fc:af:42","resourceName":"LibvirtComputingResource","gatewayIpAddress":"192.168.85.254","wait":0}},{"com.cloud.agent.api.StartupStorageCommand":{"totalSize":0,"poolInfo":{"uuid":"b3b9ef96-18b1-4136-8a69-5b316c6dc123","host":"192.168.85.14","localPath":"/var/lib/libvirt/images","hostPath":"/var/lib/libvirt/images","poolType":"Filesystem","capacityBytes":30866534400,"availableBytes":29248552960},"resourceType":"STORAGE_POOL","hostDetails":{},"type":"Storage","dataCenter":"3","pod":"3","guid":"71773d35-5679-34fe-9b15-7e2736d0dd28-LibvirtComputingResource","name":"kvm02.oscloud.local","version":"4.9.0","resourceName":"LibvirtComputingResource","wait":0}}]
 }
2016-11-10 13:23:06,570 INFO  [c.c.a.m.AgentManagerImpl] 
(AgentManager-Handler-13:null) (logid:) Connection from /192.168.85.14 closed 
but no cleanup was done.
2016-11-10 13:23:06,582 DEBUG [c.c.a.m.AgentManagerImpl] 
(AgentManager-Handler-15:null) (logid:) Failed to send startupanswer: 
java.nio.channels.ClosedChannelException
2016-11-10 13:23:06,605 DEBUG [c.c.a.t.Request] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Seq -1-4: Processing 
the first command  { Cmd , MgmtId: -1, via: -1, Ver: v1, Flags: 1, 
[{"com.cloud.agent.api.StartupRoutingCommand":{"cpuSockets":2,"cpus":8,"speed":2261,"memory":100092813312,"dom0MinMemory":1073741824,"poolSync":false,"supportsClonedVolumes":false,"caps":"hvm,snapshot","pool":"/root","hypervisorType":"KVM","hostDetails":{"Host.OS.Kernel.Version":"3.10.0-327.36.3.el7.x86_64","com.cloud.network.Networks.RouterPrivateIpStrategy":"HostLocal","Host.OS.Version":"7.2.1511","Host.OS":"CentOS"},"hostTags":[],"groupDetails":{},"type":"Routing","dataCenter":"3","pod":"3","cluster":"9","guid":"71773d35-5679-34fe-9b15-7e2736d0dd28-LibvirtComputingResource","name":"kvm02.oscloud.local","version":"4.9.0","iqn":"iqn.1994-05.com.redhat:eedee56bd952","privateIpAddress":"192.168.85.14","privateMacAddress":"52:80:f7:fc:af:42","privateNetmask":"255.255.255.0","storageIpAddress":"192.168.85.14","storageNetmask":"255.255.255.0","storageMacAddress":"52:80:f7:fc:af:42","resourceName":"LibvirtComputingResource","gatewayIpAddress":"192.168.85.254","wait":0}},{"com.cloud.agent.api.StartupStorageCommand":{"totalSize":0,"poolInfo":{"uuid":"b3b9ef96-18b1-4136-8a69-5b316c6dc123","host":"192.168.85.14","localPath":"/var/lib/libvirt/images","hostPath":"/var/lib/libvirt/images","poolType":"Filesystem","capacityBytes":30866534400,"availableBytes":29248552960},"resourceType":"STORAGE_POOL","hostDetails":{},"type":"Storage","dataCenter":"3","pod":"3","guid":"71773d35-5679-34fe-9b15-7e2736d0dd28-LibvirtComputingResource","name":"kvm02.oscloud.local","version":"4.9.0","resourceName":"LibvirtComputingResource","wait":0}}]
 }
2016-11-10 13:23:06,626 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to BareMetalDiscoverer
2016-11-10 13:23:06,626 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to NetscalerElement
2016-11-10 13:23:06,626 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to HypervServerDiscoverer
2016-11-10 13:23:06,626 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to BaremetalPxeManagerImpl
2016-11-10 13:23:06,626 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to XcpServerDiscoverer
2016-11-10 13:23:06,626 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to NiciraNvp
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to BrocadeVcsElement
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to Ovm3Discoverer
2016-11-10 13:23:06,627 DEBUG [c.c.h.o.r.Ovm3Discoverer] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) 
createHostVOForConnectedAgent: Host[-51-Routing]
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to LxcServerDiscoverer
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to NetworkUsageManagerImpl
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to PremiumSecondaryStorageManagerImpl
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to Ovs
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to ConsoleProxyManagerImpl
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to OvmDiscoverer
2016-11-10 13:23:06,627 DEBUG [c.c.r.ResourceManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Dispatching resource 
state event CREATE_HOST_VO_FOR_CONNECTED to KvmServerDiscoverer
2016-11-10 13:23:06,687 DEBUG [c.c.r.ResourceState] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Resource state update: 
[id = 51; name = kvm02.oscloud.local; old state = Enabled; event = 
InternalCreated; new state = Enabled]
2016-11-10 13:23:06,688 DEBUG [c.c.h.Status] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Transition:[Resource 
state = Enabled, Agent event = AgentConnected, Host id = 51, name = 
kvm02.oscloud.local]
2016-11-10 13:23:06,705 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) create 
ClusteredAgentAttache for 51
2016-11-10 13:23:06,709 DEBUG [c.c.a.m.AgentManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Sending Connect to 
listener: XcpServerDiscoverer
2016-11-10 13:23:06,709 DEBUG [c.c.h.x.d.XcpServerDiscoverer] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Not XenServer so moving 
on.
2016-11-10 13:23:06,709 DEBUG [c.c.a.m.AgentManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Sending Connect to 
listener: HypervServerDiscoverer
2016-11-10 13:23:06,709 DEBUG [c.c.h.h.d.HypervServerDiscoverer] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Not Hyper-V hypervisor, 
so moving on.
2016-11-10 13:23:06,709 DEBUG [c.c.a.m.AgentManagerImpl] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Sending Connect to 
listener: SecurityGroupListener
2016-11-10 13:23:06,709 INFO  [c.c.n.s.SecurityGroupListener] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Received a host startup 
notification
2016-11-10 13:23:06,714 DEBUG [c.c.a.t.Request] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Seq 
51-746753113213370369: Sending  { Cmd , MgmtId: 3232257305, via: 
51(kvm02.oscloud.local), Ver: v1, Flags: 100011, 
[{"com.cloud.agent.api.CleanupNetworkRulesCmd":{"interval":2382,"wait":0}}] }
2016-11-10 13:23:06,714 INFO  [c.c.a.m.AgentAttache] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Seq 
51-746753113213370369: Unable to send due to Resource [Host:51] is unreachable: 
Host 51: Channel is closed
2016-11-10 13:23:06,714 DEBUG [c.c.a.m.AgentAttache] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Seq 
51-746753113213370369: Cancelling.
2016-11-10 13:23:06,714 DEBUG [c.c.n.s.SecurityGroupListener] 
(AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Unable to schedule 
network rules cleanup for host 51
com.cloud.exception.AgentUnavailableException: Resource [Host:51] is 
unreachable: Host 51: Channel is closed
        at 
com.cloud.agent.manager.ConnectedAgentAttache.send(ConnectedAgentAttache.java:46)
        at com.cloud.agent.manager.AgentAttache.send(AgentAttache.java:373)
        at 
com.cloud.agent.manager.ClusteredAgentAttache.send(ClusteredAgentAttache.java:141)
        at 
com.cloud.agent.manager.AgentManagerImpl.send(AgentManagerImpl.java:507)
        at 
com.cloud.network.security.SecurityGroupListener.processConnect(SecurityGroupListener.java:169)
        at 
com.cloud.agent.manager.AgentManagerImpl.notifyMonitorsOfConnection(AgentManagerImpl.java:564)
        at 
com.cloud.agent.manager.AgentManagerImpl.handleConnectedAgent(AgentManagerImpl.java:1087)
        at 
com.cloud.agent.manager.AgentManagerImpl.access$000(AgentManagerImpl.java:120)
        at 
com.cloud.agent.manager.AgentManagerImpl$HandleAgentConnectTask.runInContext(AgentManagerImpl.java:1171)
        at 
org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
        at 
org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
        at 
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
        at 
org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
        at 
org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
        at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
        at java.lang.Thread.run(Thread.java:745)
{code}

> KVM + CentOS 7.2 + Agent in Alert State for long time
> -----------------------------------------------------
>
>                 Key: CLOUDSTACK-9590
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-9590
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the 
> default.) 
>          Components: cloudstack-agent
>    Affects Versions: 4.9.0
>         Environment: entOS Linux release 7.2.1511 (Core)
> cloudstack-agent-4.9.0-1.el7.centos.x86_64
>            Reporter: Sven Vogel
>         Attachments: agent.log, cloudstack-startup.log, management-server.zip
>
>
> Hi,
> When i add a new host to cloudstack management server it take some time to 
> get host out from alert state.
> 1. i add the host and host add not possible
> 2. values are correct set to agent.properties, restart cloustack agent
> 3. agent says connected to server
> 4. management server says "alert"
> management-server.log
> 2016-11-10 13:23:06,783 DEBUG [c.c.h.Status] 
> (AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Transition:[Resource 
> state = Enabled, Agent event = AgentDisconnected, Host
> id = 51, name = kvm02.oscloud.local]
> 2016-11-10 13:23:06,798 DEBUG [c.c.a.m.ClusteredAgentManagerImpl] 
> (AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Notifying other nodes 
> of to disconnect
> 2016-11-10 13:23:06,806 DEBUG [c.c.a.m.AgentManagerImpl] 
> (AgentConnectTaskPool-49:ctx-c3b72839) (logid:5a86e1fb) Failed to handle host 
> connection: com.cloud.exception.Connection
> Exception: Unable to get an answer to the CheckNetworkCommand from agent: 51
> is there any way to speed up the alert state? is it normal that it take so 
> long?
> thanks
> Sven



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to