[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kelven Yang resolved CLOUDSTACK-4288.
-------------------------------------

    Resolution: Fixed
    
> Management server is hanging quite often and in indefinite time intervals.  
> ----------------------------------------------------------------------------
>
>                 Key: CLOUDSTACK-4288
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-4288
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public (Anyone can view this level - this is the default.)
>          Components: Install and Setup
>    Affects Versions: 4.2.0
>            Reporter: Kiran Koneti
>            Assignee: Kelven Yang
>            Priority: Blocker
>             Fix For: 4.2.0
>
>         Attachments: catalina.zip, management-server.zip
>
>
> I created an Advanced Zone setup using the latest rhel63 399 build, which
> was generated around 12:08 PM IST. The management server hangs quite often
> for a few minutes and then recovers on its own.
> During the hang, all CS operations halt, and even the management server
> logs stop. Once the server resumes, the hosts go into the Alert state and
> come back up later.
> This is observed quite often. When I took a thread dump, it showed the
> messages below:
> "SecGrp-Worker-1" prio=10 tid=0x00007f86bc1de000 nid=0x28b waiting on condition [0x00007f86b7cfb000]
>    java.lang.Thread.State: WAITING (parking)
>         at sun.misc.Unsafe.park(Native Method)
>         - parking to wait for  <0x000000077b193618> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
>         at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
>         at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2043)
>         at com.cloud.network.security.LocalSecurityGroupWorkQueue.getWork(LocalSecurityGroupWorkQueue.java:152)
>         at com.cloud.network.security.SecurityGroupManagerImpl2.work(SecurityGroupManagerImpl2.java:136)
>         at com.cloud.network.security.SecurityGroupManagerImpl2$WorkerThread.run(SecurityGroupManagerImpl2.java:71)
> "SecGrp-Worker-0" prio=10 tid=0x00007f86bc1dc000 nid=0x28a waiting on condition [0x00007f86b7dfc000]
>    java.lang.Thread.State: WAITING (parking)
>         at sun.misc.Unsafe.park(Native Method)
>         - parking to wait for  <0x000000077b193618> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
>         at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
>         at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2043)
>         at com.cloud.network.security.LocalSecurityGroupWorkQueue.getWork(LocalSecurityGroupWorkQueue.java:152)
>         at com.cloud.network.security.SecurityGroupManagerImpl2.work(SecurityGroupManagerImpl2.java:136)
>         at com.cloud.network.security.SecurityGroupManagerImpl2$WorkerThread.run(SecurityGroupManagerImpl2.java:71)
> "HA-2" prio=10 tid=0x00007f86bc1da000 nid=0x289 waiting on condition [0x00007f86b7efd000]
>    java.lang.Thread.State: TIMED_WAITING (parking)
>         at sun.misc.Unsafe.park(Native Method)
>         - parking to wait for  <0x000000077dc218b0> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)
>         at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226)
>         at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2081)
>         at java.util.concurrent.DelayQueue.take(DelayQueue.java:193)
>         at java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:688)
>         at java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:681)
>         at java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1043)
> I am attaching catalina.out as well as the management server logs.
> This issue is observed in two different setups: the rhel63 build in my
> environment and the rhel62 environment that Manasa is using.
> During the hang, top shows the CPU% dropping to very low values.
>  
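
The SecGrp-Worker threads in the dump are parked in ConditionObject.await()
(WAITING), and HA-2 is parked in ConditionObject.awaitNanos() (TIMED_WAITING).
The following is a minimal Java sketch, not taken from CloudStack, of how a
thread waiting on a java.util.concurrent Condition comes to report those two
states. The class and method names (ParkedStates, observe) and the 30-second
timeout are hypothetical and chosen only for illustration.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical illustration class; not part of CloudStack.
class ParkedStates {

    // Starts a worker that waits on a Condition nobody signals and returns
    // the Thread.State the worker reports once it has parked.
    static Thread.State observe(boolean timed) throws InterruptedException {
        ReentrantLock lock = new ReentrantLock();
        Condition hasWork = lock.newCondition();

        Thread worker = new Thread(() -> {
            lock.lock();
            try {
                if (timed) {
                    // Same call family as HA-2: DelayQueue.take -> awaitNanos
                    hasWork.awaitNanos(TimeUnit.SECONDS.toNanos(30));
                } else {
                    // Same call as SecGrp-Worker: getWork -> await
                    hasWork.await();
                }
            } catch (InterruptedException e) {
                // Interrupted by the observer below; just exit.
            } finally {
                lock.unlock();
            }
        });
        worker.setDaemon(true);
        worker.start();

        // Poll until the worker has parked, giving up after 5 seconds.
        Thread.State expected =
                timed ? Thread.State.TIMED_WAITING : Thread.State.WAITING;
        long deadline = System.nanoTime() + TimeUnit.SECONDS.toNanos(5);
        Thread.State state = worker.getState();
        while (state != expected && System.nanoTime() < deadline) {
            Thread.sleep(10);
            state = worker.getState();
        }

        worker.interrupt();   // release the worker so the sketch terminates
        worker.join();
        return state;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("await():      " + observe(false));
        System.out.println("awaitNanos(): " + observe(true));
    }
}
```

Pool threads with no work to do normally look like this, so these frames on
their own may not identify what stalled the server; the attached logs would
need to be checked as well.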

