[ 
https://issues.apache.org/jira/browse/CLOUDSTACK-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136380#comment-14136380
 ] 

Min Chen commented on CLOUDSTACK-7566:
--------------------------------------

Guard all potential unhandled exception in MessageBus gate.enter and gate.leave 
routine to avoid potential lock holdup. Since each API will invoke messageBus 
to publish event, any potential lock holdup will make jobs pending in the 
system and render cloud unusable.

> Many jobs getting stuck in pending state and cloud is unusable
> --------------------------------------------------------------
>
>                 Key: CLOUDSTACK-7566
>                 URL: https://issues.apache.org/jira/browse/CLOUDSTACK-7566
>             Project: CloudStack
>          Issue Type: Bug
>      Security Level: Public(Anyone can view this level - this is the 
> default.) 
>          Components: Management Server
>    Affects Versions: 4.3.0
>            Reporter: Min Chen
>            Priority: Blocker
>             Fix For: 4.5.0
>
>
> Many jobs are getting stuck with errors like:
> 2014-09-09 18:55:41,964 WARN [jobs.impl.AsyncJobMonitor] 
> (Timer-1:ctx-1e7a8a7e) Task (job-355415) has been pending for 690 seconds
> Even jobs that apparently succeed are getting the same error. Async job table 
> is not updated with complete even though job is completed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to