[ https://issues.apache.org/jira/browse/KYLIN-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17172237#comment-17172237 ]
Gabor Arki commented on KYLIN-4348: ----------------------------------- Altogether we have 10 running jobs in the cluster which show no progress: * 169f75fa-a02f-221b-fc48-037bc7a842d0 * 0b5dae1b-6faf-66c5-71dc-86f5b820f1c4 * 00924699-8b51-8091-6e71-34ccfeba3a98 * 4620192a-71e1-16dd-3b05-44d7f9144ad4 * 416355c2-a3d7-57eb-55c6-c042aa256510 * 12750aea-3b96-c817-64e8-bf893d8c120f * 42819dde-5857-fd6b-b075-439952f47140 * 00128937-bd4a-d6c1-7a4e-744dee946f67 * 46a0233f-217e-9155-725b-c815ad77ba2c * 062150ba-bacd-6644-4801-3a51b260d1c5 However, the ones possessing the locks are all pending: * f888380e-9ff4-98f5-2df4-1ae71e045f93 * fc186bd9-1186-6ed4-e58c-bbbf6dd8ef74 * d1a6475a-9ab2-5ee4-6714-f395e20cfc01 So, essentially the jobs that are running cannot actually run because they are unable to acquire a lock. However, the ones that possess the lock cannot continue because there are already 10 running jobs. This seems to be a deadlock to me. > Fix distributed concurrency lock bug > ------------------------------------ > > Key: KYLIN-4348 > URL: https://issues.apache.org/jira/browse/KYLIN-4348 > Project: Kylin > Issue Type: Sub-task > Reporter: wangxiaojing > Assignee: wangxiaojing > Priority: Major > Fix For: v3.1.0 > > Attachments: image-2020-02-03-10-54-21-976.png, > image-2020-02-03-10-54-53-468.png > > -- This message was sent by Atlassian Jira (v8.3.4#803005)