[ https://issues.apache.org/jira/browse/OOZIE-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14119306#comment-14119306 ]
Hadoop QA commented on OOZIE-1813: ---------------------------------- Testing JIRA OOZIE-1813 Cleaning local git workspace ---------------------------- {color:green}+1 PATCH_APPLIES{color} {color:green}+1 CLEAN{color} {color:red}-1 RAW_PATCH_ANALYSIS{color} . {color:green}+1{color} the patch does not introduce any @author tags . {color:green}+1{color} the patch does not introduce any tabs . {color:green}+1{color} the patch does not introduce any trailing spaces . {color:red}-1{color} the patch contains 2 line(s) longer than 132 characters . {color:green}+1{color} the patch does adds/modifies 1 testcase(s) {color:green}+1 RAT{color} . {color:green}+1{color} the patch does not seem to introduce new RAT warnings {color:green}+1 JAVADOC{color} . {color:green}+1{color} the patch does not seem to introduce new Javadoc warnings {color:green}+1 COMPILE{color} . {color:green}+1{color} HEAD compiles . {color:green}+1{color} patch compiles . {color:green}+1{color} the patch does not seem to introduce new javac warnings {color:green}+1 BACKWARDS_COMPATIBILITY{color} . {color:green}+1{color} the patch does not change any JPA Entity/Colum/Basic/Lob/Transient annotations . {color:green}+1{color} the patch does not modify JPA files {color:red}-1 TESTS{color} . Tests run: 1515 . Tests failed: 3 . Tests errors: 0 . The patch failed the following testcases: . testBundleStatusTransitServiceKilled2(org.apache.oozie.service.TestStatusTransitService) . testBundleStatusTransitRunningFromKilled(org.apache.oozie.service.TestStatusTransitService) . testUnpauseBundleAndCoordinator(org.apache.oozie.service.TestPauseTransitService) {color:green}+1 DISTRO{color} . {color:green}+1{color} distro tarball builds with the patch ---------------------------- {color:red}*-1 Overall result, please check the reported -1(s)*{color} The full output of the test-patch run is available at . https://builds.apache.org/job/oozie-trunk-precommit-build/1850/ > Add service to report/kill rogue bundles and coordinator jobs > ------------------------------------------------------------- > > Key: OOZIE-1813 > URL: https://issues.apache.org/jira/browse/OOZIE-1813 > Project: Oozie > Issue Type: Bug > Reporter: Purshotam Shah > Assignee: Purshotam Shah > Attachments: OOZIE-1813-V2.patch, OOZIE-1813-V3.patch, > OOZIE-1813-V4.patch, OOZIE-1813-V5.patch, OOZIE-1813-V6.patch, > OOZIE-1813-V7.patch > > > People leave their test coordinator and bundle jobs without ever killing them > and they just eat up resources heavily. We should have a service which > periodically check for abandoned coords and report/kill them. > We can add multiple logic to this like ( number of consecutive > failed/timedout action, total number of failed/timedout action). > To start with if number of coord action with failed/timedout status > defined > value, then coord is considered to be rogue. -- This message was sent by Atlassian JIRA (v6.3.4#6332)