[jira] [Comment Edited] (HBASE-14420) Zombie Stomping Session
[ https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14985983#comment-14985983 ] stack edited comment on HBASE-14420 at 11/2/15 8:45 PM: Here are the longest running tests: {code} $ grep -h "--- > > Key: HBASE-14420 > URL: https://issues.apache.org/jira/browse/HBASE-14420 > Project: HBase > Issue Type: Umbrella > Components: test >Reporter: stack >Assignee: stack >Priority: Critical > Attachments: hangers.txt, none_fix (1).txt, none_fix.txt, > none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, > none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, > none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, > none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, > none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, none_fix.txt, > none_fix.txt, none_fix.txt > > > Patch build are now failing most of the time because we are dropping zombies. > I confirm we are doing this on non-apache build boxes too. > Left-over zombies consume resources on build boxes (OOME cannot create native > threads). Having to do multiple test runs in the hope that we can get a > non-zombie-making build or making (arbitrary) rulings that the zombies are > 'not related' is a productivity sink. And so on... > This is an umbrella issue for a zombie stomping session that started earlier > this week. Will hang sub-issues of this one. Am running builds back-to-back > on little cluster to turn out the monsters. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (HBASE-14420) Zombie Stomping Session
[ https://issues.apache.org/jira/browse/HBASE-14420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14876733#comment-14876733 ] stack edited comment on HBASE-14420 at 9/19/15 12:18 AM: - Looking at recent builds, not bad but then this on ubuntu-1 doing HBASE-14407 NotServingRegion: hbase region closed forever against branch-1.2. kalashnikov:hbase.git stack$ python ./dev-support/findHangingTests.py https://builds.apache.org/view/H-L/view/HBase/job/PreCommit-HBASE-Build/15635/consoleFull Fetching the console output from the URL Printing hanging tests Hanging test : org.apache.hadoop.hbase.security.access.TestWithDisabledAuthorization Hanging test : org.apache.hadoop.hbase.security.access.TestAccessController2 Hanging test : org.apache.hadoop.hbase.security.access.TestScanEarlyTermination Hanging test : org.apache.hadoop.hbase.master.balancer.TestStochasticLoadBalancer Printing Failing tests Failing test : org.apache.hadoop.hbase.client.TestSnapshotCloneIndependence was (Author: stack): Looking at recent builds, not bad but then this on ubuntu-1 doing HBASE-14407 NotServingRegion: hbase region closed forever kalashnikov:hbase.git stack$ python ./dev-support/findHangingTests.py https://builds.apache.org/view/H-L/view/HBase/job/PreCommit-HBASE-Build/15635/consoleFull Fetching the console output from the URL Printing hanging tests Hanging test : org.apache.hadoop.hbase.security.access.TestWithDisabledAuthorization Hanging test : org.apache.hadoop.hbase.security.access.TestAccessController2 Hanging test : org.apache.hadoop.hbase.security.access.TestScanEarlyTermination Hanging test : org.apache.hadoop.hbase.master.balancer.TestStochasticLoadBalancer Printing Failing tests Failing test : org.apache.hadoop.hbase.client.TestSnapshotCloneIndependence > Zombie Stomping Session > --- > > Key: HBASE-14420 > URL: https://issues.apache.org/jira/browse/HBASE-14420 > Project: HBase > Issue Type: Umbrella > Components: test >Reporter: stack >Assignee: stack >Priority: Critical > > Patch build are now failing most of the time because we are dropping zombies. > I confirm we are doing this on non-apache build boxes too. > Left-over zombies consume resources on build boxes (OOME cannot create native > threads). Having to do multiple test runs in the hope that we can get a > non-zombie-making build or making (arbitrary) rulings that the zombies are > 'not related' is a productivity sink. And so on... > This is an umbrella issue for a zombie stomping session that started earlier > this week. Will hang sub-issues of this one. Am running builds back-to-back > on little cluster to turn out the monsters. -- This message was sent by Atlassian JIRA (v6.3.4#6332)