[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15039695#comment-15039695 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #529 (See [https://builds.apache.org/job/HBase-Trunk_matrix/529/])
HBASE-14772 Improve zombie detector; be more discerning; part2; (stack: rev 5e430837d3e4a7d159e84964357297c8ab42430d)
* dev-support/test-patch.sh
* dev-support/zombie-detector.sh
HBASE-14772 Improve zombie detector; be more discerning; part2; (stack: rev 7117a2e35d42ef4e3f17b0a8f891fc5200cd0890)
* dev-support/zombie-detector.sh

> Improve zombie detector; be more discerning
> ---
>
> Key: HBASE-14772
> URL: https://issues.apache.org/jira/browse/HBASE-14772
> Project: HBase
> Issue Type: Sub-task
> Components: test
> Reporter: stack
> Assignee: stack
> Fix For: 2.0.0
>
> Attachments: 14772v3.patch, zombie.patch, zombiev2.patch
>
>
> Currently, any surefire process with the hbase flag is a potential zombie.
> Our zombie check currently takes a reading and, if it finds candidate
> zombies, waits 30 seconds and then takes another reading. If a concurrent
> build is going on, the zombie detector will come up positive in both cases
> even though the adjacent test run may be making progress; i.e. the cast of
> surefire processes may have changed between readings, but our detector just
> sees the presence of hbase surefire processes.
> Here is an example:
> {code}
> Suspicious java process found - waiting 30s to see if there are just slow to stop
> There appear to be 5 zombie tests, they should have been killed by surefire but survived
> 12823 surefirebooter852180186418035480.jar -enableassertions -Dhbase.test -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 7653 surefirebooter8579074445899448699.jar -enableassertions -Dhbase.test -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 12614 surefirebooter136529596936417090.jar -enableassertions -Dhbase.test -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 7836 surefirebooter3217047564606450448.jar -enableassertions -Dhbase.test -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 13566 surefirebooter2084039411151963494.jar -enableassertions -Dhbase.test -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> BEGIN zombies jstack extract
> END zombies jstack extract
> {code}
> 5 is the number of forked processes we allow when doing medium and large
> tests, so an adjacent build will always show as '5 zombies'.
> Need to discern whether the list of processes changes between readings.
> Can I also add a tag per build run that all forked processes pick up so I
> can look at the current build's progeny only?

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
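The "compare the two readings" idea from the issue description above can be sketched in shell. This is only an illustration, not the code in dev-support/zombie-detector.sh: the function name `persistent_pids` and the two sample readings are made up, and each reading is assumed to be a whitespace-separated list of numeric PIDs. Only processes present in both readings remain zombie candidates; if the cast of surefire processes changed during the wait, the result is empty.

```shell
#!/bin/sh
# Hypothetical helper (not from the HBase scripts): print the PIDs that
# appear in BOTH readings, one per line.
# $1 = first reading, $2 = second reading (whitespace-separated PIDs).
persistent_pids() {
  {
    # $1 and $2 are deliberately unquoted so the shell splits them into
    # one PID per printf argument; sed drops the blank line printf emits
    # when a reading is empty.
    printf '%s\n' $1 | sed '/^$/d' | sort -u
    printf '%s\n' $2 | sed '/^$/d' | sort -u
  } | sort | uniq -d   # a PID listed twice overall was in both readings
}

# Made-up readings taken 30 seconds apart: only 12823 and 7653 survive.
first_reading="12823 7653 12614"
second_reading="7653 12823 13566"
persistent_pids "$first_reading" "$second_reading"
```

An empty result would mean that every process seen in the first reading had exited by the second one, which is what a healthy adjacent build looks like.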
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037629#comment-15037629 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #527 (See [https://builds.apache.org/job/HBase-Trunk_matrix/527/])
HBASE-14772 Improve zombie detector; be more discerning; part2; (stack: rev 69658ea4a916c8ea5e6dd7d056a548e8dce4e96d)
* dev-support/test-patch.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037492#comment-15037492 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #526 (See [https://builds.apache.org/job/HBase-Trunk_matrix/526/])
HBASE-14772 Improve zombie detector; be more discerning; part2 (stack: rev cf8d3bd641ef9f69dabecec1b9e87272493fe825)
* dev-support/zombie-detector.sh
* dev-support/test-patch.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15038816#comment-15038816 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #528 (See [https://builds.apache.org/job/HBase-Trunk_matrix/528/])
HBASE-14772 Improve zombie detector; be more discerning; part2; addendum (stack: rev a154ecda00d9d9a58e83d322dae7ffd3518b633c)
* dev-support/test-patch.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14997942#comment-14997942 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #452 (See [https://builds.apache.org/job/HBase-Trunk_matrix/452/])
HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM fix (stack: rev 44367f55e8bbd252ae824d1ddb626b5ce91fe75d)
* dev-support/zombie-detector.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14996226#comment-14996226 ]

Hudson commented on HBASE-14772:

SUCCESS: Integrated in HBase-Trunk_matrix #449 (See [https://builds.apache.org/job/HBase-Trunk_matrix/449/])
HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM Add (stack: rev 4c2e0d95dc62ee42b3da820d751a11eb52ce0069)
* dev-support/zombie-detector.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995863#comment-14995863 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #448 (See [https://builds.apache.org/job/HBase-Trunk_matrix/448/])
HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM some (stack: rev 1cbcf1175e6ce497936f12c60fb2e897833ace39)
* dev-support/test-patch.sh
* dev-support/zombie-detector.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995499#comment-14995499 ]

Hudson commented on HBASE-14772:

SUCCESS: Integrated in HBase-Trunk_matrix #446 (See [https://builds.apache.org/job/HBase-Trunk_matrix/446/])
HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM set (stack: rev 305ecaf224340b0f6e248d4fdabf0b53e1cd3b03)
* dev-support/zombie-detector.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14994944#comment-14994944 ]

stack commented on HBASE-14772:
---

I pushed v2. This is part 1 of a few pushes, I'd say, till I get it right. Let me hook it up now in Jenkins to run on the trunk build. Will make for some messy output for a while ... Will clean up in subsequent patches.
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995087#comment-14995087 ]

Hudson commented on HBASE-14772:

FAILURE: Integrated in HBase-Trunk_matrix #442 (See [https://builds.apache.org/job/HBase-Trunk_matrix/442/])
HBASE-14772 Improve zombie detector; be more discerning (stack: rev bea2f7feacd1a34d27ee17c201aaeacc32e8cdaf)
* pom.xml
* dev-support/zombie-detector.sh
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14993466#comment-14993466 ]

Hadoop QA commented on HBASE-14772:
---

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12770968/zombiev2.patch
against master branch at commit bfa36891901b96b95d82f5307642c35fd2b9f534.
ATTACHMENT ID: 12770968

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:green}+1 tests included{color}. The patch appears to include 1 new or modified tests.
{color:green}+1 hadoop versions{color}. The patch compiles with all supported hadoop versions (2.4.0 2.4.1 2.5.0 2.5.1 2.5.2 2.6.0 2.6.1 2.7.0 2.7.1)
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 protoc{color}. The applied patch does not increase the total number of protoc compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings.
{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.
{color:red}-1 lineLengths{color}. The patch introduces the following lines longer than 100:
+echo "Found ${ZOMBIE_TESTS_COUNT} suspicious java process(es); waiting ${wait}s to see if just slow to stop"
+ {color:red}-1 core zombie tests{color}. There are ${ZOMBIE_TESTS_COUNT} possible zombie test(s): ${ZB_STACK}"
+

> Key: HBASE-14772
> URL: https://issues.apache.org/jira/browse/HBASE-14772
> Project: HBase
> Issue Type: Sub-task
> Components: test
> Reporter: stack
> Assignee: stack
> Attachments: zombie.patch, zombiev2.patch
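The "list of processes changes between readings" check asked for in the issue description could look roughly like the sketch below. This is not the code in zombiev2.patch; the function names `candidate_pids` and `persistent_pids` are made up for illustration, and the input is assumed to be a process listing in the same shape as the example output in the issue (PID first, then the surefirebooter jar and JVM flags). Only PIDs seen in both readings are treated as zombie candidates, so an adjacent build whose forked processes turn over between readings no longer counts.

```shell
# Illustrative sketch only; names are hypothetical, not from the patch.

# Read a process listing on stdin and print the PID of every line that
# looks like an HBase surefire fork (surefirebooter jar plus -Dhbase.test).
candidate_pids() {
  awk '/surefirebooter/ && /-Dhbase\.test/ { print $1 }'
}

# Print the PIDs that appear in BOTH readings.
# $1: first reading, $2: second reading (whitespace-separated PIDs).
persistent_pids() {
  for pid in $1; do
    for other in $2; do
      if [ "$pid" = "$other" ]; then
        echo "$pid"
        break
      fi
    done
  done
}

# Possible use (assumes the listing comes from `jps -v`):
#   first=$(jps -v | candidate_pids)
#   sleep 30
#   second=$(jps -v | candidate_pids)
#   zombies=$(persistent_pids "$first" "$second")
```

A process that survives both readings may still be a slow test from another build, so this narrows the false positives rather than removing them.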
[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning
[ https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14992223#comment-14992223 ]

Sean Busbey commented on HBASE-14772:
-

Yeah, a UUID or some such specific to the current build would be perfect. On Jenkins at least, there's ${BUILD_ID} that we can use.

> Improve zombie detector; be more discerning
> ---
>
> Key: HBASE-14772
> URL: https://issues.apache.org/jira/browse/HBASE-14772
> Project: HBase
> Issue Type: Sub-task
> Components: test
> Reporter: stack
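A rough sketch of the per-build tag idea, under stated assumptions: the property name `hbase.build.id` and the function `own_candidate_pids` are hypothetical, and getting the flag onto the forked JVMs (for example through the surefire argLine) is not shown. Given a process listing on stdin, it keeps only the surefire forks that carry the current build's tag, so the detector looks at its own progeny and ignores adjacent builds.

```shell
# Illustrative sketch only; -Dhbase.build.id is a made-up property name.

# Read a process listing on stdin and print the PIDs of surefire forks
# tagged with the given build id.
# $1: build id, e.g. "$BUILD_ID" when running under Jenkins.
own_candidate_pids() {
  awk -v tag="-Dhbase.build.id=$1" '
    /surefirebooter/ {
      # Compare whole fields so that id 52 does not match id 529.
      for (i = 2; i <= NF; i++) {
        if ($i == tag) { print $1; break }
      }
    }'
}

# Possible use (assumes the listing comes from `jps -v`):
#   jps -v | own_candidate_pids "${BUILD_ID}"
```

Matching on the whole flag rather than a substring matters because Jenkins build ids from different runs can share a prefix.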