[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-12-03 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15039695#comment-15039695
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #529 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/529/])
 HBASE-14772 Improve zombie detector; be more discerning; part2; (stack: rev 
5e430837d3e4a7d159e84964357297c8ab42430d)
* dev-support/test-patch.sh
* dev-support/zombie-detector.sh
 HBASE-14772 Improve zombie detector; be more discerning; part2; (stack: rev 
7117a2e35d42ef4e3f17b0a8f891fc5200cd0890)
* dev-support/zombie-detector.sh


> Improve zombie detector; be more discerning
> ---
>
> Key: HBASE-14772
> URL: https://issues.apache.org/jira/browse/HBASE-14772
> Project: HBase
> Issue Type: Sub-task
> Components: test
> Reporter: stack
> Assignee: stack
> Fix For: 2.0.0
>
> Attachments: 14772v3.patch, zombie.patch, zombiev2.patch
>
>
> Currently, any surefire process with the hbase flag is a potential zombie. 
> Our zombie check takes a reading and, if it finds candidate zombies, waits 
> 30 seconds and then takes another reading. If a concurrent build is going 
> on, the zombie detector will come up positive in both readings even though 
> the adjacent test run may be making progress; i.e. the cast of surefire 
> processes may have changed between readings, but our detector only sees the 
> presence of hbase surefire processes.
> Here is an example:
> {code}
> Suspicious java process found - waiting 30s to see if there are just slow to 
> stop
> There appear to be 5 zombie tests, they should have been killed by surefire 
> but survived
> 12823 surefirebooter852180186418035480.jar -enableassertions -Dhbase.test 
> -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom 
> -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 7653 surefirebooter8579074445899448699.jar -enableassertions -Dhbase.test 
> -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom 
> -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 12614 surefirebooter136529596936417090.jar -enableassertions -Dhbase.test 
> -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom 
> -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 7836 surefirebooter3217047564606450448.jar -enableassertions -Dhbase.test 
> -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom 
> -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
> 13566 surefirebooter2084039411151963494.jar -enableassertions -Dhbase.test 
> -Xmx2800m -XX:MaxPermSize=256m -Djava.security.egd=file:/dev/./urandom 
> -Djava.net.preferIPv4Stack=true -Djava.awt.headless=true
>  BEGIN zombies jstack extract
>  END  zombies jstack extract
> {code}
> 5 is the number of forked processes we allow when doing medium and large 
> tests, so an adjacent build will always show as '5 zombies'.
> The detector needs to discern whether the list of processes changes between 
> readings.
> Can I also add a tag per build run that all forked processes pick up, so I 
> can look at the current build's progeny only?
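
The two ideas in the description above (compare the process list across
readings, and tag each build's forked processes) can be sketched in POSIX
shell as below. This is an illustration only, not the contents of
dev-support/zombie-detector.sh. The function names candidate_pids,
persistent_pids and tagged_only are made up for this sketch, and
-Dzombie.build.tag is a hypothetical system property that a build would have
to pass to its forked JVMs. The input is assumed to be a listing with one
"PID command-line" entry per line, the same shape as the example output above.

```shell
#!/bin/sh
# Sketch only. All names here are hypothetical and are not taken from
# dev-support/zombie-detector.sh.

# Read a process listing on stdin ("PID command-line" per line) and print
# the PIDs of entries that carry the hbase test flag, sorted and de-duplicated.
candidate_pids() {
  awk '/-Dhbase\.test/ { print $1 }' | sort -u
}

# Print the PIDs that occur in BOTH readings.
# $1 and $2 are newline-separated PID lists (either may be empty).
# Each list is de-duplicated first, so a PID printed by "uniq -d"
# must have come from both lists.
persistent_pids() {
  {
    printf '%s\n' "$1" | sort -u
    printf '%s\n' "$2" | sort -u
  } | sed '/^$/d' | sort | uniq -d
}

# Read a process listing on stdin and keep only the entries whose command
# line contains the exact argument -Dzombie.build.tag=<tag>.
# $1 is the tag of the current build. The property name is an assumption.
tagged_only() {
  awk -v want="-Dzombie.build.tag=$1" '
    { for (i = 2; i <= NF; i++) if ($i == want) { print; next } }'
}
```

A detector built on these helpers would take one reading, sleep 30 seconds,
take a second reading, and report only what persistent_pids prints. PIDs seen
in just one of the two readings are dropped, so an adjacent build whose forked
processes turn over between readings no longer counts as zombies. Piping the
listing through tagged_only first would restrict the check to the current
build's own forks. The sketch ignores PID reuse by the operating system.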



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-12-03 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037629#comment-15037629
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #527 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/527/])
 HBASE-14772 Improve zombie detector; be more discerning; part2; (stack: rev 
69658ea4a916c8ea5e6dd7d056a548e8dce4e96d)
* dev-support/test-patch.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-12-03 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15037492#comment-15037492
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #526 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/526/])
HBASE-14772 Improve zombie detector; be more discerning; part2 (stack: rev 
cf8d3bd641ef9f69dabecec1b9e87272493fe825)
* dev-support/zombie-detector.sh
* dev-support/test-patch.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-12-03 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15038816#comment-15038816
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #528 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/528/])
HBASE-14772 Improve zombie detector; be more discerning; part2; addendum 
(stack: rev a154ecda00d9d9a58e83d322dae7ffd3518b633c)
* dev-support/test-patch.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14997942#comment-14997942
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #452 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/452/])
  HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM fix (stack: 
rev 44367f55e8bbd252ae824d1ddb626b5ce91fe75d)
* dev-support/zombie-detector.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14996226#comment-14996226
 ] 

Hudson commented on HBASE-14772:


SUCCESS: Integrated in HBase-Trunk_matrix #449 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/449/])
 HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM Add  (stack: 
rev 4c2e0d95dc62ee42b3da820d751a11eb52ce0069)
* dev-support/zombie-detector.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-08 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995863#comment-14995863
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #448 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/448/])
 HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM some (stack: 
rev 1cbcf1175e6ce497936f12c60fb2e897833ace39)
* dev-support/test-patch.sh
* dev-support/zombie-detector.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-07 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995499#comment-14995499
 ] 

Hudson commented on HBASE-14772:


SUCCESS: Integrated in HBase-Trunk_matrix #446 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/446/])
 HBASE-14772 Improve zombie detector; be more discerning; ADDENDUM set (stack: 
rev 305ecaf224340b0f6e248d4fdabf0b53e1cd3b03)
* dev-support/zombie-detector.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-06 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14994944#comment-14994944
 ] 

stack commented on HBASE-14772:
---

I pushed v2. This is part 1 of what will probably be a few pushes until I get 
it right. Let me hook it up in Jenkins now to run on the trunk build. It will 
make for some messy output for a while; I will clean up in subsequent patches.



[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-06 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14995087#comment-14995087
 ] 

Hudson commented on HBASE-14772:


FAILURE: Integrated in HBase-Trunk_matrix #442 (See 
[https://builds.apache.org/job/HBase-Trunk_matrix/442/])
HBASE-14772 Improve zombie detector; be more discerning (stack: rev 
bea2f7feacd1a34d27ee17c201aaeacc32e8cdaf)
* pom.xml
* dev-support/zombie-detector.sh




[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-06 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14993466#comment-14993466
 ] 

Hadoop QA commented on HBASE-14772:
---

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12770968/zombiev2.patch
  against master branch at commit bfa36891901b96b95d82f5307642c35fd2b9f534.
  ATTACHMENT ID: 12770968

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified tests.

{color:green}+1 hadoop versions{color}. The patch compiles with all 
supported hadoop versions (2.4.0 2.4.1 2.5.0 2.5.1 2.5.2 2.6.0 2.6.1 2.7.0 
2.7.1)

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 protoc{color}.  The applied patch does not increase the 
total number of protoc compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 checkstyle{color}.  The applied patch does not increase the 
total number of checkstyle errors.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 lineLengths{color}.  The patch introduces the following lines 
longer than 100:
+echo "Found ${ZOMBIE_TESTS_COUNT} suspicious java process(es); waiting ${wait}s to see if just slow to stop"
+  {color:red}-1 core zombie tests{color}.  There are ${ZOMBIE_TESTS_COUNT} possible zombie test(s): ${ZB_STACK}"
+
> Key: HBASE-14772
> URL: https://issues.apache.org/jira/browse/HBASE-14772
> Project: HBase
>  Issue Type: Sub-task
>  Components: test
>Reporter: stack
>Assignee: stack
> Attachments: zombie.patch, zombiev2.patch





[jira] [Commented] (HBASE-14772) Improve zombie detector; be more discerning

2015-11-05 Thread Sean Busbey (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14992223#comment-14992223
 ] 

Sean Busbey commented on HBASE-14772:
-

Yeah, a UUID or some such specific to the current build would be perfect. On 
Jenkins at least, there's ${BUILD_ID} that we can use.
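
A rough sketch of what filtering on such a tag could look like. The property name `hbase.build.id`, the function `only_this_build`, and the sample listing are all hypothetical, and the sketch assumes the tag ends up on each forked JVM's command line as a `-D` argument; how it would get there is not settled in this thread.

```shell
# Hypothetical property name; nothing in this thread fixes what it is called.
BUILD_TAG_PROP="hbase.build.id"

# Read a process listing on stdin (one "PID args..." line per process) and
# print only the lines whose arguments contain exactly this build's tag.
only_this_build() {
  awk -v tag="-D${BUILD_TAG_PROP}=$1" '
    { for (i = 2; i <= NF; i++) if ($i == tag) { print; next } }'
}

# Made-up listing with forks from two concurrent builds.
listing='12823 surefirebooter1.jar -Dhbase.test -Dhbase.build.id=529
7653 surefirebooter2.jar -Dhbase.test -Dhbase.build.id=530'
printf '%s\n' "$listing" | only_this_build 529
```

Comparing whole arguments rather than substrings keeps build 529 from also matching a build such as 5290. On Jenkins the argument would presumably be "${BUILD_ID}" instead of a literal.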

> Improve zombie detector; be more discerning
> ---
>
> Key: HBASE-14772
> URL: https://issues.apache.org/jira/browse/HBASE-14772
> Project: HBase
>  Issue Type: Sub-task
>  Components: test
>Reporter: stack


