[ 
https://issues.apache.org/jira/browse/HBASE-14305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yu Li updated HBASE-14305:
--------------------------
    Description: 
As titled, this failure is reported in a UT check by HadoopQA, below is part of 
the jstack output:
{noformat}
"main" prio=10 tid=0x00007fb77000a800 nid=0x5004 in Object.wait() 
[0x00007fb778799000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007ee9a5260> (a java.util.LinkedList)
        at 
org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForPreviousTransactionsComplete(MultiVersionConcurrencyControl.java:224)
        - locked <0x00000007ee9a5260> (a java.util.LinkedList)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2254)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2061)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2026)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2016)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1423)
        at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1344)
        - locked <0x00000007ee9c85e8> (a java.lang.Object)
        at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1295)
        at 
org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:352)
        at 
org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileGetting(TestHRegion.java:3999)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
{noformat}
It seems waiting on waitQueue never got notified and cause the case a zombie

Full jstack output please refer to [this 
link|https://builds.apache.org/job/PreCommit-HBASE-Build/15244//consoleFull]

  was:
As titled, this failure is reported in a UT check by HadoopQA, below is part of 
the jstack output:
{noformat}
"main" prio=10 tid=0x00007fb77000a800 nid=0x5004 in Object.wait() 
[0x00007fb778799000]
   java.lang.Thread.State: WAITING (on object monitor)
        at java.lang.Object.wait(Native Method)
        - waiting on <0x00000007ee9a5260> (a java.util.LinkedList)
        at 
org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForPreviousTransactionsComplete(MultiVersionConcurrencyControl.java:224)
        - locked <0x00000007ee9a5260> (a java.util.LinkedList)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2254)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2061)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2026)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2016)
        at 
org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1423)
        at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1344)
        - locked <0x00000007ee9c85e8> (a java.lang.Object)
        at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1295)
        at 
org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:352)
        at 
org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileGetting(TestHRegion.java:3999)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
{noformat}
It seems we are waiting on the same waitQueue object after got its lock.

Full jstack output please refer to [this 
link|https://builds.apache.org/job/PreCommit-HBASE-Build/15244//consoleFull]


> Deadlock observed in MVCC during Unit Testing
> ---------------------------------------------
>
>                 Key: HBASE-14305
>                 URL: https://issues.apache.org/jira/browse/HBASE-14305
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 2.0.0
>            Reporter: Yu Li
>
> As titled, this failure is reported in a UT check by HadoopQA, below is part 
> of the jstack output:
> {noformat}
> "main" prio=10 tid=0x00007fb77000a800 nid=0x5004 in Object.wait() 
> [0x00007fb778799000]
>    java.lang.Thread.State: WAITING (on object monitor)
>       at java.lang.Object.wait(Native Method)
>       - waiting on <0x00000007ee9a5260> (a java.util.LinkedList)
>       at 
> org.apache.hadoop.hbase.regionserver.MultiVersionConcurrencyControl.waitForPreviousTransactionsComplete(MultiVersionConcurrencyControl.java:224)
>       - locked <0x00000007ee9a5260> (a java.util.LinkedList)
>       at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2254)
>       at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2061)
>       at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2026)
>       at 
> org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2016)
>       at 
> org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:1423)
>       at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1344)
>       - locked <0x00000007ee9c85e8> (a java.lang.Object)
>       at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:1295)
>       at 
> org.apache.hadoop.hbase.HBaseTestingUtility.closeRegionAndWAL(HBaseTestingUtility.java:352)
>       at 
> org.apache.hadoop.hbase.regionserver.TestHRegion.testWritesWhileGetting(TestHRegion.java:3999)
>       at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> {noformat}
> It seems waiting on waitQueue never got notified and cause the case a zombie
> Full jstack output please refer to [this 
> link|https://builds.apache.org/job/PreCommit-HBASE-Build/15244//consoleFull]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to