[ 
https://issues.apache.org/jira/browse/HBASE-7865?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13579869#comment-13579869
 ] 

Ted Yu commented on HBASE-7865:
-------------------------------

From https://issues.apache.org/jira/secure/attachment/12569671/jstack_node3.txt:
{code}
"RS_CLOSE_REGION-node3,60020,1360977975359-2" prio=10 tid=0x00000000032ff000 nid=0x3a2d waiting on condition [0x00007f74f0ead000]
   java.lang.Thread.State: WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x0000000625b93ee8> (a java.util.concurrent.locks.ReentrantLock$NonfairSync)
        at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:834)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireQueued(AbstractQueuedSynchronizer.java:867)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquire(AbstractQueuedSynchronizer.java:1197)
        at java.util.concurrent.locks.ReentrantLock$NonfairSync.lock(ReentrantLock.java:214)
        at java.util.concurrent.locks.ReentrantLock.lock(ReentrantLock.java:290)
        at org.apache.hadoop.hbase.regionserver.wal.HLog.startCacheFlush(HLog.java:1551)
        at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:1490)
        at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:1435)
        at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:968)
        at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:916)
        - locked <0x0000000627dee918> (a java.lang.Object)
        at org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler.process(CloseRegionHandler.java:119)
        at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:169)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603)
        at java.lang.Thread.run(Thread.java:722)

"RS_CLOSE_REGION-node3,60020,1360977975359-1" prio=10 tid=0x0000000002aab800 nid=0x3a2c waiting on condition [0x00007f74f1dbc000]
   java.lang.Thread.State: WAITING (parking)
        at sun.misc.Unsafe.park(Native Method)
        - parking to wait for  <0x0000000625b93ee8> (a java.util.concurrent.locks.ReentrantLock$NonfairSync)
        at java.util.concurrent.locks.LockSupport.park(LockSupport.java:186)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:834)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireQueued(AbstractQueuedSynchronizer.java:867)
        at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquire(AbstractQueuedSynchronizer.java:1197)
        at java.util.concurrent.locks.ReentrantLock$NonfairSync.lock(ReentrantLock.java:214)
        at java.util.concurrent.locks.ReentrantLock.lock(ReentrantLock.java:290)
        at org.apache.hadoop.hbase.regionserver.wal.HLog.startCacheFlush(HLog.java:1551)
        at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:1490)
        at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:1435)
        at org.apache.hadoop.hbase.regionserver.HRegion.doClose(HRegion.java:968)
        at org.apache.hadoop.hbase.regionserver.HRegion.close(HRegion.java:916)
{code}
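It looks like the regions were having trouble closing: both RS_CLOSE_REGION threads above are parked in HLog.startCacheFlush(), waiting for the same ReentrantLock (<0x0000000625b93ee8>).
The region server log would help with further diagnosis.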
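
For readers less familiar with this jstack pattern, here is a minimal, self-contained sketch (not HBase code) of how a thread ends up in "WAITING (parking)" on a ReentrantLock, and how the lock owner can be looked up through the standard ThreadMXBean API. The class name ParkedOnLockDemo and the thread names are made up for illustration; which thread actually holds the lock in the attached dump is not known from the excerpt above.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical demo class; it only mimics the shape of the dump, not HBase itself.
class ParkedOnLockDemo {

    /** What we saw when inspecting the blocked thread. */
    static final class Observation {
        final Thread.State state;   // expected: WAITING (shown by jstack as "WAITING (parking)")
        final String lockName;      // class name and identity hash of the synchronizer
        final String lockOwnerName; // name of the thread that holds the lock

        Observation(Thread.State state, String lockName, String lockOwnerName) {
            this.state = state;
            this.lockName = lockName;
            this.lockOwnerName = lockOwnerName;
        }
    }

    static Observation observe() {
        ReentrantLock lock = new ReentrantLock();
        CountDownLatch held = new CountDownLatch(1);
        CountDownLatch release = new CountDownLatch(1);

        // Stand-in for whichever thread holds the lock and does not release it promptly.
        Thread holder = new Thread(() -> {
            lock.lock();
            try {
                held.countDown();
                release.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                lock.unlock();
            }
        }, "demo-lock-holder");

        // Stand-in for a close-region handler that needs the same lock.
        Thread waiter = new Thread(() -> {
            lock.lock();
            lock.unlock();
        }, "demo-close-region");

        try {
            holder.start();
            held.await();          // make sure the lock is taken before the waiter starts
            waiter.start();

            // Poll until the waiter has parked inside lock().
            while (waiter.getState() != Thread.State.WAITING) {
                Thread.sleep(10);
            }

            ThreadInfo info =
                ManagementFactory.getThreadMXBean().getThreadInfo(waiter.getId());
            Observation obs = new Observation(
                waiter.getState(), info.getLockName(), info.getLockOwnerName());

            release.countDown();   // let both threads finish
            holder.join();
            waiter.join();
            return obs;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while observing", e);
        }
    }

    public static void main(String[] args) {
        Observation obs = observe();
        System.out.println("state      = " + obs.state);
        System.out.println("waiting on = " + obs.lockName);
        System.out.println("held by    = " + obs.lockOwnerName);
    }
}
```

On a live region server the equivalent information can usually be obtained with `jstack -l <pid>`, which adds a "Locked ownable synchronizers" section per thread; searching that output for the lock address should point at the thread holding it.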
                
> HBase regionserver never stops when running `bin/stop-hbase.sh` on master
> -------------------------------------------------------------------------
>
>                 Key: HBASE-7865
>                 URL: https://issues.apache.org/jira/browse/HBASE-7865
>             Project: HBase
>          Issue Type: Bug
>    Affects Versions: 0.94.5
>            Reporter: Jean-Marc Spaggiari
>         Attachments: jstack_node1.txt, jstack_node3.txt, jstack_node7.txt
>
>
> I faced 3 region servers (out of 8) never stopping today. This is pretty bad because 
> the script is supposed to wait until all the RSs have stopped before restarting 
> everything; therefore, the servers never come back online.
> HBASE-7838 will help with that and will kill the RSs, but that will not 
> really solve the root cause.
> Attached are the jstack outputs for the 3 servers.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
