Ashu Pachauri created HBASE-14699:
-------------------------------------

             Summary: Replication crashes regionservers when hbase.wal.provider 
is set to multiwal
                 Key: HBASE-14699
                 URL: https://issues.apache.org/jira/browse/HBASE-14699
             Project: HBase
          Issue Type: Bug
          Components: Replication
            Reporter: Ashu Pachauri
            Priority: Blocker


When the hbase.wal.provider is set to multiwal and replication is enabled, the 
regionservers start crashing with the following exception:

{code}
<hostname>,16020,1445495411258: Failed to write replication wal position 
(filename=<hostname>%2C16020%2C1445495411258.null0.1445495898373, 
position=1322399)
org.apache.zookeeper.KeeperException$NoNodeException: KeeperErrorCode = NoNode 
for 
/hbase/replication/rs/<hostname>,16020,1445495411258/1/<hostname>%2C16020%2C1445495411258.null0.1445495898373
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:111)
        at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
        at org.apache.zookeeper.ZooKeeper.setData(ZooKeeper.java:1270)
        at 
org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.setData(RecoverableZooKeeper.java:429)
        at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:940)
        at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:990)
        at org.apache.hadoop.hbase.zookeeper.ZKUtil.setData(ZKUtil.java:984)
        at 
org.apache.hadoop.hbase.replication.ReplicationQueuesZKImpl.setLogPosition(ReplicationQueuesZKImpl.java:129)
        at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager.logPositionAndCleanOldLogs(ReplicationSourceManager.java:177)
        at 
org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.run(ReplicationSource.java:388)
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to