[ https://issues.apache.org/jira/browse/HBASE-6847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13459831#comment-13459831 ]
Himanshu Vashishtha commented on HBASE-6847:
--------------------------------------------

I wonder about the IndexOutOfBound exception. Is this hlog part of a failover regionserver?

> HBASE-6649 broke replication
> ----------------------------
>
>                 Key: HBASE-6847
>                 URL: https://issues.apache.org/jira/browse/HBASE-6847
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Jean-Daniel Cryans
>            Assignee: Devaraj Das
>            Priority: Blocker
>             Fix For: 0.96.0, 0.92.3, 0.94.2
>
>         Attachments: HBASE-6847-0.94.patch, HBASE-6847.patch
>
>
> After running with HBASE-6646 and replication enabled I encountered this:
> {noformat}
> 2012-09-17 20:04:08,111 DEBUG org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: Opening log for replication va1r3s24%2C10304%2C1347911704238.1347911706318 at 78617132
> 2012-09-17 20:04:08,120 DEBUG org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: Break on IOE: hdfs://va1r5s41:10101/va1-backup/.logs/va1r3s24,10304,1347911704238/va1r3s24%2C10304%2C1347911704238.1347911706318, entryStart=78641557, pos=78771200, end=78771200, edit=84
> 2012-09-17 20:04:08,120 DEBUG org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: currentNbOperations:164529 and seenEntries:84 and size: 154068
> 2012-09-17 20:04:08,120 DEBUG org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: Replicating 84
> 2012-09-17 20:04:08,146 INFO org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager: Going to report log #va1r3s24%2C10304%2C1347911704238.1347911706318 for position 78771200 in hdfs://va1r5s41:10101/va1-backup/.logs/va1r3s24,10304,1347911704238/va1r3s24%2C10304%2C1347911704238.1347911706318
> 2012-09-17 20:04:08,158 INFO org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceManager: Removing 0 logs in the list: []
> 2012-09-17 20:04:08,158 DEBUG org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: Replicated in total: 93234
> 2012-09-17 20:04:08,158 DEBUG org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: Opening log for replication va1r3s24%2C10304%2C1347911704238.1347911706318 at 78771200
> 2012-09-17 20:04:08,163 ERROR org.apache.hadoop.hbase.replication.regionserver.ReplicationSource: Unexpected exception in ReplicationSource, currentPath=hdfs://va1r5s41:10101/va1-backup/.logs/va1r3s24,10304,1347911704238/va1r3s24%2C10304%2C1347911704238.1347911706318
> java.lang.IndexOutOfBoundsException
>         at java.io.DataInputStream.readFully(DataInputStream.java:175)
>         at org.apache.hadoop.io.DataOutputBuffer$Buffer.write(DataOutputBuffer.java:63)
>         at org.apache.hadoop.io.DataOutputBuffer.write(DataOutputBuffer.java:101)
>         at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:2001)
>         at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1901)
>         at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1947)
>         at org.apache.hadoop.hbase.regionserver.wal.SequenceFileLogReader.next(SequenceFileLogReader.java:235)
>         at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.readAllEntriesToReplicateOrNextFile(ReplicationSource.java:394)
>         at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.run(ReplicationSource.java:307)
> {noformat}
> There's something weird at the end of the file and it's killing replication. We used to just retry.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira