[ 
https://issues.apache.org/jira/browse/HBASE-27162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Kyle Purtell updated HBASE-27162:
----------------------------------------
    Summary: TestReplicationValueCompressedWAL is flaky  (was: Could get 
malformed WAL PB messge when replication if WAL value compression is enabled)

> TestReplicationValueCompressedWAL is flaky
> ------------------------------------------
>
>                 Key: HBASE-27162
>                 URL: https://issues.apache.org/jira/browse/HBASE-27162
>             Project: HBase
>          Issue Type: Bug
>          Components: Replication, wal
>            Reporter: Duo Zhang
>            Priority: Major
>
> https://nightlies.apache.org/hbase/HBase-Flaky-Tests/master/3752/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.replication.regionserver.TestReplicationValueCompressedWAL-output.txt
> {noformat}
> 2022-06-26T08:41:27,359 DEBUG 
> [RS_REFRESH_PEER-regionserver/jenkins-hbase19:0-0.replicationSource,2.replicationSource.wal-reader.jenkins-hbase19.apache.org%2C44659%2C1656232862372,2]
>  wal.ProtobufLogReader(448): Encountered a malformed edit, seeking back to 
> last good position in file, from 65537 to 65536
> java.io.EOFException: Partial PB while reading WAL, probably an unexpected 
> EOF, ignoring. current offset=65537
>       at 
> org.apache.hadoop.hbase.regionserver.wal.ProtobufLogReader.readNext(ProtobufLogReader.java:390)
>  ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:104) 
> ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.regionserver.wal.ReaderBase.next(ReaderBase.java:92) 
> ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.readNextEntryAndRecordReaderPosition(WALEntryStream.java:264)
>  ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.tryAdvanceEntry(WALEntryStream.java:178)
>  ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.replication.regionserver.WALEntryStream.hasNext(WALEntryStream.java:103)
>  ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.readWALEntries(ReplicationSourceWALReader.java:212)
>  ~[classes/:?]
>       at 
> org.apache.hadoop.hbase.replication.regionserver.ReplicationSourceWALReader.run(ReplicationSourceWALReader.java:148)
>  ~[classes/:?]
> {noformat}
> Actually, we should still have a lot of edits after this point. This could  
> be a critical problem as we will hang replication forever...



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to