Re: Replication Error in HBase Production Environment

2024-10-11 Thread Hamado Dene
Hello Community, I’m still encountering this issue in production and haven’t yet found a way to resolve it. Do you have any suggestions on how I can debug and address this problem? Thanks, Hamado Dene On Thursday, 26 September 2024 at 09:53:35 CEST, Hamado Dene wrote: Dump

Re: Replication Error in HBase Production Environment

2024-09-26 Thread Hamado Dene
Dumping the ZooKeeper content for those oldWALs, I see this: [zk: rzv-db06-hn.:2181,rzv-db07-hn.:2181,rzv-db08-hn.:2181(CONNECTED) 7] get /hbase/replication/rs/rzv-db10-hd.,16020,1726864206981/replicav3-rzv-db13-hd.,16020,1684871532555-rzv-db09-hd.,16020,1696832789107-r

Re: Replication Error in HBase Production Environment

2024-09-24 Thread Hamado Dene
Is there an HBase utility to dump the contents of ZooKeeper? The data in that path is not directly readable from ZooKeeper... I probably need to decode it somehow. Thanks, Hamado Dene On Wednesday, 18 September 2024 at 16:26:20 CEST, 张铎(Duo Zhang) wrote: It is a bit strang

Re: Replication Error in HBase Production Environment

2024-09-18 Thread Duo Zhang
It is a bit strange that the positions for all the very old WAL files are -1. I skimmed the code for branch-2.5, and it seems they should only be set to 0. Could you please try to dump the content of the znode that records the position of the given WAL file? The path on ZooKeeper should be something like /

Re: Replication Error in HBase Production Environment

2024-09-18 Thread Hamado Dene
I did some investigation, and the WALs seem to be readable without any issues... One strange thing I noticed is that the WALs are very old... they are about a year old. -rw-r--r-- 2 hbase hadoop 42594304 2023-10-09 08:27 /hbase/oldWALs/rzv-db09-hd.%2C16020%2C167497335460

Re: Replication Error in HBase Production Environment

2024-09-16 Thread Hamado Dene
I deduced that it was one of the old WALs because, from the UI, I see that these old WALs are not being replicated. However, I'll do another round of checks to see if I can find something more. Would enabling debug help me find more information? Thanks again for your help. Replication Status

Re: Replication Error in HBase Production Environment

2024-09-16 Thread Duo Zhang
The stack trace you posted is garbled, so it is not easy to find out which file actually blocks the replication progress... Could you please double check the WAL file which blocks the replication? Is it really one of these old WAL files? Thanks. On Mon, Sep 16, 2024 at 21:57, Hamado Dene wrote: > > Thanks for

Re: Replication Error in HBase Production Environment

2024-09-16 Thread Hamado Dene
Thanks for your response. If I try to read the WALs with the following command: hbase org.apache.hadoop.hbase.wal.WALPrettyPrinter /hbase/oldWALs/rzv-db13-hd.%2C16020%2C1684871532555.1696811057371 I don't get any error... The file seems to be read correctly. In fact, at the end of the reading

Re: Replication Error in HBase Production Environment

2024-09-16 Thread Duo Zhang
Have you tried to read these WAL files with WALPrettyPrinter? What is the error from WALPrettyPrinter while reading these files? On Mon, Sep 16, 2024 at 16:15, Hamado Dene wrote: > > Checking the WALs on HDFS, there are very old WALs, from a year ago... Does > anyone have any idea how to handle this issue in pro

Re: Replication Error in HBase Production Environment

2024-09-16 Thread Hamado Dene
Checking the WALs on HDFS, there are very old WALs, from a year ago... Does anyone have any idea how to handle this issue in production? -rw-r--r--   2 hbase hadoop   20684288 2023-10-09 08:26 /hbase/oldWALs/rzv-db14-hd.%2C16020%2C1674973593505.1696810047993 -rw-r--r--   2 hbase hadoop   15
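When judging how old the files in such a listing are, it can help to read the age straight from the WAL file name rather than from the HDFS modification time. The following is a minimal sketch, not an HBase utility. It assumes the name layout visible in the listing above, `<host>%2C<port>%2C<startcode>.<timestamp>`, with the trailing number taken to be the creation time in epoch milliseconds. The names `WalName`, `parse_wal_name` and `wal_created_utc` are hypothetical, and names carrying an extra suffix or provider id would need additional handling.

```python
from datetime import datetime, timezone
from typing import NamedTuple
from urllib.parse import unquote


class WalName(NamedTuple):
    """Fields recovered from a WAL file name (hypothetical helper type)."""
    host: str
    port: int
    start_code: int
    created_ms: int


def parse_wal_name(path: str) -> WalName:
    """Split an oldWALs path into server name parts and the trailing timestamp.

    Assumes the layout <host>%2C<port>%2C<startcode>.<timestamp> with no
    extra suffix; anything else raises ValueError.
    """
    # Keep only the file name and turn %2C back into commas.
    name = unquote(path.rsplit("/", 1)[-1])
    host, port, rest = name.split(",", 2)
    start_code, created_ms = rest.split(".", 1)
    return WalName(host, int(port), int(start_code), int(created_ms))


def wal_created_utc(wal: WalName) -> datetime:
    """Interpret the trailing number as epoch milliseconds (an assumption)."""
    return datetime.fromtimestamp(wal.created_ms / 1000, tz=timezone.utc)


if __name__ == "__main__":
    wal = parse_wal_name(
        "/hbase/oldWALs/rzv-db14-hd.%2C16020%2C1674973593505.1696810047993"
    )
    age = datetime.now(timezone.utc) - wal_created_utc(wal)
    print(wal, wal_created_utc(wal).isoformat(), f"age: {age.days} days")
```

Running this over every entry in the oldWALs listing gives a quick view of which region server incarnations (host, port, start code) the stale files belong to.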

Re: Replication Error in HBase Production Environment

2024-09-12 Thread Hamado Dene
Hi community, Could anyone kindly assist me in resolving this issue I'm facing? Thank you in advance! Hamado Dene On Wednesday, 11 September 2024 at 16:26:55 CEST, Hamado Dene wrote: Hi HBase Community, We are currently facing an issue in our production environment with HBase

Replication Error in HBase Production Environment

2024-09-11 Thread Hamado Dene
Hi HBase Community, We are currently facing an issue in our production environment with HBase replication, and I would greatly appreciate any guidance or suggestions the community may have. We are running HBase version 2.5.8, and in the logs, we consistently encounter the following warning: 0