[jira] [Created] (HBASE-27872) xerial's snappy-java requires GLIBC >= 2.32

2023-05-17 Thread Andrew Kyle Purtell (Jira)
Andrew Kyle Purtell created HBASE-27872:
---

 Summary: xerial's snappy-java requires GLIBC >= 2.32
 Key: HBASE-27872
 URL: https://issues.apache.org/jira/browse/HBASE-27872
 Project: HBase
  Issue Type: Bug
Affects Versions: 2.5.4
Reporter: Andrew Kyle Purtell
Assignee: Andrew Kyle Purtell
 Fix For: 2.5.5


We need to add a native library load check with a helpful error message if it 
fails to load due to a too old glibc or similar reason, and disable the unit 
test if the native library fails to load. 

{noformat}
TestHFileCompressionSnappy.test:54->HFileTestBase.doTest:72 »
UnsatisfiedLink 
/home/apurtell/src/hbase/hbase-compression/hbase-compression-snappy/target/snappy-1.1.9-6406bd03-44b4-4586-a91b-79d213c14062-libsnappyjava.so:
/lib/aarch64-linux-gnu/libc.so.6: version `GLIBC_2.32'
{noformat}

In this case a Debian test VM has "Debian GLIBC 2.31-13+deb11u6" installed, 
which is too load. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (HBASE-27871) Meta replication stuck forever if wal it's still reading gets rolled and deleted

2023-05-17 Thread Wellington Chevreuil (Jira)
Wellington Chevreuil created HBASE-27871:


 Summary: Meta replication stuck forever if wal it's still reading 
gets rolled and deleted
 Key: HBASE-27871
 URL: https://issues.apache.org/jira/browse/HBASE-27871
 Project: HBase
  Issue Type: Bug
  Components: meta replicas
Affects Versions: 2.5.4, 2.4.17
Reporter: Wellington Chevreuil
Assignee: Wellington Chevreuil


This affects branch-2 based releases only (in master, HBASE-26416 refactored 
region replication to not rely on the replication framework anymore).

Per the original [meta region replicas 
design|https://docs.google.com/document/d/1jJWVc-idHhhgL4KDRpjMsQJKCl_NRaCLGiH3Wqwd3O8/edit],
 we use most of the replication framework for communicating changes in the 
primary replica back to the secondary ones, but we skip storing the queue state 
in ZK. In the event of a region replication crash, we should let the related 
replication source thread be interrupted, so that 
RegionReplicaReplicationEndpoint would set a new source from the scratch and 
make sure to update the secondary replicas.
 
We have run into a situation in one of our customers' cluster where the region 
replica source faced a long lag (probably because the RSes hosting the 
secondary replicas were busy and slower in processing the region replication 
entries), so that the current wal got rolled and eventually deleted whilst the 
replication source reader was still referring it. In such cases, 
ReplicationSourceReader only sees the IOException and keeps retrying the read 
indefinitely, but since the file is now gone, it will just get stuck there 
forever. In the particular case of FNFE (which I believe would only happen for 
region replication), we should just raise an exception and let 
RegionReplicaReplicationEndpoint handle it to reset the region replication 
source.
 
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (HBASE-27870) Eliminate the 'WARNING: package jdk.internal.util.random not in java.base' when running UTs with jdk11

2023-05-17 Thread Duo Zhang (Jira)
Duo Zhang created HBASE-27870:
-

 Summary: Eliminate the 'WARNING: package jdk.internal.util.random 
not in java.base' when running UTs with jdk11
 Key: HBASE-27870
 URL: https://issues.apache.org/jira/browse/HBASE-27870
 Project: HBase
  Issue Type: Improvement
  Components: build, pom, test
Reporter: Duo Zhang
Assignee: Duo Zhang


When fixing a jdk17 issue, we add a

--add-opens java.base/jdk.internal.util.random=ALL-UNNAMED

line in our pom when running UTs with jdk11+, so it will also be applied for 
jdk11, but we only need this for jdk17, on jdk11, it will generate a warning, 
as described in the title.

We should try some maven magics to fix this.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)