[jira] [Created] (HBASE-27366) split or merge removed region under snapshot

2022-09-12 Thread Huaxiang Sun (Jira)
Huaxiang Sun created HBASE-27366:


 Summary: split or merge removed region under snapshot
 Key: HBASE-27366
 URL: https://issues.apache.org/jira/browse/HBASE-27366
 Project: HBase
  Issue Type: Bug
  Components: snapshots
Affects Versions: 2.4.10
Reporter: Huaxiang Sun


We run into snapshot failures for one table with large number of regions. The 
event sequence is like the following:

 
 # Snapshot process lists all regions for one table.
 # Normalize kicks in to split some regions for the table under snapshot.
 # split finishes and major compaction finishes. The parent region is moved to 
archive.
 # When the Snapshot processes the parent region, it does not exist and 
snapshot fails.

Since snapshot process acquires the table lock, but there is no table lock 
acquired in split or merge process, they crash into each other.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (HBASE-27365) Minimise block addition failures due to no space in bucket cache writers queue by introducing wait time

2022-09-12 Thread Rajeshbabu Chintaguntla (Jira)
Rajeshbabu Chintaguntla created HBASE-27365:
---

 Summary: Minimise block addition failures due to no space in 
bucket cache writers queue by introducing wait time
 Key: HBASE-27365
 URL: https://issues.apache.org/jira/browse/HBASE-27365
 Project: HBase
  Issue Type: Improvement
  Components: BucketCache
Reporter: Rajeshbabu Chintaguntla
Assignee: Rajeshbabu Chintaguntla


Currently in bucket cache asynchronous caching mechanism introduced where 
initially the blocks to be cached will be added to queue and writer threads 
consume the blocks from the queue and write to bucket cache. In case if block 
writing to bucket cache is slow then there is a chance that  queue of writer 
threads become full  and following block additions will be failed. In case of 
slower storages like s3 might introduce latencies even if we enable bigger 
sizes of bucket cache using ephemeral storages. So we can allow configurable 
wait time while adding blocks to queue so that chances of queue free up is 
possible during the wait time and block addition failures can be minimised.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (HBASE-27364) Intra-cluster replication sink metrics

2022-09-12 Thread Hernan Gelaf-Romer (Jira)
Hernan Gelaf-Romer created HBASE-27364:
--

 Summary: Intra-cluster replication sink metrics
 Key: HBASE-27364
 URL: https://issues.apache.org/jira/browse/HBASE-27364
 Project: HBase
  Issue Type: Improvement
  Components: metrics, read replicas
Reporter: Hernan Gelaf-Romer


Region replication doesn't emit any sink metrics at the moment, these would be 
useful in determining replication lag. Adding metrics such as 
ageOfLastAppliedOp would be helpful.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)