[jira] [Created] (HBASE-27366) split or merge removed region under snapshot
Huaxiang Sun created HBASE-27366: Summary: split or merge removed region under snapshot Key: HBASE-27366 URL: https://issues.apache.org/jira/browse/HBASE-27366 Project: HBase Issue Type: Bug Components: snapshots Affects Versions: 2.4.10 Reporter: Huaxiang Sun We run into snapshot failures for one table with large number of regions. The event sequence is like the following: # Snapshot process lists all regions for one table. # Normalize kicks in to split some regions for the table under snapshot. # split finishes and major compaction finishes. The parent region is moved to archive. # When the Snapshot processes the parent region, it does not exist and snapshot fails. Since snapshot process acquires the table lock, but there is no table lock acquired in split or merge process, they crash into each other. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (HBASE-27365) Minimise block addition failures due to no space in bucket cache writers queue by introducing wait time
Rajeshbabu Chintaguntla created HBASE-27365: --- Summary: Minimise block addition failures due to no space in bucket cache writers queue by introducing wait time Key: HBASE-27365 URL: https://issues.apache.org/jira/browse/HBASE-27365 Project: HBase Issue Type: Improvement Components: BucketCache Reporter: Rajeshbabu Chintaguntla Assignee: Rajeshbabu Chintaguntla Currently in bucket cache asynchronous caching mechanism introduced where initially the blocks to be cached will be added to queue and writer threads consume the blocks from the queue and write to bucket cache. In case if block writing to bucket cache is slow then there is a chance that queue of writer threads become full and following block additions will be failed. In case of slower storages like s3 might introduce latencies even if we enable bigger sizes of bucket cache using ephemeral storages. So we can allow configurable wait time while adding blocks to queue so that chances of queue free up is possible during the wait time and block addition failures can be minimised. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (HBASE-27364) Intra-cluster replication sink metrics
Hernan Gelaf-Romer created HBASE-27364: -- Summary: Intra-cluster replication sink metrics Key: HBASE-27364 URL: https://issues.apache.org/jira/browse/HBASE-27364 Project: HBase Issue Type: Improvement Components: metrics, read replicas Reporter: Hernan Gelaf-Romer Region replication doesn't emit any sink metrics at the moment, these would be useful in determining replication lag. Adding metrics such as ageOfLastAppliedOp would be helpful. -- This message was sent by Atlassian Jira (v8.20.10#820010)