[ https://issues.apache.org/jira/browse/HBASE-28463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on HBASE-28463 started by Janardhan Hungund. ------------------------------------------------- > Time Based Priority for BucketCache > ----------------------------------- > > Key: HBASE-28463 > URL: https://issues.apache.org/jira/browse/HBASE-28463 > Project: HBase > Issue Type: New Feature > Components: BucketCache > Reporter: Janardhan Hungund > Assignee: Janardhan Hungund > Priority: Major > > This Jira introduces the feature of time-based data tiering in HBase to > optimize storage efficiency and access performance by segregating data based > on its recency. By keeping recent data in the bucket cache (backed by faster > storage types like SSDs) and evicting older data, the system aims to provide > a more flexible control over the cache allocation and eviction logic via > configuration, allowing for defining time priorities for cached data. > The need for a more extensive cache allocation mechanism becomes even more > critical on HBase deployments where cache access reflects on significant > performance gains, such as when using cloud storage as the underlying file > system. > The data is segregated into hot or cold categories based on its age. The > recent data within a specific time range (configured as hot-data-age) is > treated as hot and is stored in the cache, while the older data is stored and > accessed from the file system. > This feature intends to provide the TCO gains by optimizing the utilization > of high cost bucket cache. Perfect fit for the use cases that have the > date-based data writes while the scans focus on the recently written data. > Please find the detailed design document of the feature attached with the > Jira. > Thanks, > Janardhan -- This message was sent by Atlassian Jira (v8.20.10#820010)