[ 
https://issues.apache.org/jira/browse/HBASE-28463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HBASE-28463 started by Janardhan Hungund.
-------------------------------------------------
> Time Based Priority for BucketCache
> -----------------------------------
>
>                 Key: HBASE-28463
>                 URL: https://issues.apache.org/jira/browse/HBASE-28463
>             Project: HBase
>          Issue Type: New Feature
>          Components: BucketCache
>            Reporter: Janardhan Hungund
>            Assignee: Janardhan Hungund
>            Priority: Major
>
> This Jira introduces the feature of time-based data tiering in HBase to 
> optimize storage efficiency and access performance by segregating data based 
> on its recency. By keeping recent data in the bucket cache (backed by faster 
> storage types like SSDs) and evicting older data, the system aims to provide 
> a more flexible control over the cache allocation and eviction logic via 
> configuration, allowing for defining time priorities for cached data. 
> The need for a more extensive cache allocation mechanism becomes even more 
> critical on HBase deployments where cache access reflects on significant 
> performance gains, such as when using cloud storage as the underlying file 
> system.
> The data is segregated into hot or cold categories based on its age. The 
> recent data within a specific time range (configured as hot-data-age) is 
> treated as hot and is stored in the cache, while the older data is stored and 
> accessed from the file system.
> This feature intends to provide the TCO gains by optimizing the utilization 
> of high cost bucket cache. Perfect fit for the use cases that have the 
> date-based data writes while the scans focus on the recently written data.
> Please find the detailed design document of the feature attached with the 
> Jira.
> Thanks,
> Janardhan



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to