[ 
https://issues.apache.org/jira/browse/KUDU-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15231031#comment-15231031
 ] 

Todd Lipcon commented on KUDU-1400:
-----------------------------------

Making the 2 minute threshold configurable seems like an easy change (it's just 
a constant right now).

Merging small DRS (especially those that have sat around for a while) does seem 
like a good idea. It would be interesting to consider this along with some 
other "lower priority" DRS reorganizations/rewrites such as policies that 
switch to denser compression or different storage tiers, even if we dont 
implement those features in the short term.

> Improve rowset compaction policy to consider merging small DRSs
> ---------------------------------------------------------------
>
>                 Key: KUDU-1400
>                 URL: https://issues.apache.org/jira/browse/KUDU-1400
>             Project: Kudu
>          Issue Type: Improvement
>            Reporter: Binglin Chang
>
> We see some small table with light write load generate lot's of small 
> DRS(~1MB), since those DRSes do not overlap much, they don't get the chance 
> to be compacted, generating lot of very small files/blocks. So:
> # Compaction solution value should consider benefits of merging small DRS
> # Every 2 min flushing MRS(small or large) seems suboptimal, maybe flushing 
> small MRS should have "lower priority" than rowset compaction with higher 
> solution value?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to