Eric Newton created ACCUMULO-1802:
-------------------------------------

             Summary: use case for future configurability of major compactions
                 Key: ACCUMULO-1802
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-1802
             Project: Accumulo
          Issue Type: Sub-task
          Components: tserver
            Reporter: Eric Newton


The default compaction strategy has a tendency to put the oldest data in the 
largest files.  This leads to a lot of work when it is time to age off data.

One could imaging a compaction strategy that would split data into separate 
files based on the timestamp.  Additionally, if the min/max timestamps for a 
file were known, old data could be aged off by deleting whole files.

Augment the configurable compaction strategy to support multiple output files, 
and saving/using extra metadata in each file.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to