Eric Newton created ACCUMULO-1802: ------------------------------------- Summary: use case for future configurability of major compactions Key: ACCUMULO-1802 URL: https://issues.apache.org/jira/browse/ACCUMULO-1802 Project: Accumulo Issue Type: Sub-task Components: tserver Reporter: Eric Newton
The default compaction strategy has a tendency to put the oldest data in the largest files. This leads to a lot of work when it is time to age off data. One could imaging a compaction strategy that would split data into separate files based on the timestamp. Additionally, if the min/max timestamps for a file were known, old data could be aged off by deleting whole files. Augment the configurable compaction strategy to support multiple output files, and saving/using extra metadata in each file. -- This message was sent by Atlassian JIRA (v6.1#6144)