[
https://issues.apache.org/jira/browse/HBASE-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13575100#comment-13575100
]
Kannan Muthukkaruppan commented on HBASE-7763:
----------------------------------------------
Lars: Sorry to pitch in late, but looks like the points got discussed
subsequently.
SequenceID based sorting, and selection of contiguous sub-range of files is
important during compactions. This is because, for duplicate entries (same Key
(RowKey+ColKey+TS)) in multiple files, sequence id is used as the tie breaker.
Looks like proposed fix now is limited to bulk loaded files (which have
sequence id of zero). That seems to be ok.
> Compactions not sorting based on size anymore.
> ----------------------------------------------
>
> Key: HBASE-7763
> URL: https://issues.apache.org/jira/browse/HBASE-7763
> Project: HBase
> Issue Type: Bug
> Components: Compaction
> Affects Versions: 0.96.0, 0.94.4
> Reporter: Elliott Clark
> Assignee: Elliott Clark
> Priority: Critical
> Fix For: 0.96.0, 0.94.6
>
> Attachments: HBASE-7763-trunk-1.patch, HBASE-7763-trunk-2.patch,
> HBASE-7763-trunk-3.patch, HBASE-7763-trunk-TESTING.patch,
> HBASE-7763-trunk-TESTING.patch, HBASE-7763-trunk-TESTING.patch
>
>
> Currently compaction selection is not sorting based on size. This causes
> selection to choose larger files to re-write than are needed when bulk loads
> are involved.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira