[ 
https://issues.apache.org/jira/browse/HBASE-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13575100#comment-13575100
 ] 

Kannan Muthukkaruppan commented on HBASE-7763:
----------------------------------------------

Lars: Sorry to pitch in late, but looks like the points got discussed 
subsequently.

SequenceID based sorting, and selection of contiguous sub-range of files is 
important during compactions. This is because, for duplicate entries (same Key 
(RowKey+ColKey+TS)) in multiple files, sequence id is used as the tie breaker. 
Looks like proposed fix now is limited to bulk loaded files (which have 
sequence id of zero). That seems to be ok.
                
> Compactions not sorting based on size anymore.
> ----------------------------------------------
>
>                 Key: HBASE-7763
>                 URL: https://issues.apache.org/jira/browse/HBASE-7763
>             Project: HBase
>          Issue Type: Bug
>          Components: Compaction
>    Affects Versions: 0.96.0, 0.94.4
>            Reporter: Elliott Clark
>            Assignee: Elliott Clark
>            Priority: Critical
>             Fix For: 0.96.0, 0.94.6
>
>         Attachments: HBASE-7763-trunk-1.patch, HBASE-7763-trunk-2.patch, 
> HBASE-7763-trunk-3.patch, HBASE-7763-trunk-TESTING.patch, 
> HBASE-7763-trunk-TESTING.patch, HBASE-7763-trunk-TESTING.patch
>
>
> Currently compaction selection is not sorting based on size.  This causes 
> selection to choose larger files to re-write than are needed when bulk loads 
> are involved.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to