[ 
https://issues.apache.org/jira/browse/HBASE-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Elliott Clark updated HBASE-7763:
---------------------------------

    Attachment: HBASE-7763-trunk-TESTING.patch

This test patch produced the results here:

https://docs.google.com/spreadsheet/ccc?key=0AqJ3FqeHriCkdGxWRE1pN2tIUTREdzhzZ0VzMGEwT2c&usp=sharing


>From what I can see what we have is the worst of all worlds.

Just adding sorting gets us more io but it also removes a lot more files.  
Meaning there could be less re-writing

Sorting, taking the smallest, and doing the ratio search from the right to the 
left seems to give us the best files removed per meg of io.  But since it's 
more conservative on number of files to compact it could lead to more 
re-writing in the long term.

Thoughts?
                
> Compactions not sorting based on size anymore.
> ----------------------------------------------
>
>                 Key: HBASE-7763
>                 URL: https://issues.apache.org/jira/browse/HBASE-7763
>             Project: HBase
>          Issue Type: Bug
>          Components: Compaction
>    Affects Versions: 0.96.0, 0.94.4
>            Reporter: Elliott Clark
>            Assignee: Elliott Clark
>            Priority: Critical
>             Fix For: 0.96.0, 0.94.6
>
>         Attachments: HBASE-7763-trunk-TESTING.patch, 
> HBASE-7763-trunk-TESTING.patch, HBASE-7763-trunk-TESTING.patch
>
>
> Currently compaction selection is not sorting based on size.  This causes 
> selection to choose larger files to re-write than are needed when bulk loads 
> are involved.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to