[ https://issues.apache.org/jira/browse/HBASE-7763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Elliott Clark updated HBASE-7763: --------------------------------- Attachment: HBASE-7763-trunk-TESTING.patch This test patch produced the results here: https://docs.google.com/spreadsheet/ccc?key=0AqJ3FqeHriCkdGxWRE1pN2tIUTREdzhzZ0VzMGEwT2c&usp=sharing >From what I can see what we have is the worst of all worlds. Just adding sorting gets us more io but it also removes a lot more files. Meaning there could be less re-writing Sorting, taking the smallest, and doing the ratio search from the right to the left seems to give us the best files removed per meg of io. But since it's more conservative on number of files to compact it could lead to more re-writing in the long term. Thoughts? > Compactions not sorting based on size anymore. > ---------------------------------------------- > > Key: HBASE-7763 > URL: https://issues.apache.org/jira/browse/HBASE-7763 > Project: HBase > Issue Type: Bug > Components: Compaction > Affects Versions: 0.96.0, 0.94.4 > Reporter: Elliott Clark > Assignee: Elliott Clark > Priority: Critical > Fix For: 0.96.0, 0.94.6 > > Attachments: HBASE-7763-trunk-TESTING.patch, > HBASE-7763-trunk-TESTING.patch, HBASE-7763-trunk-TESTING.patch > > > Currently compaction selection is not sorting based on size. This causes > selection to choose larger files to re-write than are needed when bulk loads > are involved. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira