[ 
https://issues.apache.org/jira/browse/HBASE-13122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14343269#comment-14343269
 ] 

Shuaifeng Zhou commented on HBASE-13122:
----------------------------------------

We have done a performance test, here are the result:
FamilyFilter:
test table have two familys , each have 3 qualifier, and put 10000 rows into 
the table, each row&qualifier have 1000 versions.
scan use familyFilter get values from the second family, scaned 2000 rows and 
100 versions of each row& qualifier.
Use the oraginal FamilyFilter, cost average 309 seconds, but with the improved 
familyFilter, the cost is average 38 seconds, improved about 700%
ColumnRangeFilter:
The same data but only one family, scan 10000 rows and 1000 versions, orangial 
cost average 68s, the improved cost 64s, improved a little. 
Because in the FamilyFilter, the improve will reduce read files, so improved 
significantly, but the columnRangeFilter can not reduce read files, so imporve 
little.

> Improve efficiency for return codes of some filters
> ---------------------------------------------------
>
>                 Key: HBASE-13122
>                 URL: https://issues.apache.org/jira/browse/HBASE-13122
>             Project: HBase
>          Issue Type: Improvement
>          Components: Filters
>    Affects Versions: 0.94.24, 1.0.1, 0.98.10.1
>            Reporter: Shuaifeng Zhou
>         Attachments: 13122-master.patch, 13122.patch
>
>
> ColumnRangeFilter:
>  when minColumnInclusive is false, it means all the cells at the current 
> row&column not fit the condition, so it should skip to next column, return 
> code should be NEXT_COL, not SKIP.
> FamilyFilter is the similar sitution.
> Currently, SKIP will not causing error, but not efficent.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to