[
https://issues.apache.org/jira/browse/PHOENIX-212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gabriel Reid resolved PHOENIX-212.
----------------------------------
Resolution: Fixed
Bulk resolve of closed issues imported from GitHub. This status was reached by
first re-opening all closed imported issues and then resolving them in bulk.
> Use fuzzy row filter to optimize LIKE queries
> ---------------------------------------------
>
> Key: PHOENIX-212
> URL: https://issues.apache.org/jira/browse/PHOENIX-212
> Project: Phoenix
> Issue Type: Task
> Reporter: James Taylor
>
> Use the technique outlined by Alex Baranau in his
> [blog](http://blog.sematext.com/2012/08/09/consider-using-fuzzyrowfilter-when-in-need-for-secondary-indexes-in-hbase/)
> to optimize queries like this:
> SELECT * FROM web_stats WHERE domain LIKE 'foo%' AND date >= :1 AND date < :2
> assuming the PK is domain+date. In this case, the scan would have a start key
> of 'foo' and a stop key of 'fop', and the filter would:
> 1. jump to [domain column value] + [:1]
> 2. include all rows while [domain column value] is the same and [date column
> value] < :2
> 3. repeat the above while there are more rows
> In the case where the cardinality of domains starting with 'foo' is low and
> the cardinality of the dates is high, this will improve performance a lot,
> because the scan skips the rows of each domain that fall outside the date range.
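
The three filter steps quoted above can be sketched as a small in-memory simulation. This is only an illustration under stated assumptions, not Phoenix or HBase code: the sorted list of (domain, date) tuples stands in for rows ordered by the domain+date PK, `bisect_left` stands in for a seek hint, and the names `prefix_stop_key` and `skip_scan` are hypothetical.

```python
from bisect import bisect_left


def prefix_stop_key(prefix):
    """Return the exclusive stop key for a prefix scan, e.g. 'foo' -> 'fop'.

    Assumes a non-empty prefix whose last character can be incremented.
    """
    return prefix[:-1] + chr(ord(prefix[-1]) + 1)


def skip_scan(rows, prefix, date_lo, date_hi):
    """Simulate the filter over rows sorted by (domain, date).

    Returns rows where domain starts with prefix and date_lo <= date < date_hi.
    Each bisect_left call models a seek instead of a row-by-row read.
    """
    stop = prefix_stop_key(prefix)
    matches = []
    # Position at the scan start key (the bare prefix).
    i = bisect_left(rows, (prefix,))
    while i < len(rows) and rows[i][0] < stop:
        domain = rows[i][0]
        # Step 1: jump to [domain column value] + [:1].
        i = bisect_left(rows, (domain, date_lo), lo=i)
        # Step 2: include rows while the domain is the same and date < :2.
        while i < len(rows) and rows[i][0] == domain and rows[i][1] < date_hi:
            matches.append(rows[i])
            i += 1
        # Step 3: if rows of this domain remain, seek past them to the
        # next domain, then repeat from step 1.
        if i < len(rows) and rows[i][0] == domain:
            i = bisect_left(rows, (domain + "\x00",), lo=i)
    return matches


if __name__ == "__main__":
    # Hypothetical sample data; integers stand in for dates.
    sample = sorted([
        ("bar.com", 1),
        ("foo.com", 1), ("foo.com", 5), ("foo.com", 9),
        ("foo.org", 3), ("foo.org", 6),
        ("fop.com", 5),
    ])
    for row in skip_scan(sample, "foo", 3, 7):
        print(row)
```

The number of seeks grows with the number of distinct domains matching the prefix rather than with the number of dates per domain, which is why the approach pays off when the former is low and the latter is high.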