[ 
https://issues.apache.org/jira/browse/PHOENIX-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939034#comment-13939034
 ] 

Hudson commented on PHOENIX-846:
--------------------------------

SUCCESS: Integrated in Apache Phoenix - Branch:master #186 (See 
[https://builds.apache.org/job/Phoenix/186/])
PHOENIX-846 Select DISTINCT with LIMIT does full scans (JamesTaylor) 
(jamestaylor: rev f268b45ba02228392a7ea8b7e3703c5d41e51cea)
* phoenix-core/src/main/java/org/apache/phoenix/query/QueryServices.java
* 
phoenix-core/src/main/java/org/apache/phoenix/cache/aggcache/SpillableGroupByCache.java
* phoenix-core/src/main/java/org/apache/phoenix/compile/GroupByCompiler.java
* phoenix-core/src/main/java/org/apache/phoenix/join/HashCacheFactory.java
* phoenix-core/src/main/java/org/apache/phoenix/iterate/ExplainTable.java
* phoenix-core/src/main/java/org/apache/phoenix/cache/aggcache/SpillManager.java
* phoenix-core/src/it/java/org/apache/phoenix/end2end/QueryIT.java
* phoenix-core/src/test/java/org/apache/phoenix/compile/QueryCompilerTest.java
* phoenix-core/src/main/java/org/apache/phoenix/execute/AggregatePlan.java
* phoenix-core/src/main/java/org/apache/phoenix/cache/GlobalCache.java
* 
phoenix-core/src/main/java/org/apache/phoenix/coprocessor/BaseScannerRegionObserver.java
* 
phoenix-core/src/main/java/org/apache/phoenix/coprocessor/GroupedAggregateRegionObserver.java
* phoenix-core/src/it/java/org/apache/phoenix/end2end/UpsertValuesIT.java
* phoenix-core/src/main/java/org/apache/phoenix/util/SizedUtil.java
* phoenix-core/src/main/java/org/apache/phoenix/coprocessor/GroupByCache.java
* phoenix-core/src/main/java/org/apache/phoenix/schema/PColumnFamilyImpl.java
* phoenix-core/src/it/java/org/apache/phoenix/end2end/SpillableGroupByIT.java
* phoenix-core/src/main/java/org/apache/phoenix/compile/AggregationManager.java


> Select DISTINCT with LIMIT does full scans
> ------------------------------------------
>
>                 Key: PHOENIX-846
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-846
>             Project: Phoenix
>          Issue Type: Bug
>    Affects Versions: 3.0.0, 4.0.0
>            Reporter: alex kamil
>            Assignee: James Taylor
>            Priority: Critical
>             Fix For: 3.0.0, 4.0.0, 5.0.0
>
>         Attachments: PHOENIX-846.patch
>
>
> When running SELECT DISTINCT with LIMIT it does full scan and aggregation (no 
> pageFilter/limit used on server side), 
> this severely affects performance  (query returns in 20sec vs 300ms without 
> DISTINCT)
> : jdbc:phoenix:localhost> explain select DISTINCT ROWKEY from TEST_1M LIMIT 
> 100;
> +------------+
> |    PLAN    |
> +------------+
> | CLIENT PARALLEL 30-WAY FULL SCAN OVER TEST_1M |
> |     SERVER FILTER BY FIRST KEY ONLY |
> |     SERVER AGGREGATE INTO ORDERED DISTINCT ROWS BY [ROWKEY] |
> | CLIENT MERGE SORT |
> | CLIENT 100 ROW LIMIT |
> +------------+
> -------------------------------------------------
> for comparison SELECT without  DISTINCT uses a limit PageFilter=100 on server 
> side and doesn't do full scan (query returns in 300ms)
> explain select ROWKEY from TEST_1M LIMIT 100;
> +------------+
> |    PLAN    |
> +------------+
> | CLIENT PARALLEL 30-WAY FULL SCAN OVER TEST_1M |
> |     SERVER FILTER BY FIRST KEY ONLY AND PageFilter 100 |
> | CLIENT MERGE SORT |
> | CLIENT 100 ROW LIMIT |
> +------------+



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to