[
https://issues.apache.org/jira/browse/HIVE-2340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Phabricator updated HIVE-2340:
------------------------------
Attachment: HIVE-2340.D1209.7.patch
navis updated the revision "HIVE-2340 [jira] optimize orderby followed by a
groupby".
Reviewers: JIRA
1. Does not try RS dedup when child GBY is for grouping set.
2. Prevent converting deduped parent JOIN to MAPJOIN (by
hive.auto.convert.join=true)
For case 2, I don't knww what is better to choose deduped-JOIN or MAPJOIN+RS
With auto_join31.q, deduped-JOIN took 50sec and MAPJOIN+RS took 57sec. But it
would be dependent to situation.
Running test.
REVISION DETAIL
https://reviews.facebook.net/D1209
AFFECTED FILES
ql/src/java/org/apache/hadoop/hive/ql/optimizer/ColumnPrunerProcFactory.java
ql/src/java/org/apache/hadoop/hive/ql/optimizer/ReduceSinkDeDuplication.java
ql/src/java/org/apache/hadoop/hive/ql/optimizer/physical/CommonJoinResolver.java
ql/src/java/org/apache/hadoop/hive/ql/plan/JoinDesc.java
ql/src/test/queries/clientpositive/auto_join16.q
ql/src/test/queries/clientpositive/auto_join22.q
ql/src/test/queries/clientpositive/auto_join24.q
ql/src/test/queries/clientpositive/auto_join26.q
ql/src/test/queries/clientpositive/auto_join30.q
ql/src/test/queries/clientpositive/auto_join31.q
ql/src/test/queries/clientpositive/reduce_deduplicate_extended.q
ql/src/test/results/clientpositive/auto_join0.q.out
ql/src/test/results/clientpositive/auto_join10.q.out
ql/src/test/results/clientpositive/auto_join11.q.out
ql/src/test/results/clientpositive/auto_join12.q.out
ql/src/test/results/clientpositive/auto_join13.q.out
ql/src/test/results/clientpositive/auto_join15.q.out
ql/src/test/results/clientpositive/auto_join16.q.out
ql/src/test/results/clientpositive/auto_join18.q.out
ql/src/test/results/clientpositive/auto_join18_multi_distinct.q.out
ql/src/test/results/clientpositive/auto_join20.q.out
ql/src/test/results/clientpositive/auto_join22.q.out
ql/src/test/results/clientpositive/auto_join24.q.out
ql/src/test/results/clientpositive/auto_join26.q.out
ql/src/test/results/clientpositive/auto_join27.q.out
ql/src/test/results/clientpositive/auto_join30.q.out
ql/src/test/results/clientpositive/auto_join31.q.out
ql/src/test/results/clientpositive/index_bitmap3.q.out
ql/src/test/results/clientpositive/index_bitmap_auto.q.out
ql/src/test/results/clientpositive/join40.q.out
ql/src/test/results/clientpositive/metadataonly1.q.out
ql/src/test/results/clientpositive/ppd_gby_join.q.out
ql/src/test/results/clientpositive/ql_rewrite_gbtoidx.q.out
ql/src/test/results/clientpositive/reduce_deduplicate_extended.q.out
ql/src/test/results/clientpositive/smb_mapjoin_14.q.out
ql/src/test/results/clientpositive/udtf_json_tuple.q.out
ql/src/test/results/clientpositive/union24.q.out
ql/src/test/results/compiler/plan/join2.q.xml
To: JIRA, navis
> optimize orderby followed by a groupby
> --------------------------------------
>
> Key: HIVE-2340
> URL: https://issues.apache.org/jira/browse/HIVE-2340
> Project: Hive
> Issue Type: Sub-task
> Components: Query Processor
> Reporter: Navis
> Assignee: Navis
> Priority: Minor
> Labels: perfomance
> Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.1.patch,
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.2.patch,
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.3.patch,
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.4.patch,
> ASF.LICENSE.NOT.GRANTED--HIVE-2340.D1209.5.patch, HIVE-2340.1.patch.txt,
> HIVE-2340.D1209.6.patch, HIVE-2340.D1209.7.patch
>
>
> Before implementing optimizer for JOIN-GBY, try to implement RS-GBY
> optimizer(cluster-by following group-by).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira