[
https://issues.apache.org/jira/browse/HIVE-2056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13030814#comment-13030814
]
[email protected] commented on HIVE-2056:
-----------------------------------------------------
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/700/#review651
-----------------------------------------------------------
Update hive-default.xml to include the new parameter.
Mention the new parameter in the title of the jira as well.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
<https://reviews.apache.org/r/700/#comment1306>
Add a comment: this optimization is not enabled
if one of the sub-queries does not involve an
aggregation.
trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java
<https://reviews.apache.org/r/700/#comment1307>
The code is not performing a prefix match.
I mean,
if the query is:
from T
insert overwrite T1 select ... group by c1
insert overwrite T1 select ... group by c2, c1
c1 will still be returned.
Is that desirable ?
I don't think this will work. Can you add a test case
for this, with an explain that shows that
the parameter does not make a difference?
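
To make the distinction concrete, here is a minimal Python sketch of the two ways of computing the common group by keys. It is only an illustration under assumptions: it is not the SemanticAnalyzer code, both function names are hypothetical, and each group by clause is modelled as a plain list of column names.

```python
from typing import List, Sequence


def common_keys_any_position(group_bys: Sequence[Sequence[str]]) -> List[str]:
    """Keys present in every GROUP BY clause, regardless of position (hypothetical helper)."""
    if not group_bys:
        return []
    first, rest = group_bys[0], group_bys[1:]
    return [key for key in first if all(key in other for other in rest)]


def common_keys_prefix(group_bys: Sequence[Sequence[str]]) -> List[str]:
    """Longest leading run of keys shared, in order, by every GROUP BY clause (hypothetical helper)."""
    prefix: List[str] = []
    # zip stops at the shortest clause, so only aligned positions are compared.
    for column in zip(*group_bys):
        if any(key != column[0] for key in column):
            break
        prefix.append(column[0])
    return prefix


# The two clauses from the query above: "group by c1" and "group by c2, c1".
clauses = [["c1"], ["c2", "c1"]]
print(common_keys_any_position(clauses))  # ['c1']
print(common_keys_prefix(clauses))        # []
```

With a position-independent match, c1 is reported as common to both clauses. With a prefix match, nothing is common, because the second clause starts with c2.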
- namit
On 2011-05-09 13:36:28, Amareshwari Sriramadasu wrote:
bq.
bq. -----------------------------------------------------------
bq. This is an automatically generated e-mail. To reply, visit:
bq. https://reviews.apache.org/r/700/
bq. -----------------------------------------------------------
bq.
bq. (Updated 2011-05-09 13:36:28)
bq.
bq.
bq. Review request for hive.
bq.
bq.
bq. Summary
bq. -------
bq.
bq. Attached patch generates a single M/R job for multi group by query with
bq. non-null common group by key set. Added configuration
bq. hive.multigroupby.singlemr to turn on and off the optimization.
bq.
bq.
bq. This addresses bug HIVE-2056.
bq. https://issues.apache.org/jira/browse/HIVE-2056
bq.
bq.
bq. Diffs
bq. -----
bq.
bq. trunk/common/src/java/org/apache/hadoop/hive/conf/HiveConf.java 1100910
bq. trunk/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java 1100910
bq. trunk/ql/src/test/queries/clientpositive/groupby10.q 1100910
bq. trunk/ql/src/test/queries/clientpositive/groupby8.q 1100910
bq. trunk/ql/src/test/queries/clientpositive/groupby8_noskew.q 1100910
bq. trunk/ql/src/test/queries/clientpositive/groupby9.q 1100910
bq. trunk/ql/src/test/results/clientpositive/groupby10.q.out 1100910
bq. trunk/ql/src/test/results/clientpositive/groupby8.q.out 1100910
bq. trunk/ql/src/test/results/clientpositive/groupby9.q.out 1100910
bq.
bq. Diff: https://reviews.apache.org/r/700/diff
bq.
bq.
bq. Testing
bq. -------
bq.
bq. Updated jira with performance tests.
bq.
bq.
bq. Thanks,
bq.
bq. Amareshwari
bq.
bq.
> Generate single MR job for multi groupby query.
> -----------------------------------------------
>
> Key: HIVE-2056
> URL: https://issues.apache.org/jira/browse/HIVE-2056
> Project: Hive
> Issue Type: Improvement
> Reporter: Amareshwari Sriramadasu
> Assignee: Amareshwari Sriramadasu
> Fix For: 0.8.0
>
> Attachments: patch-2056.txt
>
>