Prasanth J created HIVE-7589:
--------------------------------
Summary: Some fixes and improvements to statistics annotation rules
Key: HIVE-7589
URL: https://issues.apache.org/jira/browse/HIVE-7589
Project: Hive
Issue Type: Sub-task
Affects Versions: 0.14.0
Reporter: Prasanth J
Assignee: Prasanth J
*FIXES:*
1) JOIN rule does not properly propagate the column statistics from its parent
2) Multi-way join rule computes the denominator for #rows estimation wrongly
3) GROUPBY rule does not account for the data size of aggregate column
4) Prefix removal from column names wasn't working
5) GROUPBY rule looks at missing column statistics for aggregate column from
its parent and assumes PARTIAL column stats state
*IMPROVEMENTS:*
1) Replaced "EXPLAIN EXTENDED" with "EXPLAIN" in test cases to make the golden
files easy to comprehend and to reduce verbosity
2) Introduced rule for ReduceSink operator which only does renaming of column
statistics as per output row schema
3) Added more rows to the test datasets to avoid 0 row scenario in join test
cases
4) JOIN rule improvement to avoid long overflow
--
This message was sent by Atlassian JIRA
(v6.2#6252)