Vineet Garg created HIVE-15933:
----------------------------------
Summary: Improve plans for correlated subquery with join and
predicate
Key: HIVE-15933
URL: https://issues.apache.org/jira/browse/HIVE-15933
Project: Hive
Issue Type: Sub-task
Components: Query Planning
Reporter: Vineet Garg
Assignee: Vineet Garg
This is a continuation of HIVE-15905
for queries such as:
{code:SQL}
explain select
cd_gender,
cd_marital_status,
cd_education_status,
count(*) cnt1,
cd_purchase_estimate,
count(*) cnt2,
cd_credit_rating,
count(*) cnt3,
cd_dep_count,
count(*) cnt4,
cd_dep_employed_count,
count(*) cnt5,
cd_dep_college_count,
count(*) cnt6
from
customer c,customer_address ca,customer_demographics
where
c.c_current_addr_sk = ca.ca_address_sk and
ca_county in ('Walker County','Richland County','Gaines County','Douglas
County','Dona Ana County') and
cd_demo_sk = c.c_current_cdemo_sk and
exists (select *
from store_sales,date_dim
where c.c_customer_sk = ss_customer_sk and
ss_sold_date_sk = d_date_sk and
d_year = 2002 and
d_moy between 4 and 4+3)
group by cd_gender,
cd_marital_status,
cd_education_status,
cd_purchase_estimate,
cd_credit_rating,
cd_dep_count,
cd_dep_employed_count,
cd_dep_college_count
order by cd_gender,
cd_marital_status,
cd_education_status,
cd_purchase_estimate,
cd_credit_rating,
cd_dep_count,
cd_dep_employed_count,
cd_dep_college_count
limit 100;
{code}
HIVE generates un-necessary joins to produce value for correlated columns.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)