[
https://issues.apache.org/jira/browse/SPARK-8403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hong Shen updated SPARK-8403:
-
Description:
When partition field and fieldSchema exist in sql predicates, pruner partition
won't effective.
Here is the sql,
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_209026 r where
r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)16)
{code}
Table t_dw_qqlive_209026 is partition by imp_date, itimestamp is a
fieldSchema in t_dw_qqlive_209026.
When run on hive, it will only scan data in partition 20150615, but if run on
spark sql, it will scan the whole table t_dw_qqlive_209026.
was:
When partition field and fieldSchema exist in sql predicates, pruner partition
won't effective.
Here is the sql,
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_209026 r where
r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)16)
{code}
When run on hive, it will only scan data in partition 20150615, but if run on
spark sql, it will scan the whole table t_dw_qqlive_209026.
Pruner partition won't effective when partition field and fieldSchema exist
in sql predicate
Key: SPARK-8403
URL: https://issues.apache.org/jira/browse/SPARK-8403
Project: Spark
Issue Type: Bug
Components: SQL
Reporter: Hong Shen
When partition field and fieldSchema exist in sql predicates, pruner
partition won't effective.
Here is the sql,
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_209026 r
where r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)16)
{code}
Table t_dw_qqlive_209026 is partition by imp_date, itimestamp is a
fieldSchema in t_dw_qqlive_209026.
When run on hive, it will only scan data in partition 20150615, but if run on
spark sql, it will scan the whole table t_dw_qqlive_209026.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org