[ https://issues.apache.org/jira/browse/SPARK-8403?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hong Shen updated SPARK-8403:
-----------------------------
    Description:
When a UDF exists in the SQL predicates, partition pruning does not take effect. Here is the SQL:
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_2090000026 r
where r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)>16)
{code}
When run on Hive, the query scans only the data in partition 20150615, but when run on Spark SQL, it scans the whole table t_dw_qqlive_2090000026.

  was:
When a UDF exists in the SQL predicates, partition pruning does not take effect. Here is the SQL:
{code}
select r.uin,r.vid,r.ctype,r.bakstr2,r.cmd from t_dw_qqlive_2090000026 r
where r.cmd = 2 and (r.imp_date = 20150615 or and hour(r.itimestamp)>16)
{code}
When run on Hive, the query scans only the data in partition 20150615, but when run on Spark SQL, it scans the whole table from t_dw_qqlive_2090000026.

> Pruner partition won't effective when udf exit in sql predicates
> ----------------------------------------------------------------
>
>                 Key: SPARK-8403
>                 URL: https://issues.apache.org/jira/browse/SPARK-8403
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>            Reporter: Hong Shen
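
The pruning behavior the report expects can be illustrated with a small conceptual sketch in Python. This is not Hive's or Spark's implementation. It assumes that `imp_date` is the partition column (as the report implies), that a planner can evaluate partition-column comparisons from metadata alone, and that anything depending on row data, such as `hour(r.itimestamp) > 16`, is treated as unknown until rows are read. The helper names and the list of partition values are hypothetical. Because the operator inside the parentheses is ambiguous in the query as written, the sketch evaluates both readings. The `r.cmd = 2` conjunct is omitted: it is also row data, so AND-ing it in does not change which partitions survive.

```python
from typing import Callable, Optional

# Three-valued logic: True, False, or None (unknown at planning time).
Tri = Optional[bool]


def tri_and(a: Tri, b: Tri) -> Tri:
    """AND is definitely False as soon as either side is False."""
    if a is False or b is False:
        return False
    if a is None or b is None:
        return None
    return True


def tri_or(a: Tri, b: Tri) -> Tri:
    """OR is definitely True as soon as either side is True."""
    if a is True or b is True:
        return True
    if a is None or b is None:
        return None
    return False


def keep_partition(imp_date: int, combine: Callable[[Tri, Tri], Tri]) -> bool:
    """Return True if the partition might contain matching rows."""
    on_partition_col: Tri = imp_date == 20150615  # known from partition metadata
    on_row_data: Tri = None  # hour(r.itimestamp) > 16: unknown until rows are read
    verdict = combine(on_partition_col, on_row_data)
    # A partition may be skipped only when the predicate is definitely False.
    return verdict is not False


partitions = [20150614, 20150615, 20150616]  # hypothetical partition values

kept_with_and = [p for p in partitions if keep_partition(p, tri_and)]
kept_with_or = [p for p in partitions if keep_partition(p, tri_or)]

print(kept_with_and)  # [20150615]
print(kept_with_or)   # [20150614, 20150615, 20150616]
```

Under the AND reading, only partition 20150615 has to be scanned, which matches the Hive behavior described in the report. Under the OR reading, no partition can be ruled out from metadata alone, so a full scan would be expected from any engine.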