[GitHub] [spark] fuwhu opened a new pull request #27232: [SPARK-30525][SQL]HiveTableScanExec do not need to prune partitions again after pushing down to hive metastore

GitBox Sun, 19 Jan 2020 23:25:26 -0800

fuwhu opened a new pull request #27232: [SPARK-30525][SQL]HiveTableScanExec do 
not need to prune partitions again after pushing down to hive metastore
URL: https://github.com/apache/spark/pull/27232
 
 
   ### What changes were proposed in this pull request?
   HiveTableScanExec does not prune partitions again after 
SessionCatalog.listPartitionsByFilter called.
   
   ### Why are the changes needed?
   In HiveTableScanExec, it will push down to hive metastore for partition 
pruning if spark.sql.hive.metastorePartitionPruning is true, and then it will 
prune the returned partitions again using partition filters, because some 
predicates, eg. "b like 'xyz'", are not supported in hive metastore. But now 
this problem is already fixed in HiveExternalCatalog.listPartitionsByFilter, 
the HiveExternalCatalog.listPartitionsByFilter can return exactly what we want 
now. So it is not necessary any more to double prune in HiveTableScanExec.
   
   
   ### Does this PR introduce any user-facing change?
   no
   
   ### How was this patch tested?
   Existing unit tests.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] [spark] fuwhu opened a new pull request #27232: [SPARK-30525][SQL]HiveTableScanExec do not need to prune partitions again after pushing down to hive metastore

Reply via email to