nhufas created SPARK-26769: ------------------------------ Summary: partition prunning in inner join Key: SPARK-26769 URL: https://issues.apache.org/jira/browse/SPARK-26769 Project: Spark Issue Type: Improvement Components: Spark Core Affects Versions: 2.4.0 Reporter: nhufas
When joining a partitioned parquet table with another table by partition column it should prunne partitions from partitioned table based on another table values. example: tableA parquet table partitioned be part_filter tableB table with column with partition values tableA is partitioned by part_A,part_B,part_C,part_D tableB is a single column with 2 rows having part_A and part_B as values. doing select * from tableA inner join tableB on tableA.part_filter=tableB.part_filter should generate a partition prunning on tableA based on tableB values (in this case scanning only 2 partitions) but it wll read all 4 partitions from tableA only filter the results. note: this kind of approach works on Hive (filtering tableA partitions) -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org