John Smith created SPARK-52160: ---------------------------------- Summary: Dynamic pruning not being used with broadcast hash join Key: SPARK-52160 URL: https://issues.apache.org/jira/browse/SPARK-52160 Project: Spark Issue Type: Improvement Components: Spark Core Affects Versions: 3.5.5 Reporter: John Smith
We have a hive partitioned dataset, partitioned on year, month, day. When doing a broadcast hash join with that table, with the partition keys as the join keys, dynamic partition pruning is not being used. It's reading all partitions. The query would be a lot faster with DPP. When filtering on year, month, and day (no join), Spark does read only the partitions that match the filter. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org