[ https://issues.apache.org/jira/browse/SPARK-33760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17584882#comment-17584882 ]
Willi Raschkowski commented on SPARK-33760:
-------------------------------------------

Is this related to SPARK-35779?

> Extend Dynamic Partition Pruning Support to DataSources
> -------------------------------------------------------
>
>                 Key: SPARK-33760
>                 URL: https://issues.apache.org/jira/browse/SPARK-33760
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 3.0.1
>            Reporter: Anoop Johnson
>            Priority: Major
>
> The implementation of Dynamic Partition Pruning (DPP) in Spark is
> [specific|https://github.com/apache/spark/blob/fb2e3af4b5d92398d57e61b766466cc7efd9d7cb/sql/core/src/main/scala/org/apache/spark/sql/execution/dynamicpruning/PartitionPruning.scala#L59-L64]
> to HadoopFSRelation. As a result, DPP is not triggered for queries that use
> data sources.
> The DataSource v2 readers can expose the partition metadata. Can we use this
> metadata and extend DPP to work on data sources as well?
> Would appreciate thoughts or corner cases we need to handle.