[ https://issues.apache.org/jira/browse/SPARK-31067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17056688#comment-17056688 ]
Hyukjin Kwon commented on SPARK-31067: -------------------------------------- [~dirrao], please clarify the steps to reproduce, ideally copy-and-paste-able. > Spark 2.4.* SQL query with partition columns scans entire AVRO data > -------------------------------------------------------------------- > > Key: SPARK-31067 > URL: https://issues.apache.org/jira/browse/SPARK-31067 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 2.4.0, 2.4.4 > Reporter: Gopal > Priority: Major > > Partition Column: dt > SQL Query: select distinct dt from table1 > Table format: AVRO > It is scanning entire avro data in a table to get the distinct dt values -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org