[ https://issues.apache.org/jira/browse/SPARK-16847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen updated SPARK-16847: ------------------------------ Assignee: Hyukjin Kwon > Prevent to potentially read corrupt statstics on binary in Parquet via > VectorizedReader > --------------------------------------------------------------------------------------- > > Key: SPARK-16847 > URL: https://issues.apache.org/jira/browse/SPARK-16847 > Project: Spark > Issue Type: Improvement > Components: SQL > Affects Versions: 2.0.0 > Reporter: Hyukjin Kwon > Assignee: Hyukjin Kwon > Priority: Minor > Fix For: 2.1.0 > > > It is still possible to read corrupt Parquet's statistics. > This problem was found in PARQUET-251 and we disabled filter pushdown on > binary columns in Spark before. > We enabled this after upgrading Parquet but it seems there are potential > incompatibility for Parquet files written in lower Spark versions. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org