[ https://issues.apache.org/jira/browse/SPARK-26345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17308000#comment-17308000 ]
Xinli Shang commented on SPARK-26345:
-------------------------------------

Yes, it needs some synchronization. I have a modified implementation in Presto. You can check it [here|https://github.com/shangxinli/presto/commit/f6327a161eb6cfd5137f679620e095d8257816b8#diff-bb24b92e28343804ebaf540efe6c1cda0b5e2524e6811f8fe2daee5944dad386R203].

> Parquet support Column indexes
> ------------------------------
>
>                 Key: SPARK-26345
>                 URL: https://issues.apache.org/jira/browse/SPARK-26345
>             Project: Spark
>          Issue Type: Umbrella
>          Components: SQL
>    Affects Versions: 3.1.0
>            Reporter: Yuming Wang
>            Assignee: Yuming Wang
>            Priority: Major
>             Fix For: 3.2.0
>
>
> Parquet 1.11 supports column indexing. Spark can support this feature for
> better read performance.
> More details:
> https://issues.apache.org/jira/browse/PARQUET-1201
>
> Benchmark result:
> [https://github.com/apache/spark/pull/31393#issuecomment-769767724]
> This feature is enabled by default, and users can disable it by setting
> {{parquet.filter.columnindex.enabled}} to false.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)