[ https://issues.apache.org/jira/browse/PARQUET-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17646612#comment-17646612 ]
Antoine Pitrou commented on PARQUET-1629: ----------------------------------------- [~mwish] for the record. Perhaps you would be interested in doing this, if you can do some Java. > Page-level CRC checksum verification for DataPageV2 > --------------------------------------------------- > > Key: PARQUET-1629 > URL: https://issues.apache.org/jira/browse/PARQUET-1629 > Project: Parquet > Issue Type: Improvement > Components: parquet-mr > Reporter: Boudewijn Braams > Priority: Major > > In https://jira.apache.org/jira/browse/PARQUET-1580 (Github PR: > https://github.com/apache/parquet-mr/pull/647) we implemented page level CRC > checksum verification for DataPageV1. As a follow up, we should add support > for DataPageV2 that follows the spec (see see > https://jira.apache.org/jira/browse/PARQUET-1539). > What needs to be done: > * Add writing out checksums for DataPageV2 > * Add checksum verification for DataPageV2 > * Create new test suite > * Create new benchmarks -- This message was sent by Atlassian Jira (v8.20.10#820010)