[ https://issues.apache.org/jira/browse/PARQUET-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17703893#comment-17703893 ]
ASF GitHub Bot commented on PARQUET-1629:
-----------------------------------------

wgtmac opened a new pull request, #1044:
URL: https://github.com/apache/parquet-mr/pull/1044

### Jira
https://issues.apache.org/jira/browse/PARQUET-1629

### Tests
TODO

### Commits
- Writer computes crc and writes it into data page v2 header if enabled.
- Reader verifies crc from data page v2 header if enabled.

> Page-level CRC checksum verification for DataPageV2
> ---------------------------------------------------
>
>                 Key: PARQUET-1629
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1629
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>            Reporter: Boudewijn Braams
>            Assignee: Gang Wu
>            Priority: Major
>
> In https://jira.apache.org/jira/browse/PARQUET-1580 (Github PR:
> https://github.com/apache/parquet-mr/pull/647) we implemented page level CRC
> checksum verification for DataPageV1. As a follow up, we should add support
> for DataPageV2 that follows the spec (see
> https://jira.apache.org/jira/browse/PARQUET-1539).
> What needs to be done:
> * Add writing out checksums for DataPageV2
> * Add checksum verification for DataPageV2
> * Create new test suite
> * Create new benchmarks

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
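
For readers unfamiliar with the two commits listed in the pull request description, here is a minimal sketch of what writing and verifying a DataPageV2 checksum could look like. It is not the code from the pull request. The class `PageCrcSketch`, its method names, and the use of `IllegalStateException` are hypothetical. The sketch assumes the checksum is a standard CRC32 taken over the page bytes as laid out on disk after the header (repetition levels, then definition levels, then the possibly compressed values) and stored as a signed 32-bit integer.

```java
import java.util.zip.CRC32;

// Hypothetical helper, not part of parquet-mr.
final class PageCrcSketch {

    // Writer side: checksum the three sections of a v2 data page in their
    // assumed on-disk order and return the value to store in the page header.
    static int computeCrc(byte[] repetitionLevels, byte[] definitionLevels, byte[] data) {
        CRC32 crc = new CRC32();
        crc.update(repetitionLevels, 0, repetitionLevels.length);
        crc.update(definitionLevels, 0, definitionLevels.length);
        crc.update(data, 0, data.length);
        // CRC32.getValue() returns an unsigned 32-bit value in a long;
        // the cast keeps the same 32 bits in a signed int.
        return (int) crc.getValue();
    }

    // Reader side: recompute the checksum and compare it with the header value.
    static void verifyCrc(int expectedCrc, byte[] repetitionLevels,
                          byte[] definitionLevels, byte[] data) {
        int actualCrc = computeCrc(repetitionLevels, definitionLevels, data);
        if (actualCrc != expectedCrc) {
            // Assumption: a real reader would likely raise its own decoding exception type.
            throw new IllegalStateException(String.format(
                "Page CRC mismatch: header has %08x, computed %08x", expectedCrc, actualCrc));
        }
    }
}
```

In this sketch both steps would sit behind the "if enabled" switches mentioned in the commit list, so files written without checksums stay readable and readers that skip verification pay no extra cost.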