ArnavBalyan opened a new pull request, #3326:
URL: https://github.com/apache/parquet-java/pull/3326

   ### Rationale for this change
    - Currently `ColumnChunkPageReadStore` passes the decompressed bytes when 
creating a `DataPageV1`.
    - However, `DataPageV1` derives its compressed size from the bytes it is 
given by the page read store.
    - As a result, the wrong size is recorded on the `DataPageV1`: downstream 
callers of `page.getCompressedSize()` get the uncompressed size instead of the 
compressed size.
    - This also affects the CLI, where `ShowPagesCommand` reports the wrong 
compressed size.
    - Fix this by adding a constructor that accepts both the compressed and 
uncompressed sizes and sets them correctly on the internal Page.
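
The mismatch described above can be illustrated with a minimal, self-contained sketch. It does not use the parquet-java classes: `PageSizeSketch`, `SketchPage`, `buggyRead`, and `fixedRead` are hypothetical names invented for illustration, and the byte counts are made up. It only assumes, as the rationale states, that a page infers its compressed size from the bytes it receives unless both sizes are passed explicitly.

```java
// Hypothetical, simplified model of the problem; NOT the parquet-java DataPageV1 class.
public class PageSizeSketch {

    static final class SketchPage {
        private final byte[] bytes;
        private final int compressedSize;
        private final int uncompressedSize;

        // Existing-style constructor (assumption): the compressed size is
        // inferred from the length of the bytes handed to the page.
        SketchPage(byte[] bytes, int uncompressedSize) {
            this(bytes, bytes.length, uncompressedSize);
        }

        // Added-style constructor (assumption): both sizes are passed
        // explicitly, so the bytes may already be decompressed.
        SketchPage(byte[] bytes, int compressedSize, int uncompressedSize) {
            this.bytes = bytes;
            this.compressedSize = compressedSize;
            this.uncompressedSize = uncompressedSize;
        }

        byte[] getBytes() { return bytes; }
        int getCompressedSize() { return compressedSize; }
        int getUncompressedSize() { return uncompressedSize; }
    }

    // Read path before the fix: decompressed bytes go into the inferring
    // constructor, so the "compressed" size is really the uncompressed size.
    static SketchPage buggyRead(byte[] decompressed) {
        return new SketchPage(decompressed, decompressed.length);
    }

    // Read path after the fix: the on-disk compressed size is carried along.
    static SketchPage fixedRead(byte[] decompressed, int compressedSizeOnDisk) {
        return new SketchPage(decompressed, compressedSizeOnDisk, decompressed.length);
    }

    public static void main(String[] args) {
        byte[] decompressed = new byte[1000]; // made-up page payload
        int compressedSizeOnDisk = 120;       // made-up on-disk size

        System.out.println(buggyRead(decompressed).getCompressedSize());                       // prints 1000
        System.out.println(fixedRead(decompressed, compressedSizeOnDisk).getCompressedSize()); // prints 120
    }
}
```

In this sketch the inferring constructor is kept for callers that really do hold compressed bytes, while the read path that decompresses eagerly switches to the explicit one.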
   
   ### Are these changes tested?
    - Yes
   
   ### Are there any user-facing changes?
    - Yes


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org