mapleFU commented on PR #38272:
URL: https://github.com/apache/arrow/pull/38272#issuecomment-1768403422

   I found that arrow-cpp can read the file because we don't check the 
decompressed size:
   
   ```c++
     // Decompress the values
     PARQUET_ASSIGN_OR_THROW(auto decompress_len, decompressor_->Decompress(
         compressed_len - levels_byte_len, page_buffer->data() + levels_byte_len,
         uncompressed_len - levels_byte_len,
         decompression_buffer_->mutable_data() + levels_byte_len));
     if (decompress_len != uncompressed_len - levels_byte_len) {
       throw ParquetException("Expected " + std::to_string(uncompressed_len - levels_byte_len) +
                              " bytes but decompressed " + std::to_string(decompress_len));
     }
     }
   ```
   
   @wgtmac @pitrou Do you think we need to add the check above?
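
To make the proposed behaviour concrete outside the Parquet reader, here is a minimal standalone sketch of the same pattern: decompress into a buffer sized from the declared length, then reject the page if the codec produced a different number of bytes. `ToyDecompress` and `DecompressChecked` are hypothetical names, and the run-length codec is only a stand-in for a real decompressor. None of this is Arrow code.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical stand-in codec: the input is a sequence of (count, byte) pairs.
// Returns the number of bytes written to `out`, and throws if `out` is too
// small to hold the whole expansion.
std::size_t ToyDecompress(const std::vector<std::uint8_t>& in,
                          std::vector<std::uint8_t>& out) {
  std::size_t written = 0;
  for (std::size_t i = 0; i + 1 < in.size(); i += 2) {
    for (std::uint8_t n = 0; n < in[i]; ++n) {
      if (written == out.size()) {
        throw std::runtime_error("output buffer too small");
      }
      out[written++] = in[i + 1];
    }
  }
  return written;
}

// Decompress into a buffer sized from the declared (header) length and verify
// that the codec actually produced exactly that many bytes.
std::vector<std::uint8_t> DecompressChecked(const std::vector<std::uint8_t>& in,
                                            std::size_t expected_len) {
  std::vector<std::uint8_t> out(expected_len);
  const std::size_t actual_len = ToyDecompress(in, out);
  if (actual_len != expected_len) {
    throw std::runtime_error("Expected " + std::to_string(expected_len) +
                             " bytes but decompressed " +
                             std::to_string(actual_len));
  }
  return out;
}
```

With this check, a header that declares 8 bytes for a stream that only expands to 5 is rejected instead of being read with 3 uninitialized trailing bytes.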

