mapleFU commented on PR #38272:
URL: https://github.com/apache/arrow/pull/38272#issuecomment-1768403422
I found arrow-cpp we can read the file because we don't check the decompress
size:
```c++
// Decompress the values
PARQUET_ASSIGN_OR_THROW(auto decompress_len, decompressor_->Decompress(
compressed_len - levels_byte_len, page_buffer->data() +
levels_byte_len,
uncompressed_len - levels_byte_len,
decompression_buffer_->mutable_data() + levels_byte_len));
if (decompress_len != uncompressed_len - levels_byte_len) {
throw ParquetException("Expected " + std::to_string(uncompressed_len -
levels_byte_len) +
" bytes but decompressed " +
std::to_string(decompress_len));
}
```
@wgtmac @pitrou Do you think we need to add the check above?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]