hadrian-reppas commented on issue #46094: URL: https://github.com/apache/arrow/issues/46094#issuecomment-2992055287
Since the encoding only supports literal runs with lengths that are a multiple of 8, the `RleEncoder` sometimes has to pad the last literal run with zeros. This means that the number of encoded values can't be determined from just looking at the run-length encoded data since the encoding does not differentiate between real values and padding values. So calling `RleDecoder::Get` until it returns false might result in extra zeros from the padding. The solution is to separately keep track of the number of values and only call `Get` that many times. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
