wgtmac commented on a change in pull request #1008:
URL: https://github.com/apache/orc/pull/1008#discussion_r783179137



##########
File path: c++/src/Compression.cc
##########
@@ -533,24 +545,37 @@ DIAGNOSTIC_PUSH
   }
 
   /** There are three possible scenarios when seeking a position:
-   * 1. The seeked position is already read and decompressed into
-   *    the output stream.
-   * 2. It is already read from the input stream, but has not been
-   *    decompressed yet, ie. it's not in the output stream.
-   * 3. It is not read yet from the inputstream.
+   * 1. The chunk of the seeked position is already read and decompressed into 
the output
+   *    stream, ie. chunk header is read and chunk contents are in the output 
stream.
+   * 2. The chunk of the seeked position is partially read. This only happens 
for

Review comment:
       The root cause is in line 495 where DecompressionStream::Next only reads 
availableSize but not the full uncompressed chunk. So another fix is to enforce 
read full uncompressed chunk there.
   

##########
File path: c++/src/Compression.cc
##########
@@ -533,24 +545,37 @@ DIAGNOSTIC_PUSH
   }
 
   /** There are three possible scenarios when seeking a position:

Review comment:
       **four** possible scenarios




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to