sunchao commented on a change in pull request #32753:
URL: https://github.com/apache/spark/pull/32753#discussion_r657284056



##########
File path: 
sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/VectorizedColumnReader.java
##########
@@ -95,6 +80,18 @@
    */
   private final ParquetVectorUpdaterFactory updaterFactory;
 
+  /**
+   * Helper struct to track intermediate states while reading Parquet pages in 
the column chunk.
+   */
+  private final ParquetReadState readState;
+
+  /**
+   * The index for the first row in the current page, among all rows across 
all pages in the
+   * column chunk for this reader. The value for this is 0 if there is no 
column index for the

Review comment:
       Yes it can happen. Perhaps I should rephrase it to: "If there is no 
column index, the value for this is 0".




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to