sunchao commented on a change in pull request #32753:
URL: https://github.com/apache/spark/pull/32753#discussion_r657284056
##########
File path: sql/core/src/main/java/org/apache/spark/sql/execution/datasources/parquet/VectorizedColumnReader.java
##########
@@ -95,6 +80,18 @@
    */
   private final ParquetVectorUpdaterFactory updaterFactory;
 
+  /**
+   * Helper struct to track intermediate states while reading Parquet pages in the column chunk.
+   */
+  private final ParquetReadState readState;
+
+  /**
+   * The index for the first row in the current page, among all rows across all pages in the
+   * column chunk for this reader. The value for this is 0 if there is no column index for the

Review comment:
Yes it can happen. Perhaps I should rephrase it to: "If there is no column index, the value for this is 0".
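
For illustration only, the behavior described in the proposed Javadoc wording ("If there is no column index, the value for this is 0") could be sketched as below. This is a minimal sketch and not the code in the pull request: the class `PageRowIndexSketch` and the method `firstRowIndex` are hypothetical names, and the sketch assumes the page's first row index arrives as an `Optional<Long>` that is empty when the file has no column index.

```java
import java.util.Optional;

// Hypothetical stand-in for illustration; not the actual VectorizedColumnReader.
final class PageRowIndexSketch {

    /**
     * Returns the index of the first row in the current page, counted across
     * all pages in the column chunk. Assumption: an empty Optional means the
     * file has no column index, in which case the value is 0.
     */
    static long firstRowIndex(Optional<Long> fromColumnIndex) {
        return fromColumnIndex.orElse(0L);
    }

    public static void main(String[] args) {
        // No column index: falls back to 0.
        System.out.println(firstRowIndex(Optional.empty()));
        // Column index present: the recorded first row index is used as-is.
        System.out.println(firstRowIndex(Optional.of(1000L)));
    }
}
```

Stating the fallback first, as in the rephrased comment, mirrors the `orElse(0L)` default in this sketch: the no-column-index case is the condition, and 0 is the result.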