[ https://issues.apache.org/jira/browse/PARQUET-2080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17421193#comment-17421193 ]
Gidon Gershinsky commented on PARQUET-2080: ------------------------------------------- Hi [~gszadovszky] , I've prepared a short writeup on this alternative solution, with a discussion of the tradeoffs. After writing it, my feeling is that the trade-off is not in favor of this alternative option; but [here it goes|https://docs.google.com/document/d/1zr6-4em8C8DGi-D3jGosQe2gvJKluat-8uUbS0y7F-0/edit?usp=sharing], just to cover all bases. Will appreciate your opinion on this. > Deprecate RowGroup.file_offset > ------------------------------ > > Key: PARQUET-2080 > URL: https://issues.apache.org/jira/browse/PARQUET-2080 > Project: Parquet > Issue Type: Bug > Components: parquet-format > Reporter: Gabor Szadovszky > Assignee: Gidon Gershinsky > Priority: Major > > Due to PARQUET-2078 RowGroup.file_offset is not reliable. > This field is also wrongly calculated in the C++ oss parquet implementation > PARQUET-2089 -- This message was sent by Atlassian Jira (v8.3.4#803005)