I think you are looking for RowGroup Metadata defined at [1] and accessible through the file reader [2] [1] https://github.com/apache/arrow/blob/main/cpp/src/parquet/metadata.h#L198 [2] https://github.com/apache/arrow/blob/main/cpp/src/parquet/file_reader.h#L55
On Fri, Sep 15, 2023 at 1:26 PM Bipin Mathew <[email protected]> wrote: > Good Afternoon Everyone, > > I am looking for a C++ analog to the python SDK's > pyarrow.parquet.RowGroupMetaData > function ( > https://arrow.apache.org/docs/python/generated/pyarrow.parquet.RowGroupMetaData.html > ). Does one exist? At a high level I am trying to obtain the layout of a > parquet file without decoding the actual data content. Specifically, I need > to obtain the number of row groups ( which I know how to get ) and the > number of rows in *each* row group. What is the best way to do this? > > Regards, > > Bipin > >
