[C++] ways to read parquet

Алексей Рябов Sun, 02 Apr 2023 22:30:01 -0700

Hello Team.

As far as i could find in documentation/samples, there are 2 ways for
reading parquet files:
- using FileReader from parquet::arrow namespace.
- using low-level ParquetFileReader from parquet namespace.


1st one reads to arrow tables, transforming parquet data to arrow
types according to logical data in parquet schema. 2nd one reads raw
parquet bytes w/o any transform.
When reading to arrow tabes I can use threads to speedup process, but
get only arrow types, not raw bytes. The question is if there is a way
to read parquet file using threads but w/o converting to arrow types,
thus, getting an arrow table where each raw bytes are not converted to
arrow types, such as Utf8, Decimal128 and so on (except primitives)?

Please advise.

[C++] ways to read parquet

Reply via email to