[Python][Parquet]pq.ParquetFile.read faster than pq.read_table?

Shawn Zeng Wed, 23 Feb 2022 21:07:59 -0800

Hi all, I found that for the same parquet file,
using pq.ParquetFile(file_name).read() takes 6s while
pq.read_table(file_name) takes 17s. How do those two apis differ? I thought
they use the same internals but it seems not. The parquet file is 865MB,
snappy compression and enable dictionary. All other settings are default,
writing with pyarrow.

[Python][Parquet]pq.ParquetFile.read faster than pq.read_table?

Reply via email to