[C+]About Parquet writer, encoding

Xinyu Zeng Wed, 23 Mar 2022 05:40:18 -0700

Why is DELTA_BINARY_PACKED not supported for writing Parquet since it
may be useful for some sorted int column?


Also, I was wondering if the original Parquet implementation(seems
like parquet-mr) give each data type a default encoding(or fall back
encoding when dictionary is enabled) other than PLAIN? It seems like
for Arrow, we have to manually set the best encoding for each column.
Would a default non plain encoding be beneficial?

Another question is, as from the encoding section, reading is much
more supported than writing Parquet, does it mean Arrow is more
favored to be used as a read-only query engine rather than writing?
e.g it simply uses Parquet files sourced from other applications.

Thanks

[C+]About Parquet writer, encoding

Reply via email to