Hello Vinoo/Team,
As per pyarrow Team , They  don't see any concern , please check below.
Please let us know *where it says Parquet V2 is not official *

"> *As per Apache Parquet Community Parquet V2 is not final yet so it is not
> official . They are advising not to use Parquet V2 for writing (though
code
> is available ) .*

This would be news to me.  Parquet releases are listed (by the parquet
community) at [1]

The vote to release parquet 2.10 is here: [2]


*Neither of these links mention anything about this being an
experimental,unofficial, or non-finalized release.*

I understand your concern.  I believe your quotes are coming from your
discussion on the parquet mailing list here [3].  This communication is
unfortunate and confusing to me as well.

[1] https://parquet.apache.org/blog/
[2] https://lists.apache.org/thread/fdf1zz0f3xzz5zpvo6c811xjswhm1zy6
[3] https://lists.apache.org/thread/4nzroc68czwxnp0ndqz15kp1vhcd7vg3";


On Mon, Apr 22, 2024 at 4:56 PM Prem Sahoo <prem.re...@gmail.com> wrote:

> Hello Vinoo/Team,.
> I was going through pyarrow and they have started using V2 as default .
> isn't it they should avoid it as it is not official.
>
>
> https://arrow.apache.org/docs/python/generated/pyarrow.parquet.write_table.html#pyarrow.parquet.write_table
>
> version{“1.0”, “2.4”, “2.6”}, default “2.6”
>
> Determine which Parquet logical types are available for use, whether the
> reduced set from the Parquet 1.x.x format or the expanded logical types
> added in later format versions. Files written with version=’2.4’ or ‘2.6’
> may not be readable in all Parquet implementations, so version=’1.0’ is
> likely the choice that maximizes file compatibility. UINT32 and some
> logical types are only available with version ‘2.4’. Nanosecond timestamps
> are only available with version ‘2.6’. Other features such as compression
> algorithms or the new serialized data page format must be enabled
> separately (see ‘compression’ and ‘data_page_version’).
>

Reply via email to