Totally +1. This puts a much needed clarity on what "v2" means. On Fri, Dec 5, 2025 at 11:55 PM Micah Kornfield <[email protected]> wrote:
> There still appears to be a recurring question for what exactly constitutes > Parquet 2.0. > > Given current implementation statuses, my suggestion is to not mention 2.0 > in general. I've made a proposed changes > <https://github.com/apache/parquet-format/pull/535> [1] to this effect in > a > parquet-format repo to try to give guidance that: > > 1. All encodings documented can now be used regardless of page type. > 2. DataPageHeaderV2 is now widely supported by readers > 3. Versions should be populated with "1", but readers should accept "1" > and "2". > > Thoughts? Does this seem like a reasonable path forward? > > Thanks, > Micah > > > [1] https://github.com/apache/parquet-format/pull/535 >
