quick question: how to serialize keys in the nested objects? Do we need to
concatenate its parent key like the json path?

On Tue, May 13, 2025 at 3:19 PM wish maple <[email protected]> wrote:

> Just to make sure if it's ok or this should be forbidden. Since it
> affect how reader/writer handles this
>
> Best,
> Xuwei Fu
>
> Aihua Xu <[email protected]> 于2025年5月13日周二 14:32写道:
>
> > It should be just single ‘a’ to reduce the storage by reusing the same
> > key. Any reason that we want to keep both ‘a’ there?
> >
> >
> >
> > > On May 12, 2025, at 7:43 PM, wish maple <[email protected]>
> wrote:
> > >
> > > Thanks! So, in the nested object scenario, would the metadata be
> > > field 0: "a", field 1: "a" or just field 0: "a"
> > > do the both way is ok for reader/writer, or we need limit the
> > > metadata implementation?
> > >
> > > Best,
> > > Xuwei Fu
> > >
> > > Ryan Blue <[email protected]> 于2025年5月13日周二 04:05写道:
> > >
> > >> Keys may appear in nested objects, but cannot appear in the same
> > object. So
> > >> the first example, {"a": {"a": 1}} is allowed. The second example,
> > {"a": 1,
> > >> "a": 2} is not allowed.
> > >>
> > >> Ryan
> > >>
> > >>> On Sun, May 11, 2025 at 11:47 PM wish maple <[email protected]>
> > >>> wrote:
> > >>>
> > >>> In the Parquet variant spec, metadata part says that
> > >>>
> > >>>> Object: An unordered collection of string/Variant pairs (i.e.
> > key/value
> > >>> pairs). An object may not contain duplicate keys. [1]
> > >>>
> > >>> Considering a nested json object like {"a": {"a": 1}}, would the
> > metadata
> > >>> like field 0: "a", field 1: "a" or just field 0: "a" , or both of
> them
> > is
> > >>> ok for reader/writer?
> > >>>
> > >>> And besides, would duplicate keys be allowed in the same object? Like
> > >> {"a":
> > >>> 1, "a": 2}?
> > >>>
> > >>> Best, Xuwei Fu
> > >>>
> > >>> [1]
> > >>>
> > https://github.com/apache/parquet-format/blob/master/VariantEncoding.md
> > >>>
> > >>
> >
>

Reply via email to