[RESULT][VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-28 Thread Sutou Kouhei
Resend with "[RESULT]" prefix.

In 
  "Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format" on Mon, 28 Aug 2023 
14:18:53 -0400,
  Benjamin Kietzman  wrote:

> The vote passes with 7 binding +1 votes, 5 non-binding +1 votes, and no
> -1/0 votes.
> 
> Thanks everyone!
> 
> On Thu, Aug 24, 2023 at 10:10 AM Antoine Pitrou  wrote:
> 
>>
>> +1 on the format additions
>>
>> The implementations will probably need a bit more review back-and-forth.
>>
>> Regards
>>
>> Antoine.
>>
>>
>> Le 28/06/2023 à 21:34, Benjamin Kietzman a écrit :
>> > Hello,
>> >
>> > I'd like to propose adding Utf8View arrays to the arrow format.
>> > Previous discussion in [1], columnar format description in [2],
>> > flatbuffers changes in [3].
>> >
>> > There are implementations available in both C++[4] and Go[5] which
>> > exercise the new type over IPC. Utf8View format demonstrates[6]
>> > significant performance benefits over Utf8 in common tasks.
>> >
>> > The vote will be open for at least 72 hours.
>> >
>> > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
>> > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
>> > because...
>> >
>> > Sincerely,
>> > Ben Kietzman
>> >
>> > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
>> > [2]
>> >
>> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
>> > [3]
>> >
>> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
>> > [4] https://github.com/apache/arrow/pull/35628
>> > [5] https://github.com/apache/arrow/pull/35769
>> > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
>> >
>>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-28 Thread Benjamin Kietzman
The vote passes with 7 binding +1 votes, 5 non-binding +1 votes, and no
-1/0 votes.

Thanks everyone!

On Thu, Aug 24, 2023 at 10:10 AM Antoine Pitrou  wrote:

>
> +1 on the format additions
>
> The implementations will probably need a bit more review back-and-forth.
>
> Regards
>
> Antoine.
>
>
> Le 28/06/2023 à 21:34, Benjamin Kietzman a écrit :
> > Hello,
> >
> > I'd like to propose adding Utf8View arrays to the arrow format.
> > Previous discussion in [1], columnar format description in [2],
> > flatbuffers changes in [3].
> >
> > There are implementations available in both C++[4] and Go[5] which
> > exercise the new type over IPC. Utf8View format demonstrates[6]
> > significant performance benefits over Utf8 in common tasks.
> >
> > The vote will be open for at least 72 hours.
> >
> > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
> > because...
> >
> > Sincerely,
> > Ben Kietzman
> >
> > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > [2]
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > [3]
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > [4] https://github.com/apache/arrow/pull/35628
> > [5] https://github.com/apache/arrow/pull/35769
> > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-24 Thread Antoine Pitrou



+1 on the format additions

The implementations will probably need a bit more review back-and-forth.

Regards

Antoine.


Le 28/06/2023 à 21:34, Benjamin Kietzman a écrit :

Hello,

I'd like to propose adding Utf8View arrays to the arrow format.
Previous discussion in [1], columnar format description in [2],
flatbuffers changes in [3].

There are implementations available in both C++[4] and Go[5] which
exercise the new type over IPC. Utf8View format demonstrates[6]
significant performance benefits over Utf8 in common tasks.

The vote will be open for at least 72 hours.

[ ] +1 add the proposed Utf8View type to the Apache Arrow format
[ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
because...

Sincerely,
Ben Kietzman

[1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
[2]
https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
[3]
https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
[4] https://github.com/apache/arrow/pull/35628
[5] https://github.com/apache/arrow/pull/35769
[6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617



Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-22 Thread Joris Van den Bossche
+1

On Mon, 21 Aug 2023 at 19:33, Weston Pace  wrote:
>
> +1
>
> Thanks to all for the discussion and thanks to Ben for all of the great
> work.
>
>
> On Mon, Aug 21, 2023 at 9:16 AM wish maple  wrote:
>
> > +1 (non-binding)
> >
> > It would help a lot when processing UTF-8 related data!
> >
> > Xuwei
> >
> > Andrew Lamb  于2023年8月22日周二 00:11写道:
> >
> > > +1
> > >
> > > This is a great example of collaboration
> > >
> > > On Sat, Aug 19, 2023 at 4:10 PM Chao Sun  wrote:
> > >
> > > > +1 (non-binding)!
> > > >
> > > > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho <
> > > > felipe...@gmail.com> wrote:
> > > >
> > > > > +1 (non-binding)
> > > > >
> > > > > —
> > > > > Felipe
> > > > >
> > > > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens
> > > > >  wrote:
> > > > >
> > > > > > +1 (non-binding)
> > > > > >
> > > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh 
> > > wrote:
> > > > > >
> > > > > > > +1 (binding)
> > > > > > >
> > > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
> > > > > > >  wrote:
> > > > > > > >
> > > > > > > > +1
> > > > > > > >
> > > > > > > > Thanks all for the thoughtful discussions here.
> > > > > > > >
> > > > > > > > Neal
> > > > > > > >
> > > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> > > > > > > >  wrote:
> > > > > > > >
> > > > > > > > > +1 (binding)
> > > > > > > > >
> > > > > > > > > Despite my earlier misgivings, I think this will be a
> > valuable
> > > > > > addition
> > > > > > > > > to the specification.
> > > > > > > > >
> > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View
> > and
> > > > > > > > > BinaryView as in the linked PR.
> > > > > > > > >
> > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > > > > > > > Hello,
> > > > > > > > > >
> > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow
> > > format.
> > > > > > > > > > Previous discussion in [1], columnar format description in
> > > [2],
> > > > > > > > > > flatbuffers changes in [3].
> > > > > > > > > >
> > > > > > > > > > There are implementations available in both C++[4] and
> > Go[5]
> > > > > which
> > > > > > > > > > exercise the new type over IPC. Utf8View format
> > > demonstrates[6]
> > > > > > > > > > significant performance benefits over Utf8 in common tasks.
> > > > > > > > > >
> > > > > > > > > > The vote will be open for at least 72 hours.
> > > > > > > > > >
> > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow
> > > > format
> > > > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache
> > > > Arrow
> > > > > > > format
> > > > > > > > > > because...
> > > > > > > > > >
> > > > > > > > > > Sincerely,
> > > > > > > > > > Ben Kietzman
> > > > > > > > > >
> > > > > > > > > > [1]
> > > > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > > > > > > > [2]
> > > > > > > > > >
> > > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > > > > > > > [3]
> > > > > > > > > >
> > > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > > > > > > > [4] https://github.com/apache/arrow/pull/35628
> > > > > > > > > > [5] https://github.com/apache/arrow/pull/35769
> > > > > > > > > > [6]
> > > > > > >
> > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > > > > > > > >
> > > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-21 Thread Weston Pace
+1

Thanks to all for the discussion and thanks to Ben for all of the great
work.


On Mon, Aug 21, 2023 at 9:16 AM wish maple  wrote:

> +1 (non-binding)
>
> It would help a lot when processing UTF-8 related data!
>
> Xuwei
>
> Andrew Lamb  于2023年8月22日周二 00:11写道:
>
> > +1
> >
> > This is a great example of collaboration
> >
> > On Sat, Aug 19, 2023 at 4:10 PM Chao Sun  wrote:
> >
> > > +1 (non-binding)!
> > >
> > > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho <
> > > felipe...@gmail.com> wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > —
> > > > Felipe
> > > >
> > > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens
> > > >  wrote:
> > > >
> > > > > +1 (non-binding)
> > > > >
> > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh 
> > wrote:
> > > > >
> > > > > > +1 (binding)
> > > > > >
> > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
> > > > > >  wrote:
> > > > > > >
> > > > > > > +1
> > > > > > >
> > > > > > > Thanks all for the thoughtful discussions here.
> > > > > > >
> > > > > > > Neal
> > > > > > >
> > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> > > > > > >  wrote:
> > > > > > >
> > > > > > > > +1 (binding)
> > > > > > > >
> > > > > > > > Despite my earlier misgivings, I think this will be a
> valuable
> > > > > addition
> > > > > > > > to the specification.
> > > > > > > >
> > > > > > > > To clarify I've interpreted this as a vote on both Utf8View
> and
> > > > > > > > BinaryView as in the linked PR.
> > > > > > > >
> > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > > > > > > Hello,
> > > > > > > > >
> > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow
> > format.
> > > > > > > > > Previous discussion in [1], columnar format description in
> > [2],
> > > > > > > > > flatbuffers changes in [3].
> > > > > > > > >
> > > > > > > > > There are implementations available in both C++[4] and
> Go[5]
> > > > which
> > > > > > > > > exercise the new type over IPC. Utf8View format
> > demonstrates[6]
> > > > > > > > > significant performance benefits over Utf8 in common tasks.
> > > > > > > > >
> > > > > > > > > The vote will be open for at least 72 hours.
> > > > > > > > >
> > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow
> > > format
> > > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache
> > > Arrow
> > > > > > format
> > > > > > > > > because...
> > > > > > > > >
> > > > > > > > > Sincerely,
> > > > > > > > > Ben Kietzman
> > > > > > > > >
> > > > > > > > > [1]
> > > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > > > > > > [2]
> > > > > > > > >
> > > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > > > > > > [3]
> > > > > > > > >
> > > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > > > > > > [4] https://github.com/apache/arrow/pull/35628
> > > > > > > > > [5] https://github.com/apache/arrow/pull/35769
> > > > > > > > > [6]
> > > > > >
> https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > > > > > > >
> > > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-21 Thread wish maple
+1 (non-binding)

It would help a lot when processing UTF-8 related data!

Xuwei

Andrew Lamb  于2023年8月22日周二 00:11写道:

> +1
>
> This is a great example of collaboration
>
> On Sat, Aug 19, 2023 at 4:10 PM Chao Sun  wrote:
>
> > +1 (non-binding)!
> >
> > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho <
> > felipe...@gmail.com> wrote:
> >
> > > +1 (non-binding)
> > >
> > > —
> > > Felipe
> > >
> > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens
> > >  wrote:
> > >
> > > > +1 (non-binding)
> > > >
> > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh 
> wrote:
> > > >
> > > > > +1 (binding)
> > > > >
> > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
> > > > >  wrote:
> > > > > >
> > > > > > +1
> > > > > >
> > > > > > Thanks all for the thoughtful discussions here.
> > > > > >
> > > > > > Neal
> > > > > >
> > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> > > > > >  wrote:
> > > > > >
> > > > > > > +1 (binding)
> > > > > > >
> > > > > > > Despite my earlier misgivings, I think this will be a valuable
> > > > addition
> > > > > > > to the specification.
> > > > > > >
> > > > > > > To clarify I've interpreted this as a vote on both Utf8View and
> > > > > > > BinaryView as in the linked PR.
> > > > > > >
> > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > > > > > Hello,
> > > > > > > >
> > > > > > > > I'd like to propose adding Utf8View arrays to the arrow
> format.
> > > > > > > > Previous discussion in [1], columnar format description in
> [2],
> > > > > > > > flatbuffers changes in [3].
> > > > > > > >
> > > > > > > > There are implementations available in both C++[4] and Go[5]
> > > which
> > > > > > > > exercise the new type over IPC. Utf8View format
> demonstrates[6]
> > > > > > > > significant performance benefits over Utf8 in common tasks.
> > > > > > > >
> > > > > > > > The vote will be open for at least 72 hours.
> > > > > > > >
> > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow
> > format
> > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache
> > Arrow
> > > > > format
> > > > > > > > because...
> > > > > > > >
> > > > > > > > Sincerely,
> > > > > > > > Ben Kietzman
> > > > > > > >
> > > > > > > > [1]
> > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > > > > > [2]
> > > > > > > >
> > > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > > > > > [3]
> > > > > > > >
> > > > > > >
> > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > > > > > [4] https://github.com/apache/arrow/pull/35628
> > > > > > > > [5] https://github.com/apache/arrow/pull/35769
> > > > > > > > [6]
> > > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > > > > > >
> > > > > > >
> > > > >
> > > >
> > >
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-21 Thread Andrew Lamb
+1

This is a great example of collaboration

On Sat, Aug 19, 2023 at 4:10 PM Chao Sun  wrote:

> +1 (non-binding)!
>
> On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho <
> felipe...@gmail.com> wrote:
>
> > +1 (non-binding)
> >
> > —
> > Felipe
> >
> > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens
> >  wrote:
> >
> > > +1 (non-binding)
> > >
> > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
> > > >  wrote:
> > > > >
> > > > > +1
> > > > >
> > > > > Thanks all for the thoughtful discussions here.
> > > > >
> > > > > Neal
> > > > >
> > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> > > > >  wrote:
> > > > >
> > > > > > +1 (binding)
> > > > > >
> > > > > > Despite my earlier misgivings, I think this will be a valuable
> > > addition
> > > > > > to the specification.
> > > > > >
> > > > > > To clarify I've interpreted this as a vote on both Utf8View and
> > > > > > BinaryView as in the linked PR.
> > > > > >
> > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > > > > Hello,
> > > > > > >
> > > > > > > I'd like to propose adding Utf8View arrays to the arrow format.
> > > > > > > Previous discussion in [1], columnar format description in [2],
> > > > > > > flatbuffers changes in [3].
> > > > > > >
> > > > > > > There are implementations available in both C++[4] and Go[5]
> > which
> > > > > > > exercise the new type over IPC. Utf8View format demonstrates[6]
> > > > > > > significant performance benefits over Utf8 in common tasks.
> > > > > > >
> > > > > > > The vote will be open for at least 72 hours.
> > > > > > >
> > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow
> format
> > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache
> Arrow
> > > > format
> > > > > > > because...
> > > > > > >
> > > > > > > Sincerely,
> > > > > > > Ben Kietzman
> > > > > > >
> > > > > > > [1]
> > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > > > > [2]
> > > > > > >
> > > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > > > > [3]
> > > > > > >
> > > > > >
> > > >
> > >
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > > > > [4] https://github.com/apache/arrow/pull/35628
> > > > > > > [5] https://github.com/apache/arrow/pull/35769
> > > > > > > [6]
> > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > > > > >
> > > > > >
> > > >
> > >
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-19 Thread Chao Sun
+1 (non-binding)!

On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho <
felipe...@gmail.com> wrote:

> +1 (non-binding)
>
> —
> Felipe
>
> On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens
>  wrote:
>
> > +1 (non-binding)
> >
> > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh  wrote:
> >
> > > +1 (binding)
> > >
> > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
> > >  wrote:
> > > >
> > > > +1
> > > >
> > > > Thanks all for the thoughtful discussions here.
> > > >
> > > > Neal
> > > >
> > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> > > >  wrote:
> > > >
> > > > > +1 (binding)
> > > > >
> > > > > Despite my earlier misgivings, I think this will be a valuable
> > addition
> > > > > to the specification.
> > > > >
> > > > > To clarify I've interpreted this as a vote on both Utf8View and
> > > > > BinaryView as in the linked PR.
> > > > >
> > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > > > Hello,
> > > > > >
> > > > > > I'd like to propose adding Utf8View arrays to the arrow format.
> > > > > > Previous discussion in [1], columnar format description in [2],
> > > > > > flatbuffers changes in [3].
> > > > > >
> > > > > > There are implementations available in both C++[4] and Go[5]
> which
> > > > > > exercise the new type over IPC. Utf8View format demonstrates[6]
> > > > > > significant performance benefits over Utf8 in common tasks.
> > > > > >
> > > > > > The vote will be open for at least 72 hours.
> > > > > >
> > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow
> > > format
> > > > > > because...
> > > > > >
> > > > > > Sincerely,
> > > > > > Ben Kietzman
> > > > > >
> > > > > > [1]
> > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > > > [2]
> > > > > >
> > > > >
> > >
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > > > [3]
> > > > > >
> > > > >
> > >
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > > > [4] https://github.com/apache/arrow/pull/35628
> > > > > > [5] https://github.com/apache/arrow/pull/35769
> > > > > > [6]
> > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > > > >
> > > > >
> > >
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-18 Thread Felipe Oliveira Carvalho
+1 (non-binding)

—
Felipe

On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens
 wrote:

> +1 (non-binding)
>
> On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh  wrote:
>
> > +1 (binding)
> >
> > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
> >  wrote:
> > >
> > > +1
> > >
> > > Thanks all for the thoughtful discussions here.
> > >
> > > Neal
> > >
> > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> > >  wrote:
> > >
> > > > +1 (binding)
> > > >
> > > > Despite my earlier misgivings, I think this will be a valuable
> addition
> > > > to the specification.
> > > >
> > > > To clarify I've interpreted this as a vote on both Utf8View and
> > > > BinaryView as in the linked PR.
> > > >
> > > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > > Hello,
> > > > >
> > > > > I'd like to propose adding Utf8View arrays to the arrow format.
> > > > > Previous discussion in [1], columnar format description in [2],
> > > > > flatbuffers changes in [3].
> > > > >
> > > > > There are implementations available in both C++[4] and Go[5] which
> > > > > exercise the new type over IPC. Utf8View format demonstrates[6]
> > > > > significant performance benefits over Utf8 in common tasks.
> > > > >
> > > > > The vote will be open for at least 72 hours.
> > > > >
> > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> > > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow
> > format
> > > > > because...
> > > > >
> > > > > Sincerely,
> > > > > Ben Kietzman
> > > > >
> > > > > [1]
> https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > > [2]
> > > > >
> > > >
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > > [3]
> > > > >
> > > >
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > > [4] https://github.com/apache/arrow/pull/35628
> > > > > [5] https://github.com/apache/arrow/pull/35769
> > > > > [6]
> > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > > >
> > > >
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-18 Thread Jacob Wujciak-Jens
+1 (non-binding)

On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh  wrote:

> +1 (binding)
>
> On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
>  wrote:
> >
> > +1
> >
> > Thanks all for the thoughtful discussions here.
> >
> > Neal
> >
> > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
> >  wrote:
> >
> > > +1 (binding)
> > >
> > > Despite my earlier misgivings, I think this will be a valuable addition
> > > to the specification.
> > >
> > > To clarify I've interpreted this as a vote on both Utf8View and
> > > BinaryView as in the linked PR.
> > >
> > > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > > Hello,
> > > >
> > > > I'd like to propose adding Utf8View arrays to the arrow format.
> > > > Previous discussion in [1], columnar format description in [2],
> > > > flatbuffers changes in [3].
> > > >
> > > > There are implementations available in both C++[4] and Go[5] which
> > > > exercise the new type over IPC. Utf8View format demonstrates[6]
> > > > significant performance benefits over Utf8 in common tasks.
> > > >
> > > > The vote will be open for at least 72 hours.
> > > >
> > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow
> format
> > > > because...
> > > >
> > > > Sincerely,
> > > > Ben Kietzman
> > > >
> > > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > > [2]
> > > >
> > >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > > [3]
> > > >
> > >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > > [4] https://github.com/apache/arrow/pull/35628
> > > > [5] https://github.com/apache/arrow/pull/35769
> > > > [6]
> https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > > >
> > >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-18 Thread L. C. Hsieh
+1 (binding)

On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson
 wrote:
>
> +1
>
> Thanks all for the thoughtful discussions here.
>
> Neal
>
> On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
>  wrote:
>
> > +1 (binding)
> >
> > Despite my earlier misgivings, I think this will be a valuable addition
> > to the specification.
> >
> > To clarify I've interpreted this as a vote on both Utf8View and
> > BinaryView as in the linked PR.
> >
> > On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > > Hello,
> > >
> > > I'd like to propose adding Utf8View arrays to the arrow format.
> > > Previous discussion in [1], columnar format description in [2],
> > > flatbuffers changes in [3].
> > >
> > > There are implementations available in both C++[4] and Go[5] which
> > > exercise the new type over IPC. Utf8View format demonstrates[6]
> > > significant performance benefits over Utf8 in common tasks.
> > >
> > > The vote will be open for at least 72 hours.
> > >
> > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
> > > because...
> > >
> > > Sincerely,
> > > Ben Kietzman
> > >
> > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > > [2]
> > >
> > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > > [3]
> > >
> > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > > [4] https://github.com/apache/arrow/pull/35628
> > > [5] https://github.com/apache/arrow/pull/35769
> > > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> > >
> >


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-18 Thread Neal Richardson
+1

Thanks all for the thoughtful discussions here.

Neal

On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies
 wrote:

> +1 (binding)
>
> Despite my earlier misgivings, I think this will be a valuable addition
> to the specification.
>
> To clarify I've interpreted this as a vote on both Utf8View and
> BinaryView as in the linked PR.
>
> On 28/06/2023 20:34, Benjamin Kietzman wrote:
> > Hello,
> >
> > I'd like to propose adding Utf8View arrays to the arrow format.
> > Previous discussion in [1], columnar format description in [2],
> > flatbuffers changes in [3].
> >
> > There are implementations available in both C++[4] and Go[5] which
> > exercise the new type over IPC. Utf8View format demonstrates[6]
> > significant performance benefits over Utf8 in common tasks.
> >
> > The vote will be open for at least 72 hours.
> >
> > [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
> > because...
> >
> > Sincerely,
> > Ben Kietzman
> >
> > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> > [2]
> >
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> > [3]
> >
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> > [4] https://github.com/apache/arrow/pull/35628
> > [5] https://github.com/apache/arrow/pull/35769
> > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
> >
>


Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-08-18 Thread Raphael Taylor-Davies

+1 (binding)

Despite my earlier misgivings, I think this will be a valuable addition 
to the specification.


To clarify I've interpreted this as a vote on both Utf8View and 
BinaryView as in the linked PR.


On 28/06/2023 20:34, Benjamin Kietzman wrote:

Hello,

I'd like to propose adding Utf8View arrays to the arrow format.
Previous discussion in [1], columnar format description in [2],
flatbuffers changes in [3].

There are implementations available in both C++[4] and Go[5] which
exercise the new type over IPC. Utf8View format demonstrates[6]
significant performance benefits over Utf8 in common tasks.

The vote will be open for at least 72 hours.

[ ] +1 add the proposed Utf8View type to the Apache Arrow format
[ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
because...

Sincerely,
Ben Kietzman

[1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
[2]
https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
[3]
https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
[4] https://github.com/apache/arrow/pull/35628
[5] https://github.com/apache/arrow/pull/35769
[6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617



Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-06-29 Thread Gang Wu
+1 (non-binding)

Thanks,
Gang

On Thu, Jun 29, 2023 at 3:35 AM Benjamin Kietzman 
wrote:

> Hello,
>
> I'd like to propose adding Utf8View arrays to the arrow format.
> Previous discussion in [1], columnar format description in [2],
> flatbuffers changes in [3].
>
> There are implementations available in both C++[4] and Go[5] which
> exercise the new type over IPC. Utf8View format demonstrates[6]
> significant performance benefits over Utf8 in common tasks.
>
> The vote will be open for at least 72 hours.
>
> [ ] +1 add the proposed Utf8View type to the Apache Arrow format
> [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
> because...
>
> Sincerely,
> Ben Kietzman
>
> [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
> [2]
>
> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
> [3]
>
> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
> [4] https://github.com/apache/arrow/pull/35628
> [5] https://github.com/apache/arrow/pull/35769
> [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
>


[VOTE][Format] Add Utf8View Arrays to Arrow Format

2023-06-28 Thread Benjamin Kietzman
Hello,

I'd like to propose adding Utf8View arrays to the arrow format.
Previous discussion in [1], columnar format description in [2],
flatbuffers changes in [3].

There are implementations available in both C++[4] and Go[5] which
exercise the new type over IPC. Utf8View format demonstrates[6]
significant performance benefits over Utf8 in common tasks.

The vote will be open for at least 72 hours.

[ ] +1 add the proposed Utf8View type to the Apache Arrow format
[ ] -1 do not add the proposed Utf8View type to the Apache Arrow format
because...

Sincerely,
Ben Kietzman

[1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
[2]
https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout
[3]
https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3
[4] https://github.com/apache/arrow/pull/35628
[5] https://github.com/apache/arrow/pull/35769
[6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617