[RESULT][VOTE][Format] Add Utf8View Arrays to Arrow Format
Resend with "[RESULT]" prefix. In "Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format" on Mon, 28 Aug 2023 14:18:53 -0400, Benjamin Kietzman wrote: > The vote passes with 7 binding +1 votes, 5 non-binding +1 votes, and no > -1/0 votes. > > Thanks everyone! > > On Thu, Aug 24, 2023 at 10:10 AM Antoine Pitrou wrote: > >> >> +1 on the format additions >> >> The implementations will probably need a bit more review back-and-forth. >> >> Regards >> >> Antoine. >> >> >> Le 28/06/2023 à 21:34, Benjamin Kietzman a écrit : >> > Hello, >> > >> > I'd like to propose adding Utf8View arrays to the arrow format. >> > Previous discussion in [1], columnar format description in [2], >> > flatbuffers changes in [3]. >> > >> > There are implementations available in both C++[4] and Go[5] which >> > exercise the new type over IPC. Utf8View format demonstrates[6] >> > significant performance benefits over Utf8 in common tasks. >> > >> > The vote will be open for at least 72 hours. >> > >> > [ ] +1 add the proposed Utf8View type to the Apache Arrow format >> > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format >> > because... >> > >> > Sincerely, >> > Ben Kietzman >> > >> > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt >> > [2] >> > >> https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout >> > [3] >> > >> https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 >> > [4] https://github.com/apache/arrow/pull/35628 >> > [5] https://github.com/apache/arrow/pull/35769 >> > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 >> > >>
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
The vote passes with 7 binding +1 votes, 5 non-binding +1 votes, and no -1/0 votes. Thanks everyone! On Thu, Aug 24, 2023 at 10:10 AM Antoine Pitrou wrote: > > +1 on the format additions > > The implementations will probably need a bit more review back-and-forth. > > Regards > > Antoine. > > > Le 28/06/2023 à 21:34, Benjamin Kietzman a écrit : > > Hello, > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > Previous discussion in [1], columnar format description in [2], > > flatbuffers changes in [3]. > > > > There are implementations available in both C++[4] and Go[5] which > > exercise the new type over IPC. Utf8View format demonstrates[6] > > significant performance benefits over Utf8 in common tasks. > > > > The vote will be open for at least 72 hours. > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format > > because... > > > > Sincerely, > > Ben Kietzman > > > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > [2] > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > [3] > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > [4] https://github.com/apache/arrow/pull/35628 > > [5] https://github.com/apache/arrow/pull/35769 > > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 on the format additions The implementations will probably need a bit more review back-and-forth. Regards Antoine. Le 28/06/2023 à 21:34, Benjamin Kietzman a écrit : Hello, I'd like to propose adding Utf8View arrays to the arrow format. Previous discussion in [1], columnar format description in [2], flatbuffers changes in [3]. There are implementations available in both C++[4] and Go[5] which exercise the new type over IPC. Utf8View format demonstrates[6] significant performance benefits over Utf8 in common tasks. The vote will be open for at least 72 hours. [ ] +1 add the proposed Utf8View type to the Apache Arrow format [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format because... Sincerely, Ben Kietzman [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt [2] https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout [3] https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 [4] https://github.com/apache/arrow/pull/35628 [5] https://github.com/apache/arrow/pull/35769 [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 On Mon, 21 Aug 2023 at 19:33, Weston Pace wrote: > > +1 > > Thanks to all for the discussion and thanks to Ben for all of the great > work. > > > On Mon, Aug 21, 2023 at 9:16 AM wish maple wrote: > > > +1 (non-binding) > > > > It would help a lot when processing UTF-8 related data! > > > > Xuwei > > > > Andrew Lamb 于2023年8月22日周二 00:11写道: > > > > > +1 > > > > > > This is a great example of collaboration > > > > > > On Sat, Aug 19, 2023 at 4:10 PM Chao Sun wrote: > > > > > > > +1 (non-binding)! > > > > > > > > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho < > > > > felipe...@gmail.com> wrote: > > > > > > > > > +1 (non-binding) > > > > > > > > > > — > > > > > Felipe > > > > > > > > > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens > > > > > wrote: > > > > > > > > > > > +1 (non-binding) > > > > > > > > > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh > > > wrote: > > > > > > > > > > > > > +1 (binding) > > > > > > > > > > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > > > > > > > wrote: > > > > > > > > > > > > > > > > +1 > > > > > > > > > > > > > > > > Thanks all for the thoughtful discussions here. > > > > > > > > > > > > > > > > Neal > > > > > > > > > > > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > > > > > > > wrote: > > > > > > > > > > > > > > > > > +1 (binding) > > > > > > > > > > > > > > > > > > Despite my earlier misgivings, I think this will be a > > valuable > > > > > > addition > > > > > > > > > to the specification. > > > > > > > > > > > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View > > and > > > > > > > > > BinaryView as in the linked PR. > > > > > > > > > > > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > > > > > > > Hello, > > > > > > > > > > > > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow > > > format. > > > > > > > > > > Previous discussion in [1], columnar format description in > > > [2], > > > > > > > > > > flatbuffers changes in [3]. > > > > > > > > > > > > > > > > > > > > There are implementations available in both C++[4] and > > Go[5] > > > > > which > > > > > > > > > > exercise the new type over IPC. Utf8View format > > > demonstrates[6] > > > > > > > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > > > > > > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > > > > > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow > > > > format > > > > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache > > > > Arrow > > > > > > > format > > > > > > > > > > because... > > > > > > > > > > > > > > > > > > > > Sincerely, > > > > > > > > > > Ben Kietzman > > > > > > > > > > > > > > > > > > > > [1] > > > > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > > > > > > > [2] > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > > > > > > > [3] > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > > > > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > > > > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > > > > > > > [6] > > > > > > > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 Thanks to all for the discussion and thanks to Ben for all of the great work. On Mon, Aug 21, 2023 at 9:16 AM wish maple wrote: > +1 (non-binding) > > It would help a lot when processing UTF-8 related data! > > Xuwei > > Andrew Lamb 于2023年8月22日周二 00:11写道: > > > +1 > > > > This is a great example of collaboration > > > > On Sat, Aug 19, 2023 at 4:10 PM Chao Sun wrote: > > > > > +1 (non-binding)! > > > > > > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho < > > > felipe...@gmail.com> wrote: > > > > > > > +1 (non-binding) > > > > > > > > — > > > > Felipe > > > > > > > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens > > > > wrote: > > > > > > > > > +1 (non-binding) > > > > > > > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh > > wrote: > > > > > > > > > > > +1 (binding) > > > > > > > > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > > > > > > wrote: > > > > > > > > > > > > > > +1 > > > > > > > > > > > > > > Thanks all for the thoughtful discussions here. > > > > > > > > > > > > > > Neal > > > > > > > > > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > > > > > > wrote: > > > > > > > > > > > > > > > +1 (binding) > > > > > > > > > > > > > > > > Despite my earlier misgivings, I think this will be a > valuable > > > > > addition > > > > > > > > to the specification. > > > > > > > > > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View > and > > > > > > > > BinaryView as in the linked PR. > > > > > > > > > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > > > > > > Hello, > > > > > > > > > > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow > > format. > > > > > > > > > Previous discussion in [1], columnar format description in > > [2], > > > > > > > > > flatbuffers changes in [3]. > > > > > > > > > > > > > > > > > > There are implementations available in both C++[4] and > Go[5] > > > > which > > > > > > > > > exercise the new type over IPC. Utf8View format > > demonstrates[6] > > > > > > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > > > > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > > > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow > > > format > > > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache > > > Arrow > > > > > > format > > > > > > > > > because... > > > > > > > > > > > > > > > > > > Sincerely, > > > > > > > > > Ben Kietzman > > > > > > > > > > > > > > > > > > [1] > > > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > > > > > > [2] > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > > > > > > [3] > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > > > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > > > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > > > > > > [6] > > > > > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (non-binding) It would help a lot when processing UTF-8 related data! Xuwei Andrew Lamb 于2023年8月22日周二 00:11写道: > +1 > > This is a great example of collaboration > > On Sat, Aug 19, 2023 at 4:10 PM Chao Sun wrote: > > > +1 (non-binding)! > > > > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho < > > felipe...@gmail.com> wrote: > > > > > +1 (non-binding) > > > > > > — > > > Felipe > > > > > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens > > > wrote: > > > > > > > +1 (non-binding) > > > > > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh > wrote: > > > > > > > > > +1 (binding) > > > > > > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > > > > > wrote: > > > > > > > > > > > > +1 > > > > > > > > > > > > Thanks all for the thoughtful discussions here. > > > > > > > > > > > > Neal > > > > > > > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > > > > > wrote: > > > > > > > > > > > > > +1 (binding) > > > > > > > > > > > > > > Despite my earlier misgivings, I think this will be a valuable > > > > addition > > > > > > > to the specification. > > > > > > > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View and > > > > > > > BinaryView as in the linked PR. > > > > > > > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > > > > > Hello, > > > > > > > > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow > format. > > > > > > > > Previous discussion in [1], columnar format description in > [2], > > > > > > > > flatbuffers changes in [3]. > > > > > > > > > > > > > > > > There are implementations available in both C++[4] and Go[5] > > > which > > > > > > > > exercise the new type over IPC. Utf8View format > demonstrates[6] > > > > > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow > > format > > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache > > Arrow > > > > > format > > > > > > > > because... > > > > > > > > > > > > > > > > Sincerely, > > > > > > > > Ben Kietzman > > > > > > > > > > > > > > > > [1] > > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > > > > > [2] > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > > > > > [3] > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > > > > > [6] > > > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 This is a great example of collaboration On Sat, Aug 19, 2023 at 4:10 PM Chao Sun wrote: > +1 (non-binding)! > > On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho < > felipe...@gmail.com> wrote: > > > +1 (non-binding) > > > > — > > Felipe > > > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens > > wrote: > > > > > +1 (non-binding) > > > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh wrote: > > > > > > > +1 (binding) > > > > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > > > > wrote: > > > > > > > > > > +1 > > > > > > > > > > Thanks all for the thoughtful discussions here. > > > > > > > > > > Neal > > > > > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > > > > wrote: > > > > > > > > > > > +1 (binding) > > > > > > > > > > > > Despite my earlier misgivings, I think this will be a valuable > > > addition > > > > > > to the specification. > > > > > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View and > > > > > > BinaryView as in the linked PR. > > > > > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > > > > Hello, > > > > > > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > > > > > > Previous discussion in [1], columnar format description in [2], > > > > > > > flatbuffers changes in [3]. > > > > > > > > > > > > > > There are implementations available in both C++[4] and Go[5] > > which > > > > > > > exercise the new type over IPC. Utf8View format demonstrates[6] > > > > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow > format > > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache > Arrow > > > > format > > > > > > > because... > > > > > > > > > > > > > > Sincerely, > > > > > > > Ben Kietzman > > > > > > > > > > > > > > [1] > > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > > > > [2] > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > > > > [3] > > > > > > > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > > > > [6] > > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > > > > > > > > > > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (non-binding)! On Fri, Aug 18, 2023 at 12:59 PM Felipe Oliveira Carvalho < felipe...@gmail.com> wrote: > +1 (non-binding) > > — > Felipe > > On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens > wrote: > > > +1 (non-binding) > > > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh wrote: > > > > > +1 (binding) > > > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > > > wrote: > > > > > > > > +1 > > > > > > > > Thanks all for the thoughtful discussions here. > > > > > > > > Neal > > > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > > > wrote: > > > > > > > > > +1 (binding) > > > > > > > > > > Despite my earlier misgivings, I think this will be a valuable > > addition > > > > > to the specification. > > > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View and > > > > > BinaryView as in the linked PR. > > > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > > > Hello, > > > > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > > > > > Previous discussion in [1], columnar format description in [2], > > > > > > flatbuffers changes in [3]. > > > > > > > > > > > > There are implementations available in both C++[4] and Go[5] > which > > > > > > exercise the new type over IPC. Utf8View format demonstrates[6] > > > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow > > > format > > > > > > because... > > > > > > > > > > > > Sincerely, > > > > > > Ben Kietzman > > > > > > > > > > > > [1] > > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > > > [2] > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > > > [3] > > > > > > > > > > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > > > [6] > > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > > > > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (non-binding) — Felipe On Fri, 18 Aug 2023 at 18:48 Jacob Wujciak-Jens wrote: > +1 (non-binding) > > On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh wrote: > > > +1 (binding) > > > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > > wrote: > > > > > > +1 > > > > > > Thanks all for the thoughtful discussions here. > > > > > > Neal > > > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > > wrote: > > > > > > > +1 (binding) > > > > > > > > Despite my earlier misgivings, I think this will be a valuable > addition > > > > to the specification. > > > > > > > > To clarify I've interpreted this as a vote on both Utf8View and > > > > BinaryView as in the linked PR. > > > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > > Hello, > > > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > > > > Previous discussion in [1], columnar format description in [2], > > > > > flatbuffers changes in [3]. > > > > > > > > > > There are implementations available in both C++[4] and Go[5] which > > > > > exercise the new type over IPC. Utf8View format demonstrates[6] > > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > > > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow > > format > > > > > because... > > > > > > > > > > Sincerely, > > > > > Ben Kietzman > > > > > > > > > > [1] > https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > > [2] > > > > > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > > [3] > > > > > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > > [6] > > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (non-binding) On Fri, Aug 18, 2023 at 6:04 PM L. C. Hsieh wrote: > +1 (binding) > > On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson > wrote: > > > > +1 > > > > Thanks all for the thoughtful discussions here. > > > > Neal > > > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > > wrote: > > > > > +1 (binding) > > > > > > Despite my earlier misgivings, I think this will be a valuable addition > > > to the specification. > > > > > > To clarify I've interpreted this as a vote on both Utf8View and > > > BinaryView as in the linked PR. > > > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > > Hello, > > > > > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > > > Previous discussion in [1], columnar format description in [2], > > > > flatbuffers changes in [3]. > > > > > > > > There are implementations available in both C++[4] and Go[5] which > > > > exercise the new type over IPC. Utf8View format demonstrates[6] > > > > significant performance benefits over Utf8 in common tasks. > > > > > > > > The vote will be open for at least 72 hours. > > > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow > format > > > > because... > > > > > > > > Sincerely, > > > > Ben Kietzman > > > > > > > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > > [2] > > > > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > > [3] > > > > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > > [4] https://github.com/apache/arrow/pull/35628 > > > > [5] https://github.com/apache/arrow/pull/35769 > > > > [6] > https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (binding) On Fri, Aug 18, 2023 at 5:53 AM Neal Richardson wrote: > > +1 > > Thanks all for the thoughtful discussions here. > > Neal > > On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies > wrote: > > > +1 (binding) > > > > Despite my earlier misgivings, I think this will be a valuable addition > > to the specification. > > > > To clarify I've interpreted this as a vote on both Utf8View and > > BinaryView as in the linked PR. > > > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > > Hello, > > > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > > Previous discussion in [1], columnar format description in [2], > > > flatbuffers changes in [3]. > > > > > > There are implementations available in both C++[4] and Go[5] which > > > exercise the new type over IPC. Utf8View format demonstrates[6] > > > significant performance benefits over Utf8 in common tasks. > > > > > > The vote will be open for at least 72 hours. > > > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format > > > because... > > > > > > Sincerely, > > > Ben Kietzman > > > > > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > > [2] > > > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > > [3] > > > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > > [4] https://github.com/apache/arrow/pull/35628 > > > [5] https://github.com/apache/arrow/pull/35769 > > > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 Thanks all for the thoughtful discussions here. Neal On Fri, Aug 18, 2023 at 4:14 AM Raphael Taylor-Davies wrote: > +1 (binding) > > Despite my earlier misgivings, I think this will be a valuable addition > to the specification. > > To clarify I've interpreted this as a vote on both Utf8View and > BinaryView as in the linked PR. > > On 28/06/2023 20:34, Benjamin Kietzman wrote: > > Hello, > > > > I'd like to propose adding Utf8View arrays to the arrow format. > > Previous discussion in [1], columnar format description in [2], > > flatbuffers changes in [3]. > > > > There are implementations available in both C++[4] and Go[5] which > > exercise the new type over IPC. Utf8View format demonstrates[6] > > significant performance benefits over Utf8 in common tasks. > > > > The vote will be open for at least 72 hours. > > > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format > > because... > > > > Sincerely, > > Ben Kietzman > > > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > > [2] > > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > > [3] > > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > > [4] https://github.com/apache/arrow/pull/35628 > > [5] https://github.com/apache/arrow/pull/35769 > > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 > > >
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (binding) Despite my earlier misgivings, I think this will be a valuable addition to the specification. To clarify I've interpreted this as a vote on both Utf8View and BinaryView as in the linked PR. On 28/06/2023 20:34, Benjamin Kietzman wrote: Hello, I'd like to propose adding Utf8View arrays to the arrow format. Previous discussion in [1], columnar format description in [2], flatbuffers changes in [3]. There are implementations available in both C++[4] and Go[5] which exercise the new type over IPC. Utf8View format demonstrates[6] significant performance benefits over Utf8 in common tasks. The vote will be open for at least 72 hours. [ ] +1 add the proposed Utf8View type to the Apache Arrow format [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format because... Sincerely, Ben Kietzman [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt [2] https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout [3] https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 [4] https://github.com/apache/arrow/pull/35628 [5] https://github.com/apache/arrow/pull/35769 [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617
Re: [VOTE][Format] Add Utf8View Arrays to Arrow Format
+1 (non-binding) Thanks, Gang On Thu, Jun 29, 2023 at 3:35 AM Benjamin Kietzman wrote: > Hello, > > I'd like to propose adding Utf8View arrays to the arrow format. > Previous discussion in [1], columnar format description in [2], > flatbuffers changes in [3]. > > There are implementations available in both C++[4] and Go[5] which > exercise the new type over IPC. Utf8View format demonstrates[6] > significant performance benefits over Utf8 in common tasks. > > The vote will be open for at least 72 hours. > > [ ] +1 add the proposed Utf8View type to the Apache Arrow format > [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format > because... > > Sincerely, > Ben Kietzman > > [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt > [2] > > https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout > [3] > > https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 > [4] https://github.com/apache/arrow/pull/35628 > [5] https://github.com/apache/arrow/pull/35769 > [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617 >
[VOTE][Format] Add Utf8View Arrays to Arrow Format
Hello, I'd like to propose adding Utf8View arrays to the arrow format. Previous discussion in [1], columnar format description in [2], flatbuffers changes in [3]. There are implementations available in both C++[4] and Go[5] which exercise the new type over IPC. Utf8View format demonstrates[6] significant performance benefits over Utf8 in common tasks. The vote will be open for at least 72 hours. [ ] +1 add the proposed Utf8View type to the Apache Arrow format [ ] -1 do not add the proposed Utf8View type to the Apache Arrow format because... Sincerely, Ben Kietzman [1] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt [2] https://github.com/apache/arrow/blob/46cf7e67766f0646760acefa4d2d01cdfead2d5d/docs/source/format/Columnar.rst#variable-size-binary-view-layout [3] https://github.com/apache/arrow/pull/35628/files#diff-0623d567d0260222d5501b4e169141b5070eabc2ec09c3482da453a3346c5bf3 [4] https://github.com/apache/arrow/pull/35628 [5] https://github.com/apache/arrow/pull/35769 [6] https://github.com/apache/arrow/pull/35628#issuecomment-1583218617