Re: [ANNOUNCE] New Arrow PMC member: Weston Pace

2022-09-06 Thread Krisztián Szűcs
Congrats Weston! On Wed, Sep 7, 2022 at 1:41 AM Percy Camilo Triveño Aucahuasi wrote: > > Great news! Congratulations Weston! > > On Tue, Sep 6, 2022 at 1:42 PM Andy Grove wrote: > > > Congrats Weston! > > > > On Tue, Sep 6, 2022 at 8:04 AM Aldrin wrote: > > > > > awesome, congrats! > > > > > >

Re: [ANNOUNCE] New Arrow PMC member: Weston Pace

2022-09-06 Thread Percy Camilo Triveño Aucahuasi
Great news! Congratulations Weston! On Tue, Sep 6, 2022 at 1:42 PM Andy Grove wrote: > Congrats Weston! > > On Tue, Sep 6, 2022 at 8:04 AM Aldrin wrote: > > > awesome, congrats! > > > > Aldrin Montana > > Computer Science PhD Student > > UC Santa Cruz > > > > > > On Tue, Sep 6, 2022 at 6:10 AM

Re: Apache Software Foundation community survey 2022

2022-09-06 Thread Wes McKinney
hi Antoine — thank you for circulating this survey. Even though it takes a few minutes to complete I encourage community members to take the time to participate since data about community participation helps the ASF do better in the future. Thanks, Wes On Thu, Aug 25, 2022 at 2:10 AM Antoine Pitr

Re: design for ordered aggregation

2022-09-06 Thread Weston Pace
It seems like a reasonable approach. I think my initial gut feeling would be that initializing and finalizing state for each change of key might be a bit heavyweight in cases where there are only a few values per key. I think these cases are fairly common as a data simplification / cleaning pass.

Re: [ANNOUNCE] New Arrow PMC member: Weston Pace

2022-09-06 Thread Andy Grove
Congrats Weston! On Tue, Sep 6, 2022 at 8:04 AM Aldrin wrote: > awesome, congrats! > > Aldrin Montana > Computer Science PhD Student > UC Santa Cruz > > > On Tue, Sep 6, 2022 at 6:10 AM Joris Van den Bossche < > jorisvandenboss...@gmail.com> wrote: > > > Congrats Weston! It is great to have you

Re: Arrow Flight usage with graph databases

2022-09-06 Thread Bill Zhao
unsubscribe Valentyn Kahamlyk 于2022年7月18日周一 16:56写道: > > Hi All, > > I'm investigating the possibility of using Arrow Flight with graph databases, > and exploring how to enable Arrow Flight endpoint in Apache Tinkerpop Gremlin > server. > > Now graph databases use several incompatible protocols

design for ordered aggregation

2022-09-06 Thread Yaron Gvili
Hi All, I'm working on a design for ordered aggregations in Arrow C++ and would like to get some opinions about it. Ordered aggregation is similar to grouped aggregation except that one column in the grouping key is (known to be) ordered. The result of both types of aggregations is the same but

Re: [VOTE] Substrait for Flight SQL

2022-09-06 Thread David Li
We could add a new type code to the union. Presumably consumers would just error on or ignore such values (the libraries just hand the Arrow array to the application, so it's up to the application what to do with an unknown type code). (And for a new consumer talking to an old server, the new ty

Re: [VOTE] Substrait for Flight SQL

2022-09-06 Thread Antoine Pitrou
Le 06/09/2022 à 17:21, David Li a écrit : Thanks Antoine! I've updated the PR (except for the comment about timeout units, since SqlInfo values can't be doubles/floats unless we change the schema there) Can we change the schema in a backwards-compatible way?

Re: [VOTE] Substrait for Flight SQL

2022-09-06 Thread David Li
Thanks Antoine! I've updated the PR (except for the comment about timeout units, since SqlInfo values can't be doubles/floats unless we change the schema there) On Tue, Sep 6, 2022, at 09:24, Antoine Pitrou wrote: > Hi, > > Sorry for the delay. I took the time to read the protobuf definitions >

Re: [ANNOUNCE] New Arrow PMC member: Weston Pace

2022-09-06 Thread Aldrin
awesome, congrats! Aldrin Montana Computer Science PhD Student UC Santa Cruz On Tue, Sep 6, 2022 at 6:10 AM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > Congrats Weston! It is great to have you on the team! > > On Tue, 6 Sept 2022 at 06:10, Weston Pace wrote: > > > Thank you

Re: Usage of the name Feather?

2022-09-06 Thread Joris Van den Bossche
Personally, I like the "Feather" name (and actually think it could help disambiguate the file vs in-memory distinction), but I understand that we have chosen a certain path (eg ".arrow" is the official registered extension), and have to move on. However, I think we need to be very careful in how w

Re: [VOTE] Substrait for Flight SQL

2022-09-06 Thread Antoine Pitrou
Hi, Sorry for the delay. I took the time to read the protobuf definitions again and posted a few (relatively minor) comments in the PR. On the principle the spec looks sound so I'm giving this a +1 (binding). Regards Antoine. Le 01/09/2022 à 01:51, David Li a écrit : Hello, I am propos

Re: [ANNOUNCE] New Arrow PMC member: Weston Pace

2022-09-06 Thread Joris Van den Bossche
Congrats Weston! It is great to have you on the team! On Tue, 6 Sept 2022 at 06:10, Weston Pace wrote: > Thank you everyone! I look forward to continuing to work with you all. > > On Mon, Sep 5, 2022 at 3:56 PM Kun Liu wrote: > > > > Congrats Weston!! > > > > > > Gavin Ray 于2022年9月6日周二 08:04写

Re: Integration between Flight and Acero

2022-09-06 Thread Yaron Gvili
Hi Li, Here's my 2 cents about the Ibis/Substrait part of this. An Ibis expression carries a schema. If you're planning to create an integrated Ibis/Substrait/Arrow solution, then you'll need the schema to be available to Ibis in Python. So, you'll need a Python wrapper for the C++ implementati

Re: Alluxio cache read support

2022-09-06 Thread Antoine Pitrou
Le 06/09/2022 à 09:45, Manoj Kumar a écrit : Hi Sutou Kouhei/Team *[Background]* Working on intel gazelle_plugin , It's a C++ based backend with an arrow compute engine for spark. Now during scan i.e reading data from HDFS/Cloud currently we are

Re: Alluxio cache read support

2022-09-06 Thread Manoj Kumar
Hi Sutou Kouhei/Team *[Background]* Working on intel gazelle_plugin , It's a C++ based backend with an arrow compute engine for spark. Now during scan i.e reading data from HDFS/Cloud currently we are using cloud/hdfs APIs as mentioned above. But no