Re: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman

2021-05-05 Thread Weston Pace
Congratulations Ben! On Wed, May 5, 2021 at 6:48 PM Micah Kornfield wrote: > Congrats! > > On Wed, May 5, 2021 at 4:33 PM David Li wrote: > > > Congrats Ben! Well deserved. > > > > Best, > > David > > > > On Wed, May 5, 2021, at 19:22, Neal Richardson wrote: > > > Congrats Ben! > > > > > >

Re: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman

2021-05-05 Thread Micah Kornfield
Congrats! On Wed, May 5, 2021 at 4:33 PM David Li wrote: > Congrats Ben! Well deserved. > > Best, > David > > On Wed, May 5, 2021, at 19:22, Neal Richardson wrote: > > Congrats Ben! > > > > Neal > > > > On Wed, May 5, 2021 at 4:16 PM Eduardo Ponce > wrote: > > > >

Re: New style in documentation on the website looks great

2021-05-05 Thread Aldrin
I very much enjoy the new theme Aldrin Montana Computer Science PhD Student UC Santa Cruz On Tue, May 4, 2021 at 11:47 PM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > Thanks, I am happy that people like it! > It's a slightly customized version of the pydata-sphinx-theme >

Re: [DISCUSS][C++] Refactoring of Expression simplification passes

2021-05-05 Thread Wes McKinney
This seems like it could be a premature optimization. Do we know what fraction of important workloads are taken up by this operation? On Wed, May 5, 2021 at 12:35 PM Benjamin Kietzman wrote: > > Sorry, yes: I meant 4 microseconds and not 4 milliseconds. > > On Wed, May 5, 2021 at 1:27 PM Antoine

Re: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman

2021-05-05 Thread David Li
Congrats Ben! Well deserved. Best, David On Wed, May 5, 2021, at 19:22, Neal Richardson wrote: > Congrats Ben! > > Neal > > On Wed, May 5, 2021 at 4:16 PM Eduardo Ponce > wrote: > > > Great news! Congratulations Ben. > > > > ~Eduardo > > > >

Re: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman

2021-05-05 Thread Neal Richardson
Congrats Ben! Neal On Wed, May 5, 2021 at 4:16 PM Eduardo Ponce wrote: > Great news! Congratulations Ben. > > ~Eduardo > > > From: Wes McKinney > Sent: Wednesday, May 5, 2021, 7:10 PM > To: dev > Subject: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman > >

Re: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman

2021-05-05 Thread Eduardo Ponce
Great news! Congratulations Ben. ~Eduardo From: Wes McKinney Sent: Wednesday, May 5, 2021, 7:10 PM To: dev Subject: [ANNOUNCE] New Arrow PMC member: Benjamin Kietzman The Project Management Committee (PMC) for Apache Arrow has invited Benjamin Kietzman to

[ANNOUNCE] New Arrow PMC member: Benjamin Kietzman

2021-05-05 Thread Wes McKinney
The Project Management Committee (PMC) for Apache Arrow has invited Benjamin Kietzman to become a PMC member and we are pleased to announce that Benjamin has accepted. Congratulations and welcome!

Re: [DISCUSS][C++] Refactoring of Expression simplification passes

2021-05-05 Thread Benjamin Kietzman
Sorry, yes: I meant 4 microseconds and not 4 milliseconds. On Wed, May 5, 2021 at 1:27 PM Antoine Pitrou wrote: > On Wed, 5 May 2021 13:23:36 -0400 > Benjamin Kietzman wrote: > > Currently, Expressions (used to specify dataset filters and projections) > > are simplified by direct rewriting: a

Re: [DISCUSS][C++] Refactoring of Expression simplification passes

2021-05-05 Thread Antoine Pitrou
On Wed, 5 May 2021 13:23:36 -0400 Benjamin Kietzman wrote: > Currently, Expressions (used to specify dataset filters and projections) > are simplified by direct rewriting: a filter such as `alpha == 2 and beta > > 3` > on a partition where we are guaranteed that `beta == 5` will be rewritten > to

[DISCUSS][C++] Refactoring of Expression simplification passes

2021-05-05 Thread Benjamin Kietzman
Currently, Expressions (used to specify dataset filters and projections) are simplified by direct rewriting: a filter such as `alpha == 2 and beta > 3` on a partition where we are guaranteed that `beta == 5` will be rewritten to `alpha == 2` before evaluation against scanned batches. This can
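To make the rewriting described in this message concrete, here is a minimal sketch in Python. It is only a toy model, not Arrow's C++ Expression API: the tuple representation of a filter, the `OPS` table, and the function name `simplify_with_guarantee` are all assumptions made up for illustration. It treats a filter as an AND of comparisons and a partition guarantee as a set of known `field == value` facts.

```python
import operator

# Hypothetical toy representation (not Arrow's Expression API): a filter is a
# conjunction (AND) of (field, op, literal) comparisons.
OPS = {
    "==": operator.eq,
    "!=": operator.ne,
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
}


def simplify_with_guarantee(conjuncts, guarantee):
    """Simplify an AND of comparisons given known `field == value` facts.

    Returns False if some conjunct can never hold on this partition.
    Otherwise returns the conjuncts that still need to be evaluated against
    scanned batches (an empty list means the filter is trivially true).
    """
    remaining = []
    for field, op, literal in conjuncts:
        if field not in guarantee:
            # Nothing is known about this field, so keep the comparison.
            remaining.append((field, op, literal))
            continue
        if not OPS[op](guarantee[field], literal):
            # The guarantee contradicts this conjunct: no row can match.
            return False
        # The guarantee already satisfies this conjunct, so drop it.
    return remaining


# `alpha == 2 and beta > 3` on a partition where `beta == 5` is guaranteed.
filter_expr = [("alpha", "==", 2), ("beta", ">", 3)]
print(simplify_with_guarantee(filter_expr, {"beta": 5}))
# prints [('alpha', '==', 2)]
```

With a guarantee such as `beta == 1`, the same call returns `False`, which in this toy model means the whole partition could be skipped without scanning it.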

Apache Arrow Rust Sync Call 5/5/2021

2021-05-05 Thread Andy Grove
Attendees
- Andy Grove
- Andrew Lamb
- Jorge Leitao
- Danniel Heres
- Fernando Herrera
- Andrew Lamb
- Ruan Pearce-Authers
- Benjamin Blodgett
- Jorn Horstmann
- Michael Edwards
Topics Discussed
- Andrew has an

[Rust] V2 Proposal for bi-weekly Rust Arrow Releases

2021-05-05 Thread Andrew Lamb
First of all, thank you for all the comments so far on the proposal for releasing the Rust Arrow implementation more frequently. I have incorporated the feedback from the initial proposal into an updated proposal for bi-weekly Rust Arrow releases [1]. The largest change, in my opinion, is the

Re: [VOTE] Register media types (MIME types) for Apache Arrow formats to IANA

2021-05-05 Thread Kazuaki Ishizaki
+1, great Weston Pace wrote on 2021/05/04 20:41:34: > From: Weston Pace > To: dev@arrow.apache.org > Date: 2021/05/04 20:41 > Subject: [EXTERNAL] [VOTE] Register media types (MIME types) for > Apache Arrow formats to IANA > > Per ARROW-7396 I would like to propose an application to the IANA

Re: [DISCUSS] [Rust] Python-datafusion

2021-05-05 Thread Andy Grove
Wes, thanks for following up on this and making sure that we are following the process here. I have merged a PR to revert the previous revert, so the Python bindings are now back in the repo. On Tue, May 4, 2021 at 4:14 PM Wes McKinney wrote: > Based on the general@incubator thread, there isn't

Re: [VOTE] Register media types (MIME types) for Apache Arrow formats to IANA

2021-05-05 Thread Andrew Lamb
+1 On Wed, May 5, 2021 at 8:09 AM Sutou Kouhei wrote: > +1 > > In > "[VOTE] Register media types (MIME types) for Apache Arrow formats to > IANA" on Tue, 4 May 2021 01:41:34 -1000, > Weston Pace wrote: > > > Per ARROW-7396 I would like to propose an application to the IANA to > > register

Re: [VOTE] Register media types (MIME types) for Apache Arrow formats to IANA

2021-05-05 Thread Sutou Kouhei
+1 In "[VOTE] Register media types (MIME types) for Apache Arrow formats to IANA" on Tue, 4 May 2021 01:41:34 -1000, Weston Pace wrote: > Per ARROW-7396 I would like to propose an application to the IANA to > register media types for the Arrow IPC formats (both file and > streaming). > >

[DataFusion] [Discuss] Output Schema for queries with multiple relations

2021-05-05 Thread Andrew Lamb
I wanted to bring some additional attention to some discussion occurring on a PR [1], specifically the proposal of how to construct output field names from queries that have multiple relations (that may have the same input field). The documents are: * Document for output schema field name

[NIGHTLY] Arrow Build Report for Job nightly-2021-05-05-0

2021-05-05 Thread Crossbow
Arrow Build Report for Job nightly-2021-05-05-0 All tasks: https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-05-05-0 Failed Tasks: - conda-linux-gcc-py36-arm64: URL:

Re: New style in documentation on the website looks great

2021-05-05 Thread Joris Van den Bossche
Thanks, I am happy that people like it! It's a slightly customized version of the pydata-sphinx-theme, to feature a single sidebar and some custom colors. Concrete feedback is certainly welcome (I am no design expert ;)). Joris On Sun, 2 May 2021

Re: [VOTE] Register media types (MIME types) for Apache Arrow formats to IANA

2021-05-05 Thread Joris Van den Bossche
+1 On Tue, 4 May 2021 at 13:41, Weston Pace wrote: > Per ARROW-7396 I would like to propose an application to the IANA to > register media types for the Arrow IPC formats (both file and > streaming). > > The proposed application is available as [1]. It is based on previous > discussion in a