[jira] [Created] (ARROW-3098) [Python] BufferReader doesn't adhere to the seek protocol

2018-08-21 Thread Björn Andersson (JIRA)
Björn Andersson created ARROW-3098: Summary: [Python] BufferReader doesn't adhere to the seek protocol Key: ARROW-3098 URL: https://issues.apache.org/jira/browse/ARROW-3098 Project: Apache Arrow

Developing native Arrow interfaces to database protocols

2018-08-21 Thread Wes McKinney
hi folks, Since the project's inception I have wanted to develop higher-performance database clients that natively return the Arrow columnar format. This is a natural analogue to building Arrow-native interfaces to storage formats like Parquet and ORC. If we can't get fast access to data, many o

[jira] [Created] (ARROW-3099) [C++] Add benchmark for number parsing

2018-08-21 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-3099: Summary: [C++] Add benchmark for number parsing Key: ARROW-3099 URL: https://issues.apache.org/jira/browse/ARROW-3099 Project: Apache Arrow Issue Type: Wis

Re: Parquet Build issues

2018-08-21 Thread Wes McKinney
Sounds like the build instructions need to be updated. We should make sure things are up to date after the parquet-cpp/Arrow merge. On Mon, Aug 20, 2018 at 3:52 PM, Brian Bowman wrote: > All, > > My final hurdle to make parquet was updating zlib on a fresh Ubuntu VM: sudo > apt-get install zlib1

[jira] [Created] (ARROW-3100) [CI] C/glib build broken on OS X

2018-08-21 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-3100: Summary: [CI] C/glib build broken on OS X Key: ARROW-3100 URL: https://issues.apache.org/jira/browse/ARROW-3100 Project: Apache Arrow Issue Type: Bug

[VOTE] Combining Arrow C++ development process with Apache Parquet C++

2018-08-21 Thread Wes McKinney
hi folks, As discussed on the Arrow and Parquet mailing lists ([1]) and resolved by vote on the Parquet mailing list ([2]), we are proposing to develop the Arrow and Parquet C++ libraries out of the apache/arrow git repository, while continuing to create separate Parquet C++ releases. In order to

[RESULT] [VOTE] Accept donation of Gandiva to Apache Arrow

2018-08-21 Thread Wes McKinney
The vote carries with 7 binding +1 votes and 4 non-binding +1. We look forward to working with the Gandiva developers on the code donation process. On Sat, Aug 18, 2018 at 11:09 PM, Uwe L. Korn wrote: > +1 > > Uwe > > On Fri, Aug 17, 2018, at 5:02 AM, Jeff Zhang wrote: >> +1 >> >> Phillip Cloud 于2

Re: Developing native Arrow interfaces to database protocols

2018-08-21 Thread Dimitri Vorona
Hi Wes, I would personally be very interested in this project and see it as a huge extension of Arrow's capabilities. I actually experimented with integration of Arrow into a main-memory db (HyPer [0]) though I might have had a slightly different focus. The way I took was to compile the export/impo

Re: [RESULT] [VOTE] Accept donation of Gandiva to Apache Arrow

2018-08-21 Thread Pindikura Ravindra
On Tue, Aug 21, 2018 at 10:27 PM Wes McKinney wrote: > The vote carries with 7 binding +1 votes and 4 non-binding +1 > > We look forward to working with the Gandiva developers on the code > donation process > Thank you! We got delayed a bit, but plan to focus on making this happen in the comin

Timeline for 0.11 Arrow release

2018-08-21 Thread Wes McKinney
hi all, I know it seems like we just released 0.10, but we are already 75 issues deep in 0.11 and have addressed a number of issues that surfaced with 0.10 https://issues.apache.org/jira/projects/ARROW/versions/12343066 I suggest that we plan to release again the week of September 10. That will

[ANNOUNCE] New Arrow PMC member: Antoine Pitrou

2018-08-21 Thread Wes McKinney
The Project Management Committee (PMC) for Apache Arrow has invited Antoine Pitrou to become a PMC member and we are pleased to announce that he has accepted. Congratulations and welcome, Antoine!

Re: [ANNOUNCE] New Arrow PMC member: Antoine Pitrou

2018-08-21 Thread Phillip Cloud
Welcome! Great to have you on the team! On Tue, Aug 21, 2018 at 4:27 PM Wes McKinney wrote: > The Project Management Committee (PMC) for Apache Arrow has invited > Antoine Pitrou to become a PMC member and we are pleased to announce > that he has accepted. > > Congratulations and welcome, Antoin

[jira] [Created] (ARROW-3101) [C++] Define common abstract APIs for database clients

2018-08-21 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3101: Summary: [C++] Define common abstract APIs for database clients Key: ARROW-3101 URL: https://issues.apache.org/jira/browse/ARROW-3101 Project: Apache Arrow Iss

[jira] [Created] (ARROW-3103) [C++] Conversion to Arrow record batch for HiveServer2 ColumnarRowSet

2018-08-21 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3103: Summary: [C++] Conversion to Arrow record batch for HiveServer2 ColumnarRowSet Key: ARROW-3103 URL: https://issues.apache.org/jira/browse/ARROW-3103 Project: Apache Arr

[jira] [Created] (ARROW-3102) [C++] Native Arrow interface to sqlite3

2018-08-21 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3102: Summary: [C++] Native Arrow interface to sqlite3 Key: ARROW-3102 URL: https://issues.apache.org/jira/browse/ARROW-3102 Project: Apache Arrow Issue Type: New Fe

[jira] [Created] (ARROW-3104) [Python] Python bindings for HiveServer2 client interface

2018-08-21 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3104: Summary: [Python] Python bindings for HiveServer2 client interface Key: ARROW-3104 URL: https://issues.apache.org/jira/browse/ARROW-3104 Project: Apache Arrow

Re: [ANNOUNCE] New Arrow PMC member: Antoine Pitrou

2018-08-21 Thread Philipp Moritz
Awesome, congratulations and welcome! On Tue, Aug 21, 2018 at 1:28 PM, Phillip Cloud wrote: > Welcome! Great to have you on the team! > > On Tue, Aug 21, 2018 at 4:27 PM Wes McKinney wrote: > > > The Project Management Committee (PMC) for Apache Arrow has invited > > Antoine Pitrou to become a

Re: Developing native Arrow interfaces to database protocols

2018-08-21 Thread Wes McKinney
hi Dimitri, On Tue, Aug 21, 2018 at 1:04 PM, Dimitri Vorona wrote: > Hi Wes, > > I would personally be very interested in this project and see it as huge > extension of Arrow's capabilities. > > I actually experimented with integration of Arrow into a main-memory db > (HyPer [0]) though I might h

Re: [ANNOUNCE] New Arrow PMC member: Antoine Pitrou

2018-08-21 Thread Antoine Pitrou
Hi, Thanks for inviting and accepting me! I really feel honored. Best regards Antoine. Le 21/08/2018 à 22:26, Wes McKinney a écrit : > The Project Management Committee (PMC) for Apache Arrow has invited > Antoine Pitrou to become a PMC member and we are pleased to announce > that he has acc

Re: [VOTE] Combining Arrow C++ development process with Apache Parquet C++

2018-08-21 Thread Antoine Pitrou
+1 from me. Regards Antoine. Le 21/08/2018 à 18:49, Wes McKinney a écrit : > hi folks, > > As discussed on the Arrow and Parquet mailing lists ([1]) and resolved > by vote on the Parquet mailing list ([2]), we are proposing to develop > the Arrow and Parquet C++ libraries out of the apache/a

Re: [VOTE] Combining Arrow C++ development process with Apache Parquet C++

2018-08-21 Thread Philipp Moritz
+1 for the monorepo plan and push access to Parquet C++ committers -- Philipp. On Tue, Aug 21, 2018 at 3:12 PM, Antoine Pitrou wrote: > > +1 from me. > > Regards > > Antoine. > > > Le 21/08/2018 à 18:49, Wes McKinney a écrit : > > hi folks, > > > > As discussed on the Arrow and Parquet mailing

Re: [VOTE] Combining Arrow C++ development process with Apache Parquet C++

2018-08-21 Thread Jacques Nadeau
+1 On Tue, Aug 21, 2018 at 3:21 PM Philipp Moritz wrote: > +1 for the monorepo plan and push access to Parquet C++ committers > > -- Philipp. > > On Tue, Aug 21, 2018 at 3:12 PM, Antoine Pitrou > wrote: > > > > > +1 from me. > > > > Regards > > > > Antoine. > > > > > > Le 21/08/2018 à 18:49,

Re: [VOTE] Combining Arrow C++ development process with Apache Parquet C++

2018-08-21 Thread Phillip Cloud
+1 On Tue, Aug 21, 2018 at 6:26 PM Jacques Nadeau wrote: > +1 > > > > On Tue, Aug 21, 2018 at 3:21 PM Philipp Moritz wrote: > > > +1 for the monorepo plan and push access to Parquet C++ committers > > > > -- Philipp. > > > > On Tue, Aug 21, 2018 at 3:12 PM, Antoine Pitrou > > wrote: > > > > >

[jira] [Created] (ARROW-3105) [Plasma] Improve flushing error message

2018-08-21 Thread Philipp Moritz (JIRA)
Philipp Moritz created ARROW-3105: Summary: [Plasma] Improve flushing error message Key: ARROW-3105 URL: https://issues.apache.org/jira/browse/ARROW-3105 Project: Apache Arrow Issue Type: Im

[jira] [Created] (ARROW-3106) [Website] Update committers and PMC roster on website

2018-08-21 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-3106: Summary: [Website] Update committers and PMC roster on website Key: ARROW-3106 URL: https://issues.apache.org/jira/browse/ARROW-3106 Project: Apache Arrow Iss

Re: Timeline for 0.11 Arrow release

2018-08-21 Thread Uwe L. Korn
Hello, I will also go over the release and add items. For my personal goal for 0.11, I want to have predicate pushdown for Parquet files working. This means that we should be able to determine in Python code what the relevant RowGroups in a file are as well as filtering a Table given the set of

Re: [VOTE] Combining Arrow C++ development process with Apache Parquet C++

2018-08-21 Thread Uwe L. Korn
+1 On Wed, Aug 22, 2018, at 12:53 AM, Phillip Cloud wrote: > +1 > > On Tue, Aug 21, 2018 at 6:26 PM Jacques Nadeau wrote: > > > +1 > > > > > > > > On Tue, Aug 21, 2018 at 3:21 PM Philipp Moritz wrote: > > > > > +1 for the monorepo plan and push access to Parquet C++ committers > > > > > > -- P