[jira] [Created] (ARROW-5869) Need a way to access UnionArray's children as Arrays in pyarrow

2019-07-06 Thread Jim Pivarski (JIRA)
Jim Pivarski created ARROW-5869: --- Summary: Need a way to access UnionArray's children as Arrays in pyarrow Key: ARROW-5869 URL: https://issues.apache.org/jira/browse/ARROW-5869 Project: Apache Arrow

Re: [Discuss] Format additions to Arrow for sparse data and data integrity

2019-07-06 Thread Jacques Nadeau
> > What is the driving force for transport compression? Are you seeing that >> as a major bottleneck in particular circumstances? (I'm not disagreeing, >> just want to clearly define the particular problem you're worried about.) > > > I've been working on a 20% project where we appear to be IO bou

[DISCUSS] Release cadence and release vote conventions

2019-07-06 Thread Wes McKinney
hi folks, As a reminder, particularly since we have many new community members (some of whom have never been involved with an ASF project before), releases are approved exclusively by the PMC and in general releases cannot be vetoed. In spite of that, we strive to make releases that have unanimous

Re: Flight authentication interoperability

2019-07-06 Thread Wes McKinney
Are there some action items (JIRA issues) to follow up here? At minimum having documentation about this for the Python client side would seem to be in order. On Thu, Jul 4, 2019 at 2:20 PM Ryan Murray wrote: > > Hey David, > > I was actually testing test_flight.test_http_basic_auth(). But I think

Re: flatbuffers vectors and --gen-object-api

2019-07-06 Thread Wes McKinney
On Fri, Jul 5, 2019 at 10:53 AM John Muehlhausen wrote: > > It seems as if Arrow expects for some vectors to be empty rather than null. > (Examples: Footer.dictionaries, Field.children) > When you say "Arrow expects" do you mean the C++ library, or implementations in general? > Anyone using --g

Re: [Discuss] Format additions to Arrow for sparse data and data integrity

2019-07-06 Thread Antoine Pitrou
Hi Micah, Le 05/07/2019 à 20:53, Micah Kornfield a écrit : > > Going into more details on the specific features in the PR: > >1. > >Sparse encodings for arrays and buffers. The guiding principles behind >the suggested encodings are to support encodings that can be exploited by >

Re: [Discuss] Format additions to Arrow for sparse data and data integrity

2019-07-06 Thread Paul Taylor
Hi Micah, Similar to Jacques I'm not disagreeing, but wondering if they belong in Arrow vs. can be done externally. I'm mostly interested in changes that might impact SIMD processing, considering Arrow's already made conscious design decisions to trade memory for speed. Apologies in advance if

Re: [RESULT][VOTE] Release Apache Arrow 0.14.0 - RC0

2019-07-06 Thread Chao Sun
Hi Sutou, Do you know whether we saw this error before? seems it has nothing to do with the patch you put. I think in the meanwhile we can just remove the lines with "[[bench]]" as they are not needed for "cargo publish". We can file a JIRA to investigate this latter. Chao On Thu, Jul 4, 2019 at

[jira] [Created] (ARROW-5868) [Python] AttributeError: module 'pyarrow' has no attribute 'compat'

2019-07-06 Thread Haowei Yu (JIRA)
Haowei Yu created ARROW-5868: Summary: [Python] AttributeError: module 'pyarrow' has no attribute 'compat' Key: ARROW-5868 URL: https://issues.apache.org/jira/browse/ARROW-5868 Project: Apache Arrow

[jira] [Created] (ARROW-5867) [C++][Gandiva] Add support for cast int to decimal

2019-07-06 Thread Pindikura Ravindra (JIRA)
Pindikura Ravindra created ARROW-5867: - Summary: [C++][Gandiva] Add support for cast int to decimal Key: ARROW-5867 URL: https://issues.apache.org/jira/browse/ARROW-5867 Project: Apache Arrow