[jira] [Created] (ARROW-6114) Datatypes are not preserved when a pandas dataframe partitioned and saved as parquet file using pyarrow

2019-08-01 Thread Naga (JIRA)
Naga created ARROW-6114: --- Summary: Datatypes are not preserved when a pandas dataframe partitioned and saved as parquet file using pyarrow Key: ARROW-6114 URL: https://issues.apache.org/jira/browse/ARROW-6114 P

[jira] [Created] (ARROW-6113) [Java] Support vector deduplicate function

2019-08-01 Thread Liya Fan (JIRA)
Liya Fan created ARROW-6113: --- Summary: [Java] Support vector deduplicate function Key: ARROW-6113 URL: https://issues.apache.org/jira/browse/ARROW-6113 Project: Apache Arrow Issue Type: New Feature

[jira] [Created] (ARROW-6112) [Java] Update APIs to support 64-bit address space

2019-08-01 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-6112: -- Summary: [Java] Update APIs to support 64-bit address space Key: ARROW-6112 URL: https://issues.apache.org/jira/browse/ARROW-6112 Project: Apache Arrow I

[jira] [Created] (ARROW-6110) [Java] Support LargeList Type and add integration test with C++

2019-08-01 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-6110: -- Summary: [Java] Support LargeList Type and add integration test with C++ Key: ARROW-6110 URL: https://issues.apache.org/jira/browse/ARROW-6110 Project: Apache Arr

[jira] [Created] (ARROW-6111) [Java] Support LargeVarChar and LargeBinary types and add integration test with C++

2019-08-01 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-6111: -- Summary: [Java] Support LargeVarChar and LargeBinary types and add integration test with C++ Key: ARROW-6111 URL: https://issues.apache.org/jira/browse/ARROW-6111

[jira] [Created] (ARROW-6109) [Integration] Docker image for integration testing can't be built on windows

2019-08-01 Thread Paddy Horan (JIRA)
Paddy Horan created ARROW-6109: -- Summary: [Integration] Docker image for integration testing can't be built on windows Key: ARROW-6109 URL: https://issues.apache.org/jira/browse/ARROW-6109 Project: Apach

[jira] [Created] (ARROW-6108) [C++] Appveyor Build_Debug configuration is hanging in C++ unit tests

2019-08-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6108: --- Summary: [C++] Appveyor Build_Debug configuration is hanging in C++ unit tests Key: ARROW-6108 URL: https://issues.apache.org/jira/browse/ARROW-6108 Project: Apache Arr

[jira] [Created] (ARROW-6107) [Go] ipc.Writer Option to skip appending data buffers

2019-08-01 Thread Nick Poorman (JIRA)
Nick Poorman created ARROW-6107: --- Summary: [Go] ipc.Writer Option to skip appending data buffers Key: ARROW-6107 URL: https://issues.apache.org/jira/browse/ARROW-6107 Project: Apache Arrow Issu

[jira] [Created] (ARROW-6106) Scala lang support

2019-08-01 Thread Boris V.Kuznetsov (JIRA)
Boris V.Kuznetsov created ARROW-6106: Summary: Scala lang support Key: ARROW-6106 URL: https://issues.apache.org/jira/browse/ARROW-6106 Project: Apache Arrow Issue Type: Wish

[jira] [Created] (ARROW-6105) [C++][Parquet][Python] Add test case showing dictionary-encoded subfields in nested type

2019-08-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6105: --- Summary: [C++][Parquet][Python] Add test case showing dictionary-encoded subfields in nested type Key: ARROW-6105 URL: https://issues.apache.org/jira/browse/ARROW-6105

Re: [DISCUSS] Release cadence and release vote conventions

2019-08-01 Thread Wes McKinney
I agree. In my experiences as RM I have found the involvement of Maven in the release process to be a nuisance. I think it makes more sense in Java-only projects On Thu, Aug 1, 2019 at 2:54 PM Andy Grove wrote: > > I'll start taking a look at the maven issue. We might not want to use maven > rele

[jira] [Created] (ARROW-6104) [Rust] [DataFusion] Don't allow bare_trait_objects

2019-08-01 Thread Andy Grove (JIRA)
Andy Grove created ARROW-6104: - Summary: [Rust] [DataFusion] Don't allow bare_trait_objects Key: ARROW-6104 URL: https://issues.apache.org/jira/browse/ARROW-6104 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-6103) [Java] Do we really want to use the maven release plugin?

2019-08-01 Thread Andy Grove (JIRA)
Andy Grove created ARROW-6103: - Summary: [Java] Do we really want to use the maven release plugin? Key: ARROW-6103 URL: https://issues.apache.org/jira/browse/ARROW-6103 Project: Apache Arrow Issu

Re: [DISCUSS] Release cadence and release vote conventions

2019-08-01 Thread Andy Grove
I'll start taking a look at the maven issue. We might not want to use maven release plugin given that we control the version number already across this repository via other means. On Wed, Jul 31, 2019 at 4:26 PM Sutou Kouhei wrote: > Hi, > > Sorry for not replying this thread. > > I think that t

[jira] [Created] (ARROW-6102) [Testing] Add partitioned CSV file to arrow-testing repo

2019-08-01 Thread Andy Grove (JIRA)
Andy Grove created ARROW-6102: - Summary: [Testing] Add partitioned CSV file to arrow-testing repo Key: ARROW-6102 URL: https://issues.apache.org/jira/browse/ARROW-6102 Project: Apache Arrow Issue

Re: [DISCUSS] Add GetFlightSchema to Flight RPC

2019-08-01 Thread Bryan Cutler
Sounds good to me, I would just echo what others have said. On Thu, Aug 1, 2019 at 8:17 AM Ryan Murray wrote: > Thanks Wes, > > The descriptor is only there to maintain a bit of symmetry with > GetFlightInfo. Happy to remove it, I don't think its necessary and already > a few people agree. Simil

[jira] [Created] (ARROW-6101) [Rust] [DataFusion] Create physical plan from logical plan

2019-08-01 Thread Andy Grove (JIRA)
Andy Grove created ARROW-6101: - Summary: [Rust] [DataFusion] Create physical plan from logical plan Key: ARROW-6101 URL: https://issues.apache.org/jira/browse/ARROW-6101 Project: Apache Arrow Iss

Re: Metadata orderedness?

2019-08-01 Thread Wes McKinney
I think that orderedness should not matter for equality testing. Semantically I think that this field is supposed to be dictionary-like and the keys are intended to be unique (but this isn't stipulated in Schema.fbs at the moment) On Thu, Aug 1, 2019 at 10:47 AM Antoine Pitrou wrote: > > > Hello,

[jira] [Created] (ARROW-6100) [Rust] Pin to specific Rust nightly release

2019-08-01 Thread Andy Grove (JIRA)
Andy Grove created ARROW-6100: - Summary: [Rust] Pin to specific Rust nightly release Key: ARROW-6100 URL: https://issues.apache.org/jira/browse/ARROW-6100 Project: Apache Arrow Issue Type: Improv

[jira] [Created] (ARROW-6099) [JAVA] Has the ability to not using slf4j logging framework

2019-08-01 Thread Haowei Yu (JIRA)
Haowei Yu created ARROW-6099: Summary: [JAVA] Has the ability to not using slf4j logging framework Key: ARROW-6099 URL: https://issues.apache.org/jira/browse/ARROW-6099 Project: Apache Arrow Iss

[jira] [Created] (ARROW-6098) [C++] Partially mitigating CPU scaling effects in benchmarks

2019-08-01 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6098: --- Summary: [C++] Partially mitigating CPU scaling effects in benchmarks Key: ARROW-6098 URL: https://issues.apache.org/jira/browse/ARROW-6098 Project: Apache Arrow

Metadata orderedness?

2019-08-01 Thread Antoine Pitrou
Hello, Is key/value metadata (as attached to fields) supposed to be ordered or unordered? In the C++ codebase currently, order is significant in KeyValueMetadata::Equals(). Regards Antoine.

Re: [DISCUSS] Add GetFlightSchema to Flight RPC

2019-08-01 Thread Ryan Murray
Thanks Wes, The descriptor is only there to maintain a bit of symmetry with GetFlightInfo. Happy to remove it, I don't think its necessary and already a few people agree. Similar with the method name, I am neutral to the naming and can call it whatever the community is happy with. Best, Ryan On

[jira] [Created] (ARROW-6097) [Java] Avro adapter implement unions type

2019-08-01 Thread Ji Liu (JIRA)
Ji Liu created ARROW-6097: - Summary: [Java] Avro adapter implement unions type Key: ARROW-6097 URL: https://issues.apache.org/jira/browse/ARROW-6097 Project: Apache Arrow Issue Type: Sub-task

Re: [DISCUSS] Add GetFlightSchema to Flight RPC

2019-08-01 Thread Wes McKinney
I'm generally supporting of adding the new RPC endpoint. To make a couple points from the document * I'm not sure what the purpose of returning the FlightDescriptor is, but I haven't thought too hard about it * The Schema consists of a single IPC message -- dictionaries will appear in the actual

[jira] [Created] (ARROW-6096) [C++] Remove dependency on boost regex library

2019-08-01 Thread Hatem Helal (JIRA)
Hatem Helal created ARROW-6096: -- Summary: [C++] Remove dependency on boost regex library Key: ARROW-6096 URL: https://issues.apache.org/jira/browse/ARROW-6096 Project: Apache Arrow Issue Type: I

[jira] [Created] (ARROW-6095) [C++] Python subproject ignores ARROW_TEST_LINKAGE

2019-08-01 Thread Benjamin Kietzman (JIRA)
Benjamin Kietzman created ARROW-6095: Summary: [C++] Python subproject ignores ARROW_TEST_LINKAGE Key: ARROW-6095 URL: https://issues.apache.org/jira/browse/ARROW-6095 Project: Apache Arrow

Re: [DISCUSS] Add GetFlightSchema to Flight RPC

2019-08-01 Thread David Li
Hi Ryan, Thanks for writing this up! I made a couple of minor comments in the doc/implementation, but overall I'm in favor of having this RPC method. Best, David On 8/1/19, Ryan Murray wrote: > Hi All, > > Please see the attached document for a proposed addition to the Flight > RPC[1]. This is

Re: [DISCUSS][Format] FixedSizeList w/ row-length not specified as part of the type

2019-08-01 Thread Edward Loper
Brian: yes, you're correct. Sorry, I've been playing around with a couple different ways to extend things, and was conflating them when I wrote my response. For this proposal, the dimension must have the same size for all items in a given record batch. As suggested by Francois and Wes, I will lo

[DISCUSS] Add GetFlightSchema to Flight RPC

2019-08-01 Thread Ryan Murray
Hi All, Please see the attached document for a proposed addition to the Flight RPC[1]. This is the result of a previous mailing list discussion[2]. I have created the Pull Request[3] to make the proposal a little more concrete. Please let me know if you have any

[jira] [Created] (ARROW-6094) Add GetFlightSchema to Flight RPC

2019-08-01 Thread Ryan Murray (JIRA)
Ryan Murray created ARROW-6094: -- Summary: Add GetFlightSchema to Flight RPC Key: ARROW-6094 URL: https://issues.apache.org/jira/browse/ARROW-6094 Project: Apache Arrow Issue Type: Task

[jira] [Created] (ARROW-6093) [Java] reduce branches in algo for first match in VectorRangeSearcher

2019-08-01 Thread Pindikura Ravindra (JIRA)
Pindikura Ravindra created ARROW-6093: - Summary: [Java] reduce branches in algo for first match in VectorRangeSearcher Key: ARROW-6093 URL: https://issues.apache.org/jira/browse/ARROW-6093 Project

[jira] [Created] (ARROW-6092) [C++] unit test failure due to unexpected result

2019-08-01 Thread Lee June Woo (JIRA)
Lee June Woo created ARROW-6092: --- Summary: [C++] unit test failure due to unexpected result Key: ARROW-6092 URL: https://issues.apache.org/jira/browse/ARROW-6092 Project: Apache Arrow Issue Ty