[jira] [Created] (ARROW-2041) [Python] pyarrow.serialize has high overhead for list of NumPy arrays

2018-01-25 Thread Richard Shin (JIRA)
Richard Shin created ARROW-2041: --- Summary: [Python] pyarrow.serialize has high overhead for list of NumPy arrays Key: ARROW-2041 URL: https://issues.apache.org/jira/browse/ARROW-2041 Project: Apache Arr

[jira] [Created] (ARROW-2040) [Python] pyarrow.read_serialized returns bogus data

2018-01-25 Thread Richard Shin (JIRA)
Richard Shin created ARROW-2040: --- Summary: [Python] pyarrow.read_serialized returns bogus data Key: ARROW-2040 URL: https://issues.apache.org/jira/browse/ARROW-2040 Project: Apache Arrow Issue

[jira] [Created] (ARROW-2039) [Python] pyarrow.Buffer().to_pybytes() segfaults

2018-01-25 Thread Richard Shin (JIRA)
Richard Shin created ARROW-2039: --- Summary: [Python] pyarrow.Buffer().to_pybytes() segfaults Key: ARROW-2039 URL: https://issues.apache.org/jira/browse/ARROW-2039 Project: Apache Arrow Issue Typ

Re: Typelib file for namespace 'Arrow' not found for go examples

2018-01-25 Thread Kouhei Sutou
Hi, In <20180125.222043.1260691864055204959@clear-code.com> "Re: Typelib file for namespace 'Arrow' not found for go examples" on Thu, 25 Jan 2018 22:20:43 +0900 (JST), Kouhei Sutou wrote: > 1. gir1.2-arrow-1.0 deb package for Ubuntu 14.04 installs > .typelib file to wrong directory.

[jira] [Created] (ARROW-2038) [Python] Follow-up bug fixes for s3fs Parquet support

2018-01-25 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2038: --- Summary: [Python] Follow-up bug fixes for s3fs Parquet support Key: ARROW-2038 URL: https://issues.apache.org/jira/browse/ARROW-2038 Project: Apache Arrow Issu

[jira] [Created] (ARROW-2037) [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty'

2018-01-25 Thread Phillip Cloud (JIRA)
Phillip Cloud created ARROW-2037: Summary: [Python]: Add tests for ARROW-1941 cases where pandas inferred type is 'empty' Key: ARROW-2037 URL: https://issues.apache.org/jira/browse/ARROW-2037 Project:

[jira] [Created] (ARROW-2036) NativeFile should support standard IOBase methods

2018-01-25 Thread Jim Crist (JIRA)
Jim Crist created ARROW-2036: Summary: NativeFile should support standard IOBase methods Key: ARROW-2036 URL: https://issues.apache.org/jira/browse/ARROW-2036 Project: Apache Arrow Issue Type: Im

Re: [DISCUSS] Union Vector

2018-01-25 Thread Philipp Moritz
Hey Li, In Ray we need the second type of union, since there can be arbitrary nesting. -- Philipp. On Thu, Jan 25, 2018 at 8:56 AM, Li Jin wrote: > Hi All, > > I'd like to bump this thread to get some more feedback from other people. > I think what Wes says makes sense, there seem to be two

Re: [DISCUSS] Union Vector

2018-01-25 Thread Li Jin
Hi All, I'd like to bump this thread to get some more feedback from other people. I think what Wes says makes sense, there seem to be two requirements for union types and it might make sense to make them different types. I think Dremio has more use cases for the first type of union. I think Ray a

[jira] [Created] (ARROW-2035) [C++] Update vendored cpplint.py to a Py3-compatible one

2018-01-25 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2035: --- Summary: [C++] Update vendored cpplint.py to a Py3-compatible one Key: ARROW-2035 URL: https://issues.apache.org/jira/browse/ARROW-2035 Project: Apache Arrow I

[jira] [Created] (ARROW-2034) [C++] Filesystem implementation for Azure Blob Store

2018-01-25 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2034: --- Summary: [C++] Filesystem implementation for Azure Blob Store Key: ARROW-2034 URL: https://issues.apache.org/jira/browse/ARROW-2034 Project: Apache Arrow Issue

Re: [Python] Disk size performance of Snappy vs Brotli vs Blosc

2018-01-25 Thread simba nyatsanga
Thanks all for the great feedback! Thanks Daniel for the sample data sets. I loaded them up and they're quite comparable in size to some of the data I'm dealing with. In my case the shapes range from 150 to ~100 million rows. Column-wise they range from 2-3 columns to ~500,000 columns. Thanks Wes

Re: Typelib file for namespace 'Arrow' not found for go examples

2018-01-25 Thread Kouhei Sutou
Hi, Thanks for reporting it. In "Typelib file for namespace 'Arrow' not found for go examples" on Wed, 24 Jan 2018 01:41:58 -0800, Mike Sam wrote: > % git clone https://github.com/apache/arrow.git ~/arrow > % cd ~/arrow/c_glib/example/go > % make generate > > This returns the following e

[jira] [Created] (ARROW-2033) pa.array() doesn't work with iterators

2018-01-25 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-2033: - Summary: pa.array() doesn't work with iterators Key: ARROW-2033 URL: https://issues.apache.org/jira/browse/ARROW-2033 Project: Apache Arrow Issue Type: Bug