[jira] [Resolved] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1873. Resolution: Fixed. Issue resolved by pull request 1404 [https://github.com/apache/arrow/pull/1404]

[jira] [Commented] (ARROW-1864) [Java] Upgrade Netty to 4.1.x

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282882#comment-16282882 ] ASF GitHub Bot commented on ARROW-1864: --- wesm commented on issue #1376: ARROW-1864:

[jira] [Commented] (ARROW-1864) [Java] Upgrade Netty to 4.1.x

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282884#comment-16282884 ] ASF GitHub Bot commented on ARROW-1864: --- wesm closed pull request #1376: ARROW-1864:

[jira] [Resolved] (ARROW-1864) [Java] Upgrade Netty to 4.1.x

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1864. Resolution: Fixed. Issue resolved by pull request 1376 [https://github.com/apache/arrow/pull/1376]

[jira] [Updated] (ARROW-1042) [Python] C++ API plumbing for returning generic instance of ipc::RecordBatchReader to user

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1042: Fix Version/s: 0.9.0 > [Python] C++ API plumbing for returning generic instance of > ipc::RecordBat

[jira] [Updated] (ARROW-1266) [Plasma] Move heap allocations to arrow memory pool

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1266: Fix Version/s: 0.9.0 > [Plasma] Move heap allocations to arrow memory pool > ---

[jira] [Commented] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282857#comment-16282857 ] ASF GitHub Bot commented on ARROW-1873: --- wesm closed pull request #1404: ARROW-1873:

[jira] [Updated] (ARROW-987) [JS] Implement JSON writer for Integration tests

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-987: --- Fix Version/s: 0.9.0 > [JS] Implement JSON writer for Integration tests > -

[jira] [Commented] (ARROW-1864) [Java] Upgrade Netty to 4.1.x

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282771#comment-16282771 ] ASF GitHub Bot commented on ARROW-1864: --- siddharthteotia commented on issue #1376: A

[jira] [Commented] (ARROW-1864) [Java] Upgrade Netty to 4.1.x

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282759#comment-16282759 ] ASF GitHub Bot commented on ARROW-1864: --- zsxwing commented on issue #1376: ARROW-186

[jira] [Commented] (ARROW-1864) [Java] Upgrade Netty to 4.1.x

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282760#comment-16282760 ] ASF GitHub Bot commented on ARROW-1864: --- zsxwing commented on issue #1376: ARROW-186

[jira] [Closed] (ARROW-1816) [Java] Resolve new vector classes structure for timestamp, date and maybe interval

2017-12-07 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin closed ARROW-1816. Resolution: Won't Fix. Won't fix per discussion: https://docs.google.com/document/d/1n4qjO20wZyS7wSpISgYdIVuD22zstL

[jira] [Commented] (ARROW-1816) [Java] Resolve new vector classes structure for timestamp, date and maybe interval

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282638#comment-16282638 ] ASF GitHub Bot commented on ARROW-1816: --- icexelloss closed pull request #1330: ARROW

[jira] [Commented] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282585#comment-16282585 ] ASF GitHub Bot commented on ARROW-1873: --- wesm commented on issue #1404: ARROW-1873:

[jira] [Updated] (ARROW-1036) [C++] Define abstract API for filtering Arrow streams (e.g. predicate evaluation)

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1036: Fix Version/s: 1.0.0 > [C++] Define abstract API for filtering Arrow streams (e.g. predicate > eval

[jira] [Created] (ARROW-1905) [Python] Add more functions for checking exact types in pyarrow.types

2017-12-07 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1905: --- Summary: [Python] Add more functions for checking exact types in pyarrow.types Key: ARROW-1905 URL: https://issues.apache.org/jira/browse/ARROW-1905 Project: Apache Arr

[jira] [Assigned] (ARROW-1904) [C++] Using the raw_values() method on arrow::PrimitiveArray yields unreliable results on some compilers

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-1904: --- Assignee: Wes McKinney > [C++] Using the raw_values() method on arrow::PrimitiveArray yields

[jira] [Commented] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282516#comment-16282516 ] ASF GitHub Bot commented on ARROW-1873: --- wesm opened a new pull request #1404: ARROW

[jira] [Updated] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1873: -- Labels: pull-request-available (was: ) > [Python] Segmentation fault when loading total 2GB of

[jira] [Commented] (ARROW-1884) [C++] Make JsonReader/JsonWriter classes internal APIs

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282514#comment-16282514 ] ASF GitHub Bot commented on ARROW-1884: --- wesm closed pull request #1400: ARROW-1884:

[jira] [Resolved] (ARROW-1884) [C++] Make JsonReader/JsonWriter classes internal APIs

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1884. Resolution: Fixed. Issue resolved by pull request 1400 [https://github.com/apache/arrow/pull/1400]

[jira] [Created] (ARROW-1904) [C++] Using the raw_values() method on arrow::PrimitiveArray yields unreliable results on some compilers

2017-12-07 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1904: --- Summary: [C++] Using the raw_values() method on arrow::PrimitiveArray yields unreliable results on some compilers Key: ARROW-1904 URL: https://issues.apache.org/jira/browse/ARROW-19

[jira] [Updated] (ARROW-1902) [Python] Remove mkdir race condition from write_to_dataset

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1902: -- Labels: pull-request-available (was: ) > [Python] Remove mkdir race condition from write_to_dat

[jira] [Created] (ARROW-1902) [Python] Remove mkdir race condition from write_to_dataset

2017-12-07 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-1902: -- Summary: [Python] Remove mkdir race condition from write_to_dataset Key: ARROW-1902 URL: https://issues.apache.org/jira/browse/ARROW-1902 Project: Apache Arrow

[jira] [Commented] (ARROW-1902) [Python] Remove mkdir race condition from write_to_dataset

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282464#comment-16282464 ] ASF GitHub Bot commented on ARROW-1902: --- xhochy opened a new pull request #1402: ARR

[jira] [Created] (ARROW-1903) [JS] Fix typings consuming apache-arrow module when noImplicitAny is false

2017-12-07 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-1903: -- Summary: [JS] Fix typings consuming apache-arrow module when noImplicitAny is false Key: ARROW-1903 URL: https://issues.apache.org/jira/browse/ARROW-1903 Project: Apache

[jira] [Commented] (ARROW-1901) [Python] Support recursive mkdir for DaskFilesystem

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282452#comment-16282452 ] ASF GitHub Bot commented on ARROW-1901: --- xhochy opened a new pull request #1401: ARR

[jira] [Updated] (ARROW-1901) [Python] Support recursive mkdir for DaskFilesystem

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1901: -- Labels: pull-request-available (was: ) > [Python] Support recursive mkdir for DaskFilesystem >

[jira] [Created] (ARROW-1901) [Python] Support recursive mkdir for DaskFilesystem

2017-12-07 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-1901: -- Summary: [Python] Support recursive mkdir for DaskFilesystem Key: ARROW-1901 URL: https://issues.apache.org/jira/browse/ARROW-1901 Project: Apache Arrow Issue Ty

[jira] [Assigned] (ARROW-1901) [Python] Support recursive mkdir for DaskFilesystem

2017-12-07 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn reassigned ARROW-1901: -- Assignee: Uwe L. Korn > [Python] Support recursive mkdir for DaskFilesystem > -

[jira] [Commented] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282316#comment-16282316 ] Wes McKinney commented on ARROW-1873: - I'm going to add a few missing null checks to h

[jira] [Assigned] (ARROW-1873) [Python] Segmentation fault when loading total 2GB of parquet files

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-1873: --- Assignee: Wes McKinney > [Python] Segmentation fault when loading total 2GB of parquet files

[jira] [Updated] (ARROW-1884) [C++] Make JsonReader/JsonWriter classes internal APIs

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1884: -- Labels: pull-request-available (was: ) > [C++] Make JsonReader/JsonWriter classes internal APIs

[jira] [Commented] (ARROW-1884) [C++] Make JsonReader/JsonWriter classes internal APIs

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282298#comment-16282298 ] ASF GitHub Bot commented on ARROW-1884: --- wesm opened a new pull request #1400: ARROW

[jira] [Updated] (ARROW-1561) [C++] Kernel implementations for "isin" (set containment)

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1561: Fix Version/s: 0.9.0 > [C++] Kernel implementations for "isin" (set containment) > -

[jira] [Updated] (ARROW-1560) [C++] Kernel implementations for "match" function

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1560: Fix Version/s: 0.9.0 > [C++] Kernel implementations for "match" function > -

[jira] [Updated] (ARROW-1569) [C++] Kernel functions for determining monotonicity (ascending or descending) for well-ordered types

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1569: Fix Version/s: 1.0.0 > [C++] Kernel functions for determining monotonicity (ascending or descending)

[jira] [Created] (ARROW-1900) [C++] Add utility functions for determining value range (maximum and minimum) of integer arrays

2017-12-07 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1900: --- Summary: [C++] Add utility functions for determining value range (maximum and minimum) of integer arrays Key: ARROW-1900 URL: https://issues.apache.org/jira/browse/ARROW-1900

[jira] [Updated] (ARROW-1567) [C++] Implement "fill null" kernels that replace null values with some scalar replacement value

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1567: Fix Version/s: 1.0.0 > [C++] Implement "fill null" kernels that replace null values with some scalar

[jira] [Updated] (ARROW-1580) [Python] Instructions for setting up nightly builds on Linux

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1580: Fix Version/s: 0.9.0 > [Python] Instructions for setting up nightly builds on Linux > --

[jira] [Updated] (ARROW-1570) [C++] Define API for creating a kernel instance from function of scalar input and output with a particular signature

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1570: Fix Version/s: 0.9.0 > [C++] Define API for creating a kernel instance from function of scalar input

[jira] [Updated] (ARROW-1501) [JS] JavaScript integration tests

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1501: Fix Version/s: 0.9.0 > [JS] JavaScript integration tests > - > >

[jira] [Updated] (ARROW-1424) [Python] Initial bindings for libarrow_gpu

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1424: Fix Version/s: 0.9.0 > [Python] Initial bindings for libarrow_gpu >

[jira] [Commented] (ARROW-1329) [C++] Define "virtual table" interface

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282175#comment-16282175 ] Wes McKinney commented on ARROW-1329: - There has been partial progress toward this. It

[jira] [Updated] (ARROW-638) [Format] Add metadata for single and double precision complex numbers

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-638: --- Fix Version/s: 1.0.0 > [Format] Add metadata for single and double precision complex numbers >

[jira] [Updated] (ARROW-976) [Python] Provide API for defining and reading Parquet datasets with more ad hoc partition schemes

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-976: --- Fix Version/s: 0.9.0 > [Python] Provide API for defining and reading Parquet datasets with more ad > h

[jira] [Updated] (ARROW-567) [C++] File and stream APIs for interacting with "large" schemas

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-567: --- Fix Version/s: 0.9.0 > [C++] File and stream APIs for interacting with "large" schemas > --

[jira] [Updated] (ARROW-764) [C++] Improve performance of CopyBitmap, add benchmarks

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-764: --- Fix Version/s: 0.9.0 > [C++] Improve performance of CopyBitmap, add benchmarks > --

[jira] [Updated] (ARROW-522) [Java] VectorLoader throws exception data schema contains list of maps.

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-522: --- Summary: [Java] VectorLoader throws exception data schema contains list of maps. (was: VectorLoader th

[jira] [Updated] (ARROW-1393) [C++] Simplified CUDA IPC writer and reader for communicating a CPU + GPU payload to another process

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1393: Fix Version/s: 0.9.0 > [C++] Simplified CUDA IPC writer and reader for communicating a CPU + GPU >

[jira] [Updated] (ARROW-792) [Java] Allow loading/unloading vectors without using FieldNodes

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-792: --- Summary: [Java] Allow loading/unloading vectors without using FieldNodes (was: Allow loading/unloading

[jira] [Updated] (ARROW-1382) [Python] Deduplicate non-scalar Python objects when using pyarrow.serialize

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1382: Fix Version/s: 0.9.0 > [Python] Deduplicate non-scalar Python objects when using pyarrow.serialize >

[jira] [Updated] (ARROW-1329) [C++] Define "virtual table" interface

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1329: Fix Version/s: 1.0.0 > [C++] Define "virtual table" interface >

[jira] [Updated] (ARROW-501) [C++] Implement concurrent / buffering InputStream for streaming data use cases

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-501: --- Fix Version/s: 0.9.0 > [C++] Implement concurrent / buffering InputStream for streaming data use > cas

[jira] [Updated] (ARROW-1012) [C++] Create implementation of StreamReader that reads from Apache Parquet files

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1012: Fix Version/s: 0.9.0 > [C++] Create implementation of StreamReader that reads from Apache Parquet >

[jira] [Updated] (ARROW-1009) [C++] Create asynchronous version of StreamReader

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1009: Fix Version/s: 0.9.0 > [C++] Create asynchronous version of StreamReader > -

[jira] [Updated] (ARROW-530) C++/Python: Provide subpools for better memory allocation tracking

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-530: --- Fix Version/s: 0.9.0 > C++/Python: Provide subpools for better memory allocation tracking > ---

[jira] [Updated] (ARROW-554) [C++] Implement functions to conform unequal dictionaries amongst multiple Arrow arrays

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-554: --- Fix Version/s: 0.9.0 > [C++] Implement functions to conform unequal dictionaries amongst multiple > Ar

[jira] [Updated] (ARROW-973) [Website] Add FAQ page about project

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-973: --- Fix Version/s: 0.9.0 (was: 1.0.0) > [Website] Add FAQ page about project > -

[jira] [Updated] (ARROW-640) [Python] Arrow scalar values should have a sensible __hash__ and comparison

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-640: --- Summary: [Python] Arrow scalar values should have a sensible __hash__ and comparison (was: [Python] Ar

[jira] [Updated] (ARROW-640) [Python] Arrow types should have a sensible __hash__ and comparison

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-640: --- Fix Version/s: 0.9.0 > [Python] Arrow types should have a sensible __hash__ and comparison > --

[jira] [Updated] (ARROW-41) C++: Convert table to std::vector of Struct arrays

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-41?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-41: -- Fix Version/s: 0.9.0 > C++: Convert table to std::vector of Struct arrays > --

[jira] [Updated] (ARROW-40) C++: Reinterpret Struct arrays as tables

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-40?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-40: -- Fix Version/s: 0.9.0 > C++: Reinterpret Struct arrays as tables >

[jira] [Updated] (ARROW-1572) [C++] Implement "value counts" kernels for tabulating value frequencies

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1572: Fix Version/s: 0.9.0 > [C++] Implement "value counts" kernels for tabulating value frequencies > ---

[jira] [Commented] (ARROW-1579) Add dockerized test setup to validate Spark integration

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282092#comment-16282092 ] Wes McKinney commented on ARROW-1579: - After 0.8.0 settles it would be great to have t

[jira] [Updated] (ARROW-1579) Add dockerized test setup to validate Spark integration

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1579: Fix Version/s: 0.9.0 > Add dockerized test setup to validate Spark integration > ---

[jira] [Updated] (ARROW-1599) PyArrow unable to read Parquet files with vector as column

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1599: Fix Version/s: 0.9.0 > PyArrow unable to read Parquet files with vector as column >

[jira] [Updated] (ARROW-1639) [Python] More efficient serialization for RangeIndex in serialize_pandas

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1639: Fix Version/s: 0.9.0 > [Python] More efficient serialization for RangeIndex in serialize_pandas > --

[jira] [Updated] (ARROW-1623) [C++] Add convenience method to construct Buffer from a string that owns its memory

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1623: Fix Version/s: 0.9.0 > [C++] Add convenience method to construct Buffer from a string that owns its

[jira] [Closed] (ARROW-1645) Access HDFS with read_table() automatically

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-1645. Resolution: Duplicate. Duplicate of ARROW-1643. > Access HDFS with read_table() automatically > ---

[jira] [Updated] (ARROW-1644) [Python] Read and write nested Parquet data with a mix of struct and list nesting levels

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1644: Fix Version/s: 0.9.0 > [Python] Read and write nested Parquet data with a mix of struct and list >

[jira] [Updated] (ARROW-1643) [Python] Accept hdfs:// prefixes in parquet.read_table and attempt to connect to HDFS

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1643: Fix Version/s: 0.9.0 > [Python] Accept hdfs:// prefixes in parquet.read_table and attempt to connect

[jira] [Updated] (ARROW-1645) Access HDFS with read_table() automatically

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1645: Fix Version/s: 0.9.0 > Access HDFS with read_table() automatically > ---

[jira] [Commented] (ARROW-1645) Access HDFS with read_table() automatically

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282084#comment-16282084 ] Wes McKinney commented on ARROW-1645: - Could you submit a patch for this? > Access HD

[jira] [Updated] (ARROW-1696) [C++] Add codec benchmarks

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1696: Fix Version/s: 0.9.0 > [C++] Add codec benchmarks > -- > > K

[jira] [Updated] (ARROW-1692) [Python, Java] UnionArray round trip not working

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1692: Fix Version/s: 0.9.0 > [Python, Java] UnionArray round trip not working > --

[jira] [Updated] (ARROW-1705) [Python] Create StructArray (+ type inference) from sequence of dicts

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1705: Fix Version/s: 0.9.0 > [Python] Create StructArray (+ type inference) from sequence of dicts > -

[jira] [Updated] (ARROW-1669) [C++] Consider adding Abseil (Google C++11 standard library extensions) to toolchain

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1669: Fix Version/s: 0.9.0 > [C++] Consider adding Abseil (Google C++11 standard library extensions) to >

[jira] [Updated] (ARROW-1706) [Python] StructArray.from_arrays should handle sequences that are coercible to arrays

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1706: Fix Version/s: 0.9.0 > [Python] StructArray.from_arrays should handle sequences that are coercible

[jira] [Updated] (ARROW-1715) [Python] Implement pickling for Array, Column, ChunkedArray, RecordBatch, Table

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1715: Fix Version/s: 0.9.0 > [Python] Implement pickling for Array, Column, ChunkedArray, RecordBatch, >

[jira] [Updated] (ARROW-1712) [C++] Add method to BinaryBuilder to reserve space for value data

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1712: Fix Version/s: 0.9.0 > [C++] Add method to BinaryBuilder to reserve space for value data > -

[jira] [Updated] (ARROW-1722) [C++] Add linting script to look for C++/CLI issues

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1722: Fix Version/s: 0.9.0 > [C++] Add linting script to look for C++/CLI issues > ---

[jira] [Updated] (ARROW-1731) [Python] Provide for selecting a subset of columns to convert in RecordBatch/Table.from_pandas

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1731: Fix Version/s: 0.9.0 > [Python] Provide for selecting a subset of columns to convert in > RecordBat

[jira] [Updated] (ARROW-1744) [Plasma] Provide TensorFlow operator to read tensors from plasma

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1744: Fix Version/s: 0.9.0 > [Plasma] Provide TensorFlow operator to read tensors from plasma > --

[jira] [Updated] (ARROW-1774) [C++] Add "view" function to create zero-copy views for compatible types, if supported

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1774: Fix Version/s: 0.9.0 > [C++] Add "view" function to create zero-copy views for compatible types, if

[jira] [Updated] (ARROW-1860) [C++] Add data structure to "stage" a sequence of IPC messages from in-memory data

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1860: Fix Version/s: 0.9.0 > [C++] Add data structure to "stage" a sequence of IPC messages from in-memory

[jira] [Updated] (ARROW-1858) [Python] Add documentation about parquet.write_to_dataset and related methods

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1858: Fix Version/s: 0.9.0 > [Python] Add documentation about parquet.write_to_dataset and related methods

[jira] [Updated] (ARROW-1861) [Python] Fix up ASV setup, add developer instructions for writing new benchmarks and running benchmark suite locally

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1861: Fix Version/s: 0.9.0 > [Python] Fix up ASV setup, add developer instructions for writing new > benc

[jira] [Updated] (ARROW-1848) [Python] Add documentation examples for reading single Parquet files and datasets from HDFS

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1848: Fix Version/s: 0.9.0 > [Python] Add documentation examples for reading single Parquet files and > d

[jira] [Updated] (ARROW-1886) [Python] Add function to "flatten" structs within tables

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1886: Fix Version/s: 0.9.0 > [Python] Add function to "flatten" structs within tables > --

[jira] [Updated] (ARROW-1875) Write 64-bit ints as strings in integration test JSON files

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1875: Fix Version/s: 0.9.0 > Write 64-bit ints as strings in integration test JSON files > ---

[jira] [Updated] (ARROW-1870) [JS] Enable build scripts to work with NodeJS 6.10.2 LTS

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1870: Fix Version/s: 0.9.0 > [JS] Enable build scripts to work with NodeJS 6.10.2 LTS > --

[jira] [Updated] (ARROW-1894) [Python] Treat CPython memoryview or buffer objects equivalently to pyarrow.Buffer in pyarrow.serialize

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1894: Fix Version/s: 0.9.0 > [Python] Treat CPython memoryview or buffer objects equivalently to > pyarro

[jira] [Created] (ARROW-1899) [Python] Refactor handling of null sentinels in python/numpy_to_arrow.cc

2017-12-07 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1899: --- Summary: [Python] Refactor handling of null sentinels in python/numpy_to_arrow.cc Key: ARROW-1899 URL: https://issues.apache.org/jira/browse/ARROW-1899 Project: Apache

[jira] [Created] (ARROW-1898) [JS] Update Flatbuffers per metadata changes in ARROW-1785

2017-12-07 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1898: --- Summary: [JS] Update Flatbuffers per metadata changes in ARROW-1785 Key: ARROW-1898 URL: https://issues.apache.org/jira/browse/ARROW-1898 Project: Apache Arrow

[jira] [Updated] (ARROW-1897) [Python] Incorrect numpy_type for pandas metadata of Categoricals

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1897: Summary: [Python] Incorrect numpy_type for pandas metadata of Categoricals (was: Incorrect numpy_ty

[jira] [Commented] (ARROW-1895) [Python] Add field_name to pandas index metadata

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282063#comment-16282063 ] ASF GitHub Bot commented on ARROW-1895: --- cpcloud commented on a change in pull reque

[jira] [Resolved] (ARROW-1893) [Python] test_primitive_serialization fails on Python 2.7.3

2017-12-07 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1893. - Resolution: Fixed Issue resolved by pull request 1398 [https://github.com/apache/arrow/pull/1398]

[jira] [Commented] (ARROW-1893) [Python] test_primitive_serialization fails on Python 2.7.3

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282058#comment-16282058 ] ASF GitHub Bot commented on ARROW-1893: --- wesm closed pull request #1398: ARROW-1893:

[jira] [Commented] (ARROW-1895) [Python] Add field_name to pandas index metadata

2017-12-07 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16282055#comment-16282055 ] ASF GitHub Bot commented on ARROW-1895: --- jorisvandenbossche commented on a change in
