[jira] [Assigned] (ARROW-3358) [Gandiva][C++] Replace usages of gandiva/status.h with arrow/status.h
[ https://issues.apache.org/jira/browse/ARROW-3358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Praveen Kumar Desabandu reassigned ARROW-3358: -- Assignee: Praveen Kumar Desabandu > [Gandiva][C++] Replace usages of gandiva/status.h with arrow/status.h > - > > Key: ARROW-3358 > URL: https://issues.apache.org/jira/browse/ARROW-3358 > Project: Apache Arrow > Issue Type: Improvement > Components: C++ >Reporter: Wes McKinney >Assignee: Praveen Kumar Desabandu >Priority: Major > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-1382) [Python] Deduplicate non-scalar Python objects when using pyarrow.serialize
[ https://issues.apache.org/jira/browse/ARROW-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1382: -- Labels: pull-request-available (was: ) > [Python] Deduplicate non-scalar Python objects when using pyarrow.serialize > --- > > Key: ARROW-1382 > URL: https://issues.apache.org/jira/browse/ARROW-1382 > Project: Apache Arrow > Issue Type: Bug > Components: Python >Reporter: Robert Nishihara >Priority: Major > Labels: pull-request-available > Fix For: 0.13.0 > > > If a Python object appears multiple times within a list/tuple/dictionary, > then when pyarrow serializes the object, it will duplicate the object many > times. This leads to a potentially huge expansion in the size of the object > (e.g., the serialized version of {{100 * [np.zeros(10 ** 6)]}} will be 100 > times bigger than it needs to be). > {code} > import pyarrow as pa > l = [0] > original_object = [l, l] > # Serialize and deserialize the object. > buf = pa.serialize(original_object).to_buffer() > new_object = pa.deserialize(buf) > # This works. > assert original_object[0] is original_object[1] > # This fails. > assert new_object[0] is new_object[1] > {code} > One potential way to address this is to use the Arrow dictionary encoding. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3644) [Rust] Implement ListArrayBuilder
[ https://issues.apache.org/jira/browse/ARROW-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan updated ARROW-3644: --- Issue Type: Sub-task (was: New Feature) Parent: ARROW-3089 > [Rust] Implement ListArrayBuilder > - > > Key: ARROW-3644 > URL: https://issues.apache.org/jira/browse/ARROW-3644 > Project: Apache Arrow > Issue Type: Sub-task > Components: Rust >Reporter: Paddy Horan >Assignee: Paddy Horan >Priority: Major > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3644) [Rust] Implement ListArrayBuilder
Paddy Horan created ARROW-3644: -- Summary: [Rust] Implement ListArrayBuilder Key: ARROW-3644 URL: https://issues.apache.org/jira/browse/ARROW-3644 Project: Apache Arrow Issue Type: New Feature Components: Rust Reporter: Paddy Horan Assignee: Paddy Horan -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3643) Optimize `push_slice` of `BufferBuilder`
Paddy Horan created ARROW-3643: -- Summary: Optimize `push_slice` of `BufferBuilder` Key: ARROW-3643 URL: https://issues.apache.org/jira/browse/ARROW-3643 Project: Apache Arrow Issue Type: Improvement Components: Rust Reporter: Paddy Horan Current implementation just repeatedly calls `push`, this should be optimized. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3108) [C++] arrow::PrettyPrint for Table instances
[ https://issues.apache.org/jira/browse/ARROW-3108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3108: -- Labels: beginner pull-request-available (was: beginner) > [C++] arrow::PrettyPrint for Table instances > > > Key: ARROW-3108 > URL: https://issues.apache.org/jira/browse/ARROW-3108 > Project: Apache Arrow > Issue Type: New Feature > Components: C++ >Affects Versions: 0.10.0 >Reporter: Uwe L. Korn >Assignee: Uwe L. Korn >Priority: Major > Labels: beginner, pull-request-available > Fix For: 0.12.0 > > > Extend the {{arrow::PrettyPrint}} functionality to also support > {{arrow::Table}} instances in addition to {{RecordBatch}}. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3108) [C++] arrow::PrettyPrint for Table instances
[ https://issues.apache.org/jira/browse/ARROW-3108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn reassigned ARROW-3108: -- Assignee: Uwe L. Korn > [C++] arrow::PrettyPrint for Table instances > > > Key: ARROW-3108 > URL: https://issues.apache.org/jira/browse/ARROW-3108 > Project: Apache Arrow > Issue Type: New Feature > Components: C++ >Affects Versions: 0.10.0 >Reporter: Uwe L. Korn >Assignee: Uwe L. Korn >Priority: Major > Labels: beginner > Fix For: 0.12.0 > > > Extend the {{arrow::PrettyPrint}} functionality to also support > {{arrow::Table}} instances in addition to {{RecordBatch}}. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-3639) [Packaging] Run gandiva nightly packaging tasks
[ https://issues.apache.org/jira/browse/ARROW-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-3639. Resolution: Fixed > [Packaging] Run gandiva nightly packaging tasks > --- > > Key: ARROW-3639 > URL: https://issues.apache.org/jira/browse/ARROW-3639 > Project: Apache Arrow > Issue Type: Improvement > Components: Packaging >Reporter: Krisztian Szucs >Assignee: Krisztian Szucs >Priority: Major > Fix For: 0.12.0 > > > Crossbow now supports building gandiva jars: > https://github.com/apache/arrow/pull/2835 > Add `-g gandiva` to nightly crossbow trigger. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3639) [Packaging] Run gandiva nightly packaging tasks
[ https://issues.apache.org/jira/browse/ARROW-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-3639: -- Assignee: Krisztian Szucs > [Packaging] Run gandiva nightly packaging tasks > --- > > Key: ARROW-3639 > URL: https://issues.apache.org/jira/browse/ARROW-3639 > Project: Apache Arrow > Issue Type: Improvement > Components: Packaging >Reporter: Krisztian Szucs >Assignee: Krisztian Szucs >Priority: Major > Fix For: 0.12.0 > > > Crossbow now supports building gandiva jars: > https://github.com/apache/arrow/pull/2835 > Add `-g gandiva` to nightly crossbow trigger. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-3639) [Packaging] Run gandiva nightly packaging tasks
[ https://issues.apache.org/jira/browse/ARROW-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1358#comment-1358 ] Krisztian Szucs commented on ARROW-3639: https://github.com/kszucs/crossbow/commit/d57d191105d85f763c47e7c6b94a147765687f16 > [Packaging] Run gandiva nightly packaging tasks > --- > > Key: ARROW-3639 > URL: https://issues.apache.org/jira/browse/ARROW-3639 > Project: Apache Arrow > Issue Type: Improvement > Components: Packaging >Reporter: Krisztian Szucs >Priority: Major > Fix For: 0.12.0 > > > Crossbow now supports building gandiva jars: > https://github.com/apache/arrow/pull/2835 > Add `-g gandiva` to nightly crossbow trigger. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3642) [C++] Add arrowConfig.cmake generation
[ https://issues.apache.org/jira/browse/ARROW-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3642: -- Labels: pull-request-available (was: ) > [C++] Add arrowConfig.cmake generation > -- > > Key: ARROW-3642 > URL: https://issues.apache.org/jira/browse/ARROW-3642 > Project: Apache Arrow > Issue Type: Improvement > Components: C++ >Reporter: Uwe L. Korn >Assignee: Uwe L. Korn >Priority: Major > Labels: pull-request-available > > This allows simple usage of Arrow in C++ packages using > {{find_package(arrow)}} with no additional {{FindArrow.cmake}} in > {{cmake_modules}}. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3642) [C++] Add arrowConfig.cmake generation
Uwe L. Korn created ARROW-3642: -- Summary: [C++] Add arrowConfig.cmake generation Key: ARROW-3642 URL: https://issues.apache.org/jira/browse/ARROW-3642 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Uwe L. Korn Assignee: Uwe L. Korn This allows simple usage of Arrow in C++ packages using {{find_package(arrow)}} with no additional {{FindArrow.cmake}} in {{cmake_modules}}. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3641) [C++/Python] remove public keyword from Cython api functions
Uwe L. Korn created ARROW-3641: -- Summary: [C++/Python] remove public keyword from Cython api functions Key: ARROW-3641 URL: https://issues.apache.org/jira/browse/ARROW-3641 Project: Apache Arrow Issue Type: Improvement Components: C++, Python Reporter: Uwe L. Korn Fix For: 0.12.0 Based on a conversation with Stefan Behnel, we should be able to change the {{cdef public api}} statements in pyarrow/public-api.pxi to simply {{cdef api}} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3636) [C++/Python] Update arrow/python/pyarrow_api.h
[ https://issues.apache.org/jira/browse/ARROW-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3636: -- Labels: pull-request-available (was: ) > [C++/Python] Update arrow/python/pyarrow_api.h > -- > > Key: ARROW-3636 > URL: https://issues.apache.org/jira/browse/ARROW-3636 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Python >Reporter: Wes McKinney >Priority: Major > Labels: pull-request-available > Fix For: 0.12.0 > > > This file should be updated to one generated by Cython 0.29. Also based on a > conversation with Stefan Behnel, we should be able to change the {{cdef > public api}} statements in pyarrow/public-api.pxi to simply {{cdef api}} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-3161) [Packaging] Ensure to run pyarrow unit tests in conda and wheel builds
[ https://issues.apache.org/jira/browse/ARROW-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-3161. Resolution: Fixed > [Packaging] Ensure to run pyarrow unit tests in conda and wheel builds > -- > > Key: ARROW-3161 > URL: https://issues.apache.org/jira/browse/ARROW-3161 > Project: Apache Arrow > Issue Type: Improvement > Components: Packaging >Reporter: Krisztian Szucs >Assignee: Krisztian Szucs >Priority: Major > > Wheel linux and wheel osx run pyarrow tests too, others only test the imports. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-3638) [C++][Python] Move reading from Feather as Table feature to C++ from Python
[ https://issues.apache.org/jira/browse/ARROW-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn resolved ARROW-3638. Resolution: Fixed Fix Version/s: 0.12.0 Issue resolved by pull request 2853 [https://github.com/apache/arrow/pull/2853] > [C++][Python] Move reading from Feather as Table feature to C++ from Python > --- > > Key: ARROW-3638 > URL: https://issues.apache.org/jira/browse/ARROW-3638 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Python >Reporter: Kouhei Sutou >Assignee: Kouhei Sutou >Priority: Minor > Labels: pull-request-available > Fix For: 0.12.0 > > Time Spent: 50m > Remaining Estimate: 0h > > It's for using the feature from GLib. -- This message was sent by Atlassian JIRA (v7.6.3#76005)