[jira] [Assigned] (ARROW-3358) [Gandiva][C++] Replace usages of gandiva/status.h with arrow/status.h

2018-10-28 Thread Praveen Kumar Desabandu (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Praveen Kumar Desabandu reassigned ARROW-3358:
--

Assignee: Praveen Kumar Desabandu

> [Gandiva][C++] Replace usages of gandiva/status.h with arrow/status.h
> -
>
> Key: ARROW-3358
> URL: https://issues.apache.org/jira/browse/ARROW-3358
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Praveen Kumar Desabandu
>Priority: Major
> Fix For: 0.12.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-1382) [Python] Deduplicate non-scalar Python objects when using pyarrow.serialize

2018-10-28 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-1382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-1382:
--
Labels: pull-request-available  (was: )

> [Python] Deduplicate non-scalar Python objects when using pyarrow.serialize
> ---
>
> Key: ARROW-1382
> URL: https://issues.apache.org/jira/browse/ARROW-1382
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Reporter: Robert Nishihara
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> If a Python object appears multiple times within a list/tuple/dictionary, 
> then when pyarrow serializes the object, it will duplicate the object many 
> times. This leads to a potentially huge expansion in the size of the object 
> (e.g., the serialized version of {{100 * [np.zeros(10 ** 6)]}} will be 100 
> times bigger than it needs to be).
> {code}
> import pyarrow as pa
> l = [0]
> original_object = [l, l]
> # Serialize and deserialize the object.
> buf = pa.serialize(original_object).to_buffer()
> new_object = pa.deserialize(buf)
> # This works.
> assert original_object[0] is original_object[1]
> # This fails.
> assert new_object[0] is new_object[1]
> {code}
> One potential way to address this is to use the Arrow dictionary encoding.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3644) [Rust] Implement ListArrayBuilder

2018-10-28 Thread Paddy Horan (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Paddy Horan updated ARROW-3644:
---
Issue Type: Sub-task  (was: New Feature)
Parent: ARROW-3089

> [Rust] Implement ListArrayBuilder
> -
>
> Key: ARROW-3644
> URL: https://issues.apache.org/jira/browse/ARROW-3644
> Project: Apache Arrow
>  Issue Type: Sub-task
>  Components: Rust
>Reporter: Paddy Horan
>Assignee: Paddy Horan
>Priority: Major
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-3644) [Rust] Implement ListArrayBuilder

2018-10-28 Thread Paddy Horan (JIRA)
Paddy Horan created ARROW-3644:
--

 Summary: [Rust] Implement ListArrayBuilder
 Key: ARROW-3644
 URL: https://issues.apache.org/jira/browse/ARROW-3644
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Rust
Reporter: Paddy Horan
Assignee: Paddy Horan






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-3643) Optimize `push_slice` of `BufferBuilder`

2018-10-28 Thread Paddy Horan (JIRA)
Paddy Horan created ARROW-3643:
--

 Summary: Optimize `push_slice` of `BufferBuilder`
 Key: ARROW-3643
 URL: https://issues.apache.org/jira/browse/ARROW-3643
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust
Reporter: Paddy Horan


Current implementation just repeatedly calls `push`, this should be optimized.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3108) [C++] arrow::PrettyPrint for Table instances

2018-10-28 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3108:
--
Labels: beginner pull-request-available  (was: beginner)

> [C++] arrow::PrettyPrint for Table instances
> 
>
> Key: ARROW-3108
> URL: https://issues.apache.org/jira/browse/ARROW-3108
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: C++
>Affects Versions: 0.10.0
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: beginner, pull-request-available
> Fix For: 0.12.0
>
>
> Extend the {{arrow::PrettyPrint}} functionality to also support 
> {{arrow::Table}} instances in addition to {{RecordBatch}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (ARROW-3108) [C++] arrow::PrettyPrint for Table instances

2018-10-28 Thread Uwe L. Korn (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe L. Korn reassigned ARROW-3108:
--

Assignee: Uwe L. Korn

> [C++] arrow::PrettyPrint for Table instances
> 
>
> Key: ARROW-3108
> URL: https://issues.apache.org/jira/browse/ARROW-3108
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: C++
>Affects Versions: 0.10.0
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: beginner
> Fix For: 0.12.0
>
>
> Extend the {{arrow::PrettyPrint}} functionality to also support 
> {{arrow::Table}} instances in addition to {{RecordBatch}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (ARROW-3639) [Packaging] Run gandiva nightly packaging tasks

2018-10-28 Thread Krisztian Szucs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krisztian Szucs resolved ARROW-3639.

Resolution: Fixed

> [Packaging] Run gandiva nightly packaging tasks
> ---
>
> Key: ARROW-3639
> URL: https://issues.apache.org/jira/browse/ARROW-3639
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Packaging
>Reporter: Krisztian Szucs
>Assignee: Krisztian Szucs
>Priority: Major
> Fix For: 0.12.0
>
>
> Crossbow now supports building gandiva jars: 
> https://github.com/apache/arrow/pull/2835
> Add `-g gandiva` to nightly crossbow trigger.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Assigned] (ARROW-3639) [Packaging] Run gandiva nightly packaging tasks

2018-10-28 Thread Krisztian Szucs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krisztian Szucs reassigned ARROW-3639:
--

Assignee: Krisztian Szucs

> [Packaging] Run gandiva nightly packaging tasks
> ---
>
> Key: ARROW-3639
> URL: https://issues.apache.org/jira/browse/ARROW-3639
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Packaging
>Reporter: Krisztian Szucs
>Assignee: Krisztian Szucs
>Priority: Major
> Fix For: 0.12.0
>
>
> Crossbow now supports building gandiva jars: 
> https://github.com/apache/arrow/pull/2835
> Add `-g gandiva` to nightly crossbow trigger.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (ARROW-3639) [Packaging] Run gandiva nightly packaging tasks

2018-10-28 Thread Krisztian Szucs (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-3639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1358#comment-1358
 ] 

Krisztian Szucs commented on ARROW-3639:


https://github.com/kszucs/crossbow/commit/d57d191105d85f763c47e7c6b94a147765687f16

> [Packaging] Run gandiva nightly packaging tasks
> ---
>
> Key: ARROW-3639
> URL: https://issues.apache.org/jira/browse/ARROW-3639
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Packaging
>Reporter: Krisztian Szucs
>Priority: Major
> Fix For: 0.12.0
>
>
> Crossbow now supports building gandiva jars: 
> https://github.com/apache/arrow/pull/2835
> Add `-g gandiva` to nightly crossbow trigger.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3642) [C++] Add arrowConfig.cmake generation

2018-10-28 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3642:
--
Labels: pull-request-available  (was: )

> [C++] Add arrowConfig.cmake generation
> --
>
> Key: ARROW-3642
> URL: https://issues.apache.org/jira/browse/ARROW-3642
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
>
> This allows simple usage of Arrow in C++ packages using 
> {{find_package(arrow)}} with no additional {{FindArrow.cmake}} in 
> {{cmake_modules}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-3642) [C++] Add arrowConfig.cmake generation

2018-10-28 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-3642:
--

 Summary: [C++] Add arrowConfig.cmake generation
 Key: ARROW-3642
 URL: https://issues.apache.org/jira/browse/ARROW-3642
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Uwe L. Korn
Assignee: Uwe L. Korn


This allows simple usage of Arrow in C++ packages using {{find_package(arrow)}} 
with no additional {{FindArrow.cmake}} in {{cmake_modules}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-3641) [C++/Python] remove public keyword from Cython api functions

2018-10-28 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-3641:
--

 Summary: [C++/Python] remove public keyword from Cython api 
functions
 Key: ARROW-3641
 URL: https://issues.apache.org/jira/browse/ARROW-3641
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++, Python
Reporter: Uwe L. Korn
 Fix For: 0.12.0


Based on a conversation with Stefan Behnel, we should be able to change the 
{{cdef public api}} statements in pyarrow/public-api.pxi to simply {{cdef api}}




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3636) [C++/Python] Update arrow/python/pyarrow_api.h

2018-10-28 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3636:
--
Labels: pull-request-available  (was: )

> [C++/Python] Update arrow/python/pyarrow_api.h
> --
>
> Key: ARROW-3636
> URL: https://issues.apache.org/jira/browse/ARROW-3636
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Python
>Reporter: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>
> This file should be updated to one generated by Cython 0.29. Also based on a 
> conversation with Stefan Behnel, we should be able to change the {{cdef 
> public api}} statements in pyarrow/public-api.pxi to simply {{cdef api}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (ARROW-3161) [Packaging] Ensure to run pyarrow unit tests in conda and wheel builds

2018-10-28 Thread Krisztian Szucs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krisztian Szucs resolved ARROW-3161.

Resolution: Fixed

> [Packaging] Ensure to run pyarrow unit tests in conda and wheel builds
> --
>
> Key: ARROW-3161
> URL: https://issues.apache.org/jira/browse/ARROW-3161
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Packaging
>Reporter: Krisztian Szucs
>Assignee: Krisztian Szucs
>Priority: Major
>
> Wheel linux and wheel osx run pyarrow tests too, others only test the imports.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (ARROW-3638) [C++][Python] Move reading from Feather as Table feature to C++ from Python

2018-10-28 Thread Uwe L. Korn (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe L. Korn resolved ARROW-3638.

   Resolution: Fixed
Fix Version/s: 0.12.0

Issue resolved by pull request 2853
[https://github.com/apache/arrow/pull/2853]

> [C++][Python] Move reading from Feather as Table feature to C++ from Python
> ---
>
> Key: ARROW-3638
> URL: https://issues.apache.org/jira/browse/ARROW-3638
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Python
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> It's for using the feature from GLib.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)