[jira] [Updated] (ARROW-6820) [C++] [Doc] [Format] Map specification and implementation inconsistent

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6820: -- Description: In https://arrow.apache.org/docs/format/Layout.html#map-type, the map type is spe

[jira] [Commented] (ARROW-6820) [C++] [Doc] [Format] Map specification and implementation inconsistent

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968173#comment-16968173 ] Antoine Pitrou commented on ARROW-6820: --- In the Java implementation, a map type als

[jira] [Created] (ARROW-7072) [Java] Support concating validity bits efficiently

2019-11-06 Thread Liya Fan (Jira)
Liya Fan created ARROW-7072: --- Summary: [Java] Support concating validity bits efficiently Key: ARROW-7072 URL: https://issues.apache.org/jira/browse/ARROW-7072 Project: Apache Arrow Issue Type: New

[jira] [Created] (ARROW-7073) [Java] Support concating vectors values in batch

2019-11-06 Thread Liya Fan (Jira)
Liya Fan created ARROW-7073: --- Summary: [Java] Support concating vectors values in batch Key: ARROW-7073 URL: https://issues.apache.org/jira/browse/ARROW-7073 Project: Apache Arrow Issue Type: New F

[jira] [Updated] (ARROW-7072) [Java] Support concating validity bits efficiently

2019-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7072: -- Labels: pull-request-available (was: ) > [Java] Support concating validity bits efficiently >

[jira] [Updated] (ARROW-6367) [C++][Gandiva] Implement string reverse

2019-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6367: -- Labels: pull-request-available (was: ) > [C++][Gandiva] Implement string reverse > ---

[jira] [Commented] (ARROW-7074) [C++] ASSERT_OK_AND_ASSIGN crashes when failing

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968298#comment-16968298 ] Antoine Pitrou commented on ARROW-7074: --- cc [~bkietz] [~fsaintjacques] > [C++] ASS

[jira] [Created] (ARROW-7074) [C++] ASSERT_OK_AND_ASSIGN crashes when failing

2019-11-06 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-7074: - Summary: [C++] ASSERT_OK_AND_ASSIGN crashes when failing Key: ARROW-7074 URL: https://issues.apache.org/jira/browse/ARROW-7074 Project: Apache Arrow Issue

[jira] [Created] (ARROW-7075) [C++] Boolean kernels should not allocate in Call()

2019-11-06 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-7075: --- Summary: [C++] Boolean kernels should not allocate in Call() Key: ARROW-7075 URL: https://issues.apache.org/jira/browse/ARROW-7075 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-7071) [Python] Add Array convenience method to create "masked" view with different validity bitmap

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968345#comment-16968345 ] Joris Van den Bossche commented on ARROW-7071: -- > NB: I'm not sure what kind

[jira] [Created] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2019-11-06 Thread Fabien (Jira)
Fabien created ARROW-7076: - Summary: `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly Key: ARROW-7076 URL: https://issues.apa

[jira] [Commented] (ARROW-7074) [C++] ASSERT_OK_AND_ASSIGN crashes when failing

2019-11-06 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968350#comment-16968350 ] Francois Saint-Jacques commented on ARROW-7074: --- Go ahead, I need this righ

[jira] [Commented] (ARROW-7071) [Python] Add Array convenience method to create "masked" view with different validity bitmap

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968354#comment-16968354 ] Joris Van den Bossche commented on ARROW-7071: -- Now, I think the main questi

[jira] [Commented] (ARROW-7074) [C++] ASSERT_OK_AND_ASSIGN crashes when failing

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968358#comment-16968358 ] Antoine Pitrou commented on ARROW-7074: --- I'll let you fix it, there are compile err

[jira] [Commented] (ARROW-6820) [C++] [Doc] [Format] Map specification and implementation inconsistent

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968370#comment-16968370 ] Joris Van den Bossche commented on ARROW-6820: -- To see the description in th

[jira] [Commented] (ARROW-6820) [C++] [Doc] [Format] Map specification and implementation inconsistent

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968373#comment-16968373 ] Joris Van den Bossche commented on ARROW-6820: -- Another inconsistency is tha

[jira] [Resolved] (ARROW-6984) [C++] Update LZ4 to 1.9.2 for CVE-2019-17543

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-6984. --- Resolution: Fixed Issue resolved by pull request 5728 [https://github.com/apache/arrow/pull/5

[jira] [Commented] (ARROW-7064) [R] Implement null type

2019-11-06 Thread Zachary Lawrence (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968394#comment-16968394 ] Zachary Lawrence commented on ARROW-7064: - Got it - thanks! Do you know if there'

[jira] [Commented] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968398#comment-16968398 ] Joris Van den Bossche commented on ARROW-7076: -- There are not yet binary whe

[jira] [Comment Edited] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968398#comment-16968398 ] Joris Van den Bossche edited comment on ARROW-7076 at 11/6/19 2:31 PM:

[jira] [Commented] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2019-11-06 Thread Fabien (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968407#comment-16968407 ] Fabien commented on ARROW-7076: --- Hi! Thanks for the answer! It's nice to have it on cond

[jira] [Commented] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2019-11-06 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968417#comment-16968417 ] Joris Van den Bossche commented on ARROW-7076: -- See ARROW-6920 for wheels fo

[jira] [Assigned] (ARROW-3408) [C++] Add option to CSV reader to dictionary encode individual columns or all string / binary columns

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-3408: - Assignee: Antoine Pitrou > [C++] Add option to CSV reader to dictionary encode individua

[jira] [Comment Edited] (ARROW-6920) [Python] create manylinux wheels for python3.8

2019-11-06 Thread Fabien (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968462#comment-16968462 ] Fabien edited comment on ARROW-6920 at 11/6/19 3:54 PM: Hello, I

[jira] [Commented] (ARROW-6920) [Python] create manylinux wheels for python3.8

2019-11-06 Thread Fabien (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968462#comment-16968462 ] Fabien commented on ARROW-6920: --- Hello, I would be very interested in wheel for 3.8 on pypi,

[jira] [Created] (ARROW-7077) [C++] Unsupported Dict->T cast crashes instead of returning error

2019-11-06 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-7077: - Summary: [C++] Unsupported Dict->T cast crashes instead of returning error Key: ARROW-7077 URL: https://issues.apache.org/jira/browse/ARROW-7077 Project: Apache Arr

[jira] [Updated] (ARROW-7077) [C++] Unsupported Dict->T cast crashes instead of returning error

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7077: -- Description: {code:python} >>> arr = pa.array(["foo", "bar"])

[jira] [Commented] (ARROW-7071) [Python] Add Array convenience method to create "masked" view with different validity bitmap

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968479#comment-16968479 ] Wes McKinney commented on ARROW-7071: - It'd be best to not mutate any existing arrays

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-18-42-783.png > [Python] Reading parquet file with many colum

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-19-11-662.png > [Python] Reading parquet file with many colum

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-23-18-897.png > [Python] Reading parquet file with many colum

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-08-25-05-885.png > [Python] Reading parquet file with many colum

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968488#comment-16968488 ] Eric Kisslinger commented on ARROW-7059: Thanks for the suggestion. I was unfamil

[jira] [Created] (ARROW-7078) [Developer] Add Windows utility script to use Dependencies.exe to dump DLL dependencies for diagnostic purposes

2019-11-06 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-7078: --- Summary: [Developer] Add Windows utility script to use Dependencies.exe to dump DLL dependencies for diagnostic purposes Key: ARROW-7078 URL: https://issues.apache.org/jira/browse/A

[jira] [Resolved] (ARROW-7067) [CI] Disable code coverage on Travis-CI

2019-11-06 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-7067. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5778 [https://gi

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-09-23-54-372.png > [Python] Reading parquet file with many colum

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968557#comment-16968557 ] Eric Kisslinger commented on ARROW-7059: In [https://github.com/apache/arrow/blo

[jira] [Resolved] (ARROW-7058) [C++] FileSystemDataSourceDiscovery should apply partition schemes relative to the base_dir of its selector

2019-11-06 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-7058. --- Resolution: Fixed Issue resolved by pull request 5772 [https://github.com/apa

[jira] [Created] (ARROW-7079) [C++][Dataset] Implement ScalarAsStatisctics for non-primitive types

2019-11-06 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-7079: - Summary: [C++][Dataset] Implement ScalarAsStatisctics for non-primitive types Key: ARROW-7079 URL: https://issues.apache.org/jira/browse/ARROW-7079

[jira] [Created] (ARROW-7080) [Python][Parquet] Expose parquet field_id in Schema objects

2019-11-06 Thread Ted Gooch (Jira)
Ted Gooch created ARROW-7080: Summary: [Python][Parquet] Expose parquet field_id in Schema objects Key: ARROW-7080 URL: https://issues.apache.org/jira/browse/ARROW-7080 Project: Apache Arrow Iss

[jira] [Created] (ARROW-7081) [R] Add methods for introspecting parquet files

2019-11-06 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-7081: --- Summary: [R] Add methods for introspecting parquet files Key: ARROW-7081 URL: https://issues.apache.org/jira/browse/ARROW-7081 Project: Apache Arrow Issue Type

[jira] [Updated] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] V Luong updated ARROW-6910: --- Description: I realize that when I read up a lot of Parquet files using pyarrow.parquet.read_table(...), my

[jira] [Commented] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968654#comment-16968654 ] V Luong commented on ARROW-6910: [~apitrou] [~wesm] I'm re-testing this issue using the n

[jira] [Comment Edited] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968654#comment-16968654 ] V Luong edited comment on ARROW-6910 at 11/6/19 8:05 PM: - [~apitr

[jira] [Comment Edited] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968654#comment-16968654 ] V Luong edited comment on ARROW-6910 at 11/6/19 8:08 PM: - [~apitr

[jira] [Comment Edited] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968654#comment-16968654 ] V Luong edited comment on ARROW-6910 at 11/6/19 8:08 PM: - [~apitr

[jira] [Commented] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968667#comment-16968667 ] Wes McKinney commented on ARROW-6910: - What platform are you on? It's possible the ba

[jira] [Commented] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968670#comment-16968670 ] Wes McKinney commented on ARROW-6910: - If you can open a new JIRA for further investi

[jira] [Commented] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread V Luong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968675#comment-16968675 ] V Luong commented on ARROW-6910: ok [~wesm] let me create a new JIRA ticket for 0.15.1 >

[jira] [Commented] (ARROW-6910) [Python] pyarrow.parquet.read_table(...) takes up lots of memory which is not released until program exits

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968683#comment-16968683 ] Wes McKinney commented on ARROW-6910: - The place to start will be twiddling with the

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968701#comment-16968701 ] Francois Saint-Jacques commented on ARROW-7059: --- Yes, that's an O(|columns|

[jira] [Closed] (ARROW-7076) `pip install pyarrow` with python 3.8 fail with message : Could not build wheels for pyarrow which use PEP 517 and cannot be installed directly

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-7076. --- Resolution: Duplicate Duplicate of ARROW-6920 > `pip install pyarrow` with python 3.8 fail with mess

[jira] [Commented] (ARROW-7064) [R] Implement null type

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968711#comment-16968711 ] Wes McKinney commented on ARROW-7064: - I would guess that this will be fixed at some

[jira] [Updated] (ARROW-7077) [C++] Unsupported Dict->T cast crashes instead of returning error

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7077: Fix Version/s: 1.0.0 > [C++] Unsupported Dict->T cast crashes instead of returning error >

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968718#comment-16968718 ] Eric Kisslinger commented on ARROW-7059: np. For completeness I also noticed what

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Eric Kisslinger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Kisslinger updated ARROW-7059: --- Attachment: image-2019-11-06-13-16-05-102.png > [Python] Reading parquet file with many colum

[jira] [Commented] (ARROW-7064) [R] Implement null type

2019-11-06 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968743#comment-16968743 ] Neal Richardson commented on ARROW-7064: You can read the file with as_data_frame

[jira] [Commented] (ARROW-7028) [R] Date roundtrip results in different R storage mode

2019-11-06 Thread Bai Ming (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968798#comment-16968798 ] Bai Ming commented on ARROW-7028: - Thanks for the workaround, tested it and it works, wil

[jira] [Updated] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7059: Fix Version/s: 1.0.0 > [Python] Reading parquet file with many columns is much slower in 0.15.x >

[jira] [Commented] (ARROW-7059) [Python] Reading parquet file with many columns is much slower in 0.15.x versus 0.14.x

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968828#comment-16968828 ] Wes McKinney commented on ARROW-7059: - We should ensure that we have both ASV and C++

[jira] [Updated] (ARROW-7080) [Python][Parquet] Expose parquet field_id in Schema objects

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7080: Fix Version/s: 1.0.0 > [Python][Parquet] Expose parquet field_id in Schema objects > --

[jira] [Commented] (ARROW-7080) [Python][Parquet] Expose parquet field_id in Schema objects

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968829#comment-16968829 ] Wes McKinney commented on ARROW-7080: - Please feel free. You can propagate it to the

[jira] [Created] (ARROW-7082) [Packaging][deb] Add apache-arrow-archive-keyring

2019-11-06 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-7082: --- Summary: [Packaging][deb] Add apache-arrow-archive-keyring Key: ARROW-7082 URL: https://issues.apache.org/jira/browse/ARROW-7082 Project: Apache Arrow Issue Ty

[jira] [Updated] (ARROW-7082) [Packaging][deb] Add apache-arrow-archive-keyring

2019-11-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7082: -- Labels: pull-request-available (was: ) > [Packaging][deb] Add apache-arrow-archive-keyring > -

[jira] [Resolved] (ARROW-6743) [C++] Completely remove usage of boost::filesystem (except in hdfs_internal)

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6743. - Fix Version/s: (was: 2.0.0) 1.0.0 Resolution: Fixed Issue resolved

[jira] [Assigned] (ARROW-6743) [C++] Completely remove usage of boost::filesystem (except in hdfs_internal)

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6743: --- Assignee: Antoine Pitrou > [C++] Completely remove usage of boost::filesystem (except in hdf

[jira] [Resolved] (ARROW-7054) [Docs] Add option to override displayed docs version with an environment variable

2019-11-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7054. - Resolution: Fixed Issue resolved by pull request 5766 [https://github.com/apache/arrow/pull/5766]

[jira] [Assigned] (ARROW-6367) [C++][Gandiva] Implement string reverse

2019-11-06 Thread Projjal Chanda (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Projjal Chanda reassigned ARROW-6367: - Assignee: Projjal Chanda (was: Prudhvi Porandla) > [C++][Gandiva] Implement string reve

[jira] [Commented] (ARROW-7017) [C++] Refactor AddKernel to support other operations and types

2019-11-06 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968992#comment-16968992 ] Micah Kornfield commented on ARROW-7017: This seems plausible, but it still sound

[jira] [Created] (ARROW-7083) [C++] Determine the feasibility and build a prototype to replace compute/kernels with gandiva kernels

2019-11-06 Thread Micah Kornfield (Jira)
Micah Kornfield created ARROW-7083: -- Summary: [C++] Determine the feasibility and build a prototype to replace compute/kernels with gandiva kernels Key: ARROW-7083 URL: https://issues.apache.org/jira/browse/ARROW

[jira] [Commented] (ARROW-7017) [C++] Refactor AddKernel to support other operations and types

2019-11-06 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968993#comment-16968993 ] Micah Kornfield commented on ARROW-7017: I created [https://jira.apache.org/jira/

[jira] [Updated] (ARROW-7083) [C++] Determine the feasibility and build a prototype to replace compute/kernels with gandiva kernels

2019-11-06 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield updated ARROW-7083: --- Component/s: C++ - Gandiva C++ - Compute C++ > [C++] Determ

[jira] [Updated] (ARROW-6931) [Java] Consider starting to use Google Truth Fluent Assertions library

2019-11-06 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield updated ARROW-6931: --- Description: This can offer more readable asserts than the limited JUnit assertions. we m

[jira] [Commented] (ARROW-6931) [Java] Consider starting to use Google Truth Fluent Assertions library

2019-11-06 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968996#comment-16968996 ] Micah Kornfield commented on ARROW-6931: probably worth discussing on the ML if p

[jira] [Commented] (ARROW-7048) [Java] Support for combining multiple vectors under VectorSchemaRoot

2019-11-06 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16968998#comment-16968998 ] Micah Kornfield commented on ARROW-7048: "For VariableWidthVectors, we need to tr

[jira] [Commented] (ARROW-7048) [Java] Support for combining multiple vectors under VectorSchemaRoot

2019-11-06 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16969016#comment-16969016 ] Liya Fan commented on ARROW-7048: - [~emkornfi...@gmail.com] Agreed. Adding a constant to