[jira] [Created] (ARROW-11518) [C++] [Parquet] Parquet reader crashes when reading boolean columns

2021-02-05 Thread Andrey Klochkov (Jira)
Andrey Klochkov created ARROW-11518: --- Summary: [C++] [Parquet] Parquet reader crashes when reading boolean columns Key: ARROW-11518 URL: https://issues.apache.org/jira/browse/ARROW-11518 Project:

[jira] [Updated] (ARROW-11518) [C++] [Parquet] Parquet reader crashes when reading boolean columns

2021-02-05 Thread Andrey Klochkov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrey Klochkov updated ARROW-11518: Description: Parquet file reader crashes while reading boolean columns in

[jira] [Commented] (ARROW-11326) utf8 vector buffers don't work if allocated within Web Assembly memory of Node.js

2021-02-05 Thread Dmitri Bronnikov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17280101#comment-17280101 ] Dmitri Bronnikov commented on ARROW-11326: -- buffer instanceof ArrayBuffer returns false for the

[jira] [Commented] (ARROW-11501) [C++] endianness check does not work on Solaris

2021-02-05 Thread Kazuaki Ishizaki (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17280077#comment-17280077 ] Kazuaki Ishizaki commented on ARROW-11501: -- I see. Could you please try this code by applying a

[jira] [Commented] (ARROW-11501) [C++] endianness check does not work on Solaris

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17280076#comment-17280076 ] Neal Richardson commented on ARROW-11501: - I have no idea, this isn't an area (neither

[jira] [Comment Edited] (ARROW-11501) [C++] endianness check does not work on Solaris

2021-02-05 Thread Kazuaki Ishizaki (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17280070#comment-17280070 ] Kazuaki Ishizaki edited comment on ARROW-11501 at 2/6/21, 4:04 AM: ---

[jira] [Commented] (ARROW-11501) [C++] endianness check does not work on Solaris

2021-02-05 Thread Kazuaki Ishizaki (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17280072#comment-17280072 ] Kazuaki Ishizaki commented on ARROW-11501: -- Is it the simplest solution to add the following

[jira] [Commented] (ARROW-11501) [C++] endianness check does not work on Solaris

2021-02-05 Thread Kazuaki Ishizaki (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17280070#comment-17280070 ] Kazuaki Ishizaki commented on ARROW-11501: -- Thank you for raising the issue. I will address

[jira] [Updated] (ARROW-11517) [Developer][Archery] Report both items and bytes per second

2021-02-05 Thread Diana Clarke (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Clarke updated ARROW-11517: - Summary: [Developer][Archery] Report both items and bytes per second (was: [Developer]

[jira] [Updated] (ARROW-11517) [Developer] Archery: report both items and bytes per second

2021-02-05 Thread Diana Clarke (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Clarke updated ARROW-11517: - Description: If google bench yields all 3 observations: bytes, items, and execution time,

[jira] [Updated] (ARROW-11517) [Developer] Archery: report both items and bytes per second

2021-02-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11517: --- Labels: pull-request-available (was: ) > [Developer] Archery: report both items and bytes

[jira] [Updated] (ARROW-11517) [Developer] Archery: report both items and bytes per second

2021-02-05 Thread Diana Clarke (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Clarke updated ARROW-11517: - Description: If google bench yields all 3 observations: bytes, items, and execution time,

[jira] [Updated] (ARROW-11517) [Developer] Archery: report both items and bytes per second

2021-02-05 Thread Diana Clarke (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Clarke updated ARROW-11517: - Description: If google bench yields all 3 observations: bytes, items, and execution time,

[jira] [Updated] (ARROW-11517) [Developer] Archery: report both items and bytes per second

2021-02-05 Thread Diana Clarke (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diana Clarke updated ARROW-11517: - Summary: [Developer] Archery: report both items and bytes per second (was: [Developer] Report

[jira] [Created] (ARROW-11517) [Developer] Report both items and bytes per second

2021-02-05 Thread Diana Clarke (Jira)
Diana Clarke created ARROW-11517: Summary: [Developer] Report both items and bytes per second Key: ARROW-11517 URL: https://issues.apache.org/jira/browse/ARROW-11517 Project: Apache Arrow

[jira] [Created] (ARROW-11516) [R] Allow all C++ compute functions to be called by name in dplyr

2021-02-05 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11516: --- Summary: [R] Allow all C++ compute functions to be called by name in dplyr Key: ARROW-11516 URL: https://issues.apache.org/jira/browse/ARROW-11516 Project:

[jira] [Created] (ARROW-11515) [R] Bindings for strsplit

2021-02-05 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11515: --- Summary: [R] Bindings for strsplit Key: ARROW-11515 URL: https://issues.apache.org/jira/browse/ARROW-11515 Project: Apache Arrow Issue Type: New

[jira] [Created] (ARROW-11514) [R] Bindings for str_c

2021-02-05 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11514: --- Summary: [R] Bindings for str_c Key: ARROW-11514 URL: https://issues.apache.org/jira/browse/ARROW-11514 Project: Apache Arrow Issue Type: New Feature

[jira] [Updated] (ARROW-10305) [R] Filter with regular expressions

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10305: Fix Version/s: 4.0.0 > [R] Filter with regular expressions >

[jira] [Created] (ARROW-11513) [R] Bindings for sub/gsub

2021-02-05 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11513: --- Summary: [R] Bindings for sub/gsub Key: ARROW-11513 URL: https://issues.apache.org/jira/browse/ARROW-11513 Project: Apache Arrow Issue Type: New

[jira] [Updated] (ARROW-10305) [R] Filter with regular expressions

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10305: Summary: [R] Filter with regular expressions (was: [C++][R] Filter datasets with string

[jira] [Resolved] (ARROW-11500) [R] Allow bundled build script to run on Solaris

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11500. - Resolution: Fixed Issue resolved by pull request 9422

[jira] [Created] (ARROW-11512) [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04

2021-02-05 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-11512: Summary: [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04 Key: ARROW-11512 URL: https://issues.apache.org/jira/browse/ARROW-11512 Project: Apache Arrow

[jira] [Updated] (ARROW-11512) [Packaging][deb] Add missing gRPC dependency for Ubuntu 21.04

2021-02-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11512: --- Labels: pull-request-available (was: ) > [Packaging][deb] Add missing gRPC dependency for

[jira] [Created] (ARROW-11511) [Rust] Replace Arc by ArrayData

2021-02-05 Thread Jorge Leitão (Jira)
Jorge Leitão created ARROW-11511: Summary: [Rust] Replace Arc by ArrayData Key: ARROW-11511 URL: https://issues.apache.org/jira/browse/ARROW-11511 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-11511) [Rust] Replace Arc by ArrayData

2021-02-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11511: --- Labels: pull-request-available (was: ) > [Rust] Replace Arc by ArrayData >

[jira] [Updated] (ARROW-11510) [Python] Add note that pip >= 19.0 is required to get binary packages

2021-02-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11510: --- Labels: pull-request-available (was: ) > [Python] Add note that pip >= 19.0 is required to

[jira] [Updated] (ARROW-11510) [Python] Add note that pip >= 19.0 is required to get binary packages

2021-02-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-11510: Description: Since ARROW-5756 dropped support for manylinux1 the required pip version is now 19. 

[jira] [Created] (ARROW-11510) [Python] Add note that pip >= 19.0 is required to get binary packages

2021-02-05 Thread Weston Pace (Jira)
Weston Pace created ARROW-11510: --- Summary: [Python] Add note that pip >= 19.0 is required to get binary packages Key: ARROW-11510 URL: https://issues.apache.org/jira/browse/ARROW-11510 Project: Apache

[jira] [Assigned] (ARROW-11510) [Python] Add note that pip >= 19.0 is required to get binary packages

2021-02-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-11510: --- Assignee: Weston Pace > [Python] Add note that pip >= 19.0 is required to get binary

[jira] [Created] (ARROW-11509) [C++][Compute] Overhaul CanCast() helper function

2021-02-05 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-11509: Summary: [C++][Compute] Overhaul CanCast() helper function Key: ARROW-11509 URL: https://issues.apache.org/jira/browse/ARROW-11509 Project: Apache Arrow

[jira] [Commented] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279926#comment-17279926 ] Travis Smith commented on ARROW-11506: -- Is this file something that should be linked at compilation

[jira] [Commented] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279924#comment-17279924 ] Weston Pace commented on ARROW-11506: - I don't know much about pybind11. That symbol is defined in

[jira] [Comment Edited] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279918#comment-17279918 ] Pac A. He edited comment on ARROW-11456 at 2/5/21, 7:01 PM: I see. I have

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading or writing a large parquet file, I have this error: {noformat} df:

[jira] [Commented] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279918#comment-17279918 ] Pac A. He commented on ARROW-11456: --- I see. I have now added code to reproduce the issue. Basically,

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Environment: pyarrow 3.0.0 / 2.0.0 pandas 1.1.5 / 1.2.1 smart_open 4.1.2 python 3.8.6 was: pyarrow

[jira] [Created] (ARROW-11508) [C++][Compute] Add support for generic conversions to Function::DispatchBest

2021-02-05 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-11508: Summary: [C++][Compute] Add support for generic conversions to Function::DispatchBest Key: ARROW-11508 URL: https://issues.apache.org/jira/browse/ARROW-11508

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading a large parquet file, I have this error: {noformat} df: Final =

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading a large parquet file, I have this error: {noformat} df: Final =

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading a large parquet file, I have this error: {noformat} df: Final =

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading a large parquet file, I have this error: {noformat} df: Final =

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading a large parquet file, I have this error: {noformat} df: Final =

[jira] [Updated] (ARROW-11456) [Python] Parquet reader cannot read large strings

2021-02-05 Thread Pac A. He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pac A. He updated ARROW-11456: -- Description: When reading a large parquet file, I have this error: {noformat} df: Final =

[jira] [Comment Edited] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279906#comment-17279906 ] Travis Smith edited comment on ARROW-11506 at 2/5/21, 6:49 PM: --- Thank you.

[jira] [Comment Edited] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279906#comment-17279906 ] Travis Smith edited comment on ARROW-11506 at 2/5/21, 6:48 PM: --- Thank you.

[jira] [Updated] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Travis Smith updated ARROW-11506: - Description: pyarrow 3.0.0 fails on install in a clean environment created using virtualenv on

[jira] [Commented] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279906#comment-17279906 ] Travis Smith commented on ARROW-11506: -- Thank you. The version of pip was 9.0.1 and I upgraded it

[jira] [Commented] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279889#comment-17279889 ] Weston Pace commented on ARROW-11506: - I suspect your version of `pip` is too old. First, did

[jira] [Commented] (ARROW-10899) [C++] Investigate radix sort for integer arrays

2021-02-05 Thread Kirill Lykov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279881#comment-17279881 ] Kirill Lykov commented on ARROW-10899: -- Sounds interesting to me, I would like to have a look.

[jira] [Updated] (ARROW-10354) [Rust] [DataFusion] Add support for regex extract

2021-02-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10354: --- Labels: beginner pull-request-available (was: beginner) > [Rust] [DataFusion] Add support

[jira] [Updated] (ARROW-11412) [Python] Expressions not working wiht logical boolean operators (and, or, not)

2021-02-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-11412: -- Summary: [Python] Expressions not working wiht logical boolean operators

[jira] [Updated] (ARROW-11412) [Python] Expressions not working with logical boolean operators (and, or, not)

2021-02-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-11412: -- Summary: [Python] Expressions not working with logical boolean operators

[jira] [Resolved] (ARROW-11412) [Python] Expression evaluation problem for logical boolean expressions (and, or, not)

2021-02-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-11412. --- Resolution: Fixed Issue resolved by pull request 9352

[jira] [Resolved] (ARROW-11412) [Python] Expression evaluation problem for logical boolean expressions (and, or, not)

2021-02-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-11412. --- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull

[jira] [Reopened] (ARROW-11412) [Python] Expression evaluation problem for logical boolean expressions (and, or, not)

2021-02-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reopened ARROW-11412: --- Assignee: Joris Van den Bossche > [Python] Expression evaluation problem

[jira] [Created] (ARROW-11507) [R] Bindings for GetRuntimeInfo

2021-02-05 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11507: --- Summary: [R] Bindings for GetRuntimeInfo Key: ARROW-11507 URL: https://issues.apache.org/jira/browse/ARROW-11507 Project: Apache Arrow Issue Type: New

[jira] [Resolved] (ARROW-11427) [C++] Arrow uses AVX512 instructions even when not supported by the OS

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11427. - Resolution: Fixed Issue resolved by pull request 9398

[jira] [Updated] (ARROW-11482) [Rust] Improve take primitive performance

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11482: Summary: [Rust] Improve take primitive performance (was: Improve take primitive

[jira] [Updated] (ARROW-11496) [Rust] NAN aggreagation behavior

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11496: Component/s: Rust Summary: [Rust] NAN aggreagation behavior (was: NAN

[jira] [Updated] (ARROW-11491) [Rust] Support json schema inference for nested list and struct

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11491: Summary: [Rust] Support json schema inference for nested list and struct (was: support

[jira] [Updated] (ARROW-11473) [Python] Needs a handling for missing columns while reading parquet file

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11473: Summary: [Python] Needs a handling for missing columns while reading parquet file (was:

[jira] [Updated] (ARROW-11481) [Rust] More cast implementations

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11481: Summary: [Rust] More cast implementations (was: More cast implementations) > [Rust] More

[jira] [Updated] (ARROW-11465) [C++] Parquet file writer snapshot API and proper ColumnChunk.file_path utilization

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11465: Summary: [C++] Parquet file writer snapshot API and proper ColumnChunk.file_path

[jira] [Updated] (ARROW-11463) [Python] Allow configuration of IpcWriterOptions 64Bit from PyArrow

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11463: Summary: [Python] Allow configuration of IpcWriterOptions 64Bit from PyArrow (was: Allow

[jira] [Updated] (ARROW-11484) [Rust] Derive Clone for ExecutionContext

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11484: Summary: [Rust] Derive Clone for ExecutionContext (was: Derive Clone for

[jira] [Updated] (ARROW-11505) [Rust] Add support for LargeUtf8 in csv-writer

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11505: Summary: [Rust] Add support for LargeUtf8 in csv-writer (was: Add support for LargeUtf8

[jira] [Updated] (ARROW-11506) [Python] Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11506: Summary: [Python] Pyarrow fail on install and eventual run/compilation (was: Pyarrow

[jira] [Updated] (ARROW-11351) [JavaScript] Reconsider proxy objects instead of defineProperty

2021-02-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11351: Summary: [JavaScript] Reconsider proxy objects instead of defineProperty (was:

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279763#comment-17279763 ] Antoine Pitrou commented on ARROW-11007: Ah, I didn't know that gRPC was involved. Since Arrow

[jira] [Updated] (ARROW-11506) Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Travis Smith updated ARROW-11506: - Description: pyarrow 3.0.0 fails on install in a clean environment created using virtualenv on

[jira] [Updated] (ARROW-11506) Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Travis Smith updated ARROW-11506: - Description: pyarrow 3.0.0 fails on install in a clean environment created using virtualenv on

[jira] [Created] (ARROW-11506) Pyarrow fail on install and eventual run/compilation

2021-02-05 Thread Travis Smith (Jira)
Travis Smith created ARROW-11506: Summary: Pyarrow fail on install and eventual run/compilation Key: ARROW-11506 URL: https://issues.apache.org/jira/browse/ARROW-11506 Project: Apache Arrow

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Dmitry Kashtanov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279733#comment-17279733 ] Dmitry Kashtanov commented on ARROW-11007: -- > That doesn't really answer the question: what

[jira] [Updated] (ARROW-11494) [Rust] Fix take bench

2021-02-05 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale updated ARROW-11494: --- Summary: [Rust] Fix take bench (was: Fix take bench.) > [Rust] Fix take bench >

[jira] [Resolved] (ARROW-11505) Add support for LargeUtf8 in csv-writer

2021-02-05 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-11505. Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 9426

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Dmitry Kashtanov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279688#comment-17279688 ] Dmitry Kashtanov commented on ARROW-11007: -- {code:java}

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279687#comment-17279687 ] Antoine Pitrou commented on ARROW-11007: > "Mem usage" is by `memory_profiler`. That doesn't

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Dmitry Kashtanov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279681#comment-17279681 ] Dmitry Kashtanov commented on ARROW-11007: -- "Mem usage" is by `memory_profiler`. And as we may

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279664#comment-17279664 ] Antoine Pitrou commented on ARROW-11007: As you can see, the memory was returned to the

[jira] [Commented] (ARROW-11007) [Python] Memory leak in pq.read_table and table.to_pandas

2021-02-05 Thread Dmitry Kashtanov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279642#comment-17279642 ] Dmitry Kashtanov commented on ARROW-11007: -- I have a somewhat similar issue observed both on

[jira] [Assigned] (ARROW-11427) [C++] Arrow uses AVX512 instructions even when not supported by the OS

2021-02-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-11427: -- Assignee: Antoine Pitrou > [C++] Arrow uses AVX512 instructions even when not

[jira] [Closed] (ARROW-10771) [Rust] Extend JSON schema inference to nested types

2021-02-05 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale closed ARROW-10771. -- Assignee: QP Hou Resolution: Duplicate > [Rust] Extend JSON schema inference to nested

[jira] [Created] (ARROW-11505) Add support for LargeUtf8 in csv-writer

2021-02-05 Thread Ritchie (Jira)
Ritchie created ARROW-11505: --- Summary: Add support for LargeUtf8 in csv-writer Key: ARROW-11505 URL: https://issues.apache.org/jira/browse/ARROW-11505 Project: Apache Arrow Issue Type: Improvement

[jira] [Commented] (ARROW-11427) [C++] Arrow uses AVX512 instructions even when not supported by the OS

2021-02-05 Thread Ali Cetin (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17279467#comment-17279467 ] Ali Cetin commented on ARROW-11427: --- [~apitrou], I have tested the fix in WS2012 and WS2016. I can