[jira] [Commented] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235244#comment-17235244 ] Joris Van den Bossche commented on ARROW-10517: --- For the first error in th

[jira] [Commented] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235231#comment-17235231 ] Joris Van den Bossche commented on ARROW-10517: --- [~ldacey] thanks for the

[jira] [Comment Edited] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-18 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235084#comment-17235084 ] Lance Dacey edited comment on ARROW-10517 at 11/19/20, 7:44 AM: --

[jira] [Commented] (ARROW-10599) Prebuilt distributions (aka. pyarrow and libarrow-dev) should use the same ABI (with or without the DUAL abi)

2020-11-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235223#comment-17235223 ] Kouhei Sutou commented on ARROW-10599: -- Could you provide a Python script and wheel

[jira] [Commented] (ARROW-10587) [Ruby] Table#initialize examples are out of date

2020-11-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235218#comment-17235218 ] Kouhei Sutou commented on ARROW-10587: -- Great! > [Ruby] Table#initialize examples

[jira] [Assigned] (ARROW-10587) [Ruby] Table#initialize examples are out of date

2020-11-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou reassigned ARROW-10587: Assignee: fonsan (was: Kouhei Sutou) > [Ruby] Table#initialize examples are out of date

[jira] [Updated] (ARROW-10652) [C++][Gandiva] Make gandiva cache size configurable

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10652: --- Labels: pull-request-available (was: ) > [C++][Gandiva] Make gandiva cache size configurabl

[jira] [Commented] (ARROW-10617) [Python] RecordBatchStreamReader's iterator doesn't work with python 3.8

2020-11-18 Thread Tao He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235188#comment-17235188 ] Tao He commented on ARROW-10617: The issue was found on Ubuntu 20.02, where the `libarro

[jira] [Created] (ARROW-10652) [C++][Gandiva] Make gandiva cache size configurable

2020-11-18 Thread Projjal Chanda (Jira)
Projjal Chanda created ARROW-10652: -- Summary: [C++][Gandiva] Make gandiva cache size configurable Key: ARROW-10652 URL: https://issues.apache.org/jira/browse/ARROW-10652 Project: Apache Arrow

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008a

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008a

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008a

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Updated] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chiyang Wan updated ARROW-10651: Description: Checking [https://github.com/apache/arrow/blob/256d0dc3f712154100aa6e0a610383b189008

[jira] [Created] (ARROW-10651) [C++] alloc-dealloc-mismatch in s3fs.cc

2020-11-18 Thread Chiyang Wan (Jira)
Chiyang Wan created ARROW-10651: --- Summary: [C++] alloc-dealloc-mismatch in s3fs.cc Key: ARROW-10651 URL: https://issues.apache.org/jira/browse/ARROW-10651 Project: Apache Arrow Issue Type: Impr

[jira] [Updated] (ARROW-10650) [C++]memory leak when read parquet file from hadoop

2020-11-18 Thread yzr (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yzr updated ARROW-10650: Summary: [C++]memory leak when read parquet file from hadoop (was: memory leak when read parquet file from hadoop

[jira] [Created] (ARROW-10650) memory leak when read parquet file from hadoop

2020-11-18 Thread yzr (Jira)
yzr created ARROW-10650: --- Summary: memory leak when read parquet file from hadoop Key: ARROW-10650 URL: https://issues.apache.org/jira/browse/ARROW-10650 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-18 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lance Dacey updated ARROW-10517: Description:   {code:python} # adal==1.2.5 # adlfs==0.2.5 # fsspec==0.7.4 # pandas==1.1.3 # pyarro

[jira] [Commented] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-18 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235084#comment-17235084 ] Lance Dacey commented on ARROW-10517: - Added an edit with the results of pure fsspec

[jira] [Updated] (ARROW-10517) [Python] Unable to read/write Parquet datasets with fsspec on Azure Blob

2020-11-18 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lance Dacey updated ARROW-10517: Description:   {code:python} # adal==1.2.5 # adlfs==0.2.5 # fsspec==0.7.4 # pandas==1.1.3 # pyarro

[jira] [Updated] (ARROW-10649) [Rust] Parsing manually in infer_field_schema, remove lazy static dependency

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-10649: Description: Removes one direct dependency (lazy_static) > [Rust] Parsing manually in infer_field_

[jira] [Updated] (ARROW-10649) [Rust] Parsing manually in infer_field_schema, remove lazy static dependency

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-10649: Component/s: Rust > [Rust] Parsing manually in infer_field_schema, remove lazy static dependency

[jira] [Resolved] (ARROW-10633) [Rust][DataFusion] Dependency version upgrades

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-10633. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8697 [https://githu

[jira] [Updated] (ARROW-10633) [Rust][DataFusion] Dependency version upgrades

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-10633: Component/s: Rust - DataFusion > [Rust][DataFusion] Dependency version upgrades >

[jira] [Commented] (ARROW-10633) [Rust][DataFusion] Dependency version upgrades

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17235066#comment-17235066 ] Andrew Lamb commented on ARROW-10633: - I am not able to assign this ticket to [~Dand

[jira] [Updated] (ARROW-10649) [Rust] Parsing manually in infer_field_schema, remove lazy static dependency

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10649: --- Labels: pull-request-available (was: ) > [Rust] Parsing manually in infer_field_schema, rem

[jira] [Created] (ARROW-10649) [Rust] Parsing manually in infer_field_schema, remove lazy static dependency

2020-11-18 Thread Jira
Daniël Heres created ARROW-10649: Summary: [Rust] Parsing manually in infer_field_schema, remove lazy static dependency Key: ARROW-10649 URL: https://issues.apache.org/jira/browse/ARROW-10649 Projec

[jira] [Resolved] (ARROW-10637) [Rust] Add examples to boolean kernels

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-10637. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8699 [https://githu

[jira] [Resolved] (ARROW-10464) [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-10464. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8705 [https://githu

[jira] [Created] (ARROW-10648) [Java] Prepare Java codebase for source release without requiring any git tags to be created or pushed

2020-11-18 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-10648: Summary: [Java] Prepare Java codebase for source release without requiring any git tags to be created or pushed Key: ARROW-10648 URL: https://issues.apache.org/jira/browse/ARROW-1

[jira] [Commented] (ARROW-8908) [Rust][DataFusion] improve performance of building literal arrays

2020-11-18 Thread Yordan Pavlov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234942#comment-17234942 ] Yordan Pavlov commented on ARROW-8908: -- direct comparison to scalar values was imple

[jira] [Closed] (ARROW-10483) [C++] Move Executor into a separate header

2020-11-18 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman closed ARROW-10483. Resolution: Won't Fix > [C++] Move Executor into a separate header > -

[jira] [Updated] (ARROW-10647) [Rust] [Parquet] Port parquet benchmarks to new repo

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10647: --- Labels: pull-request-available (was: ) > [Rust] [Parquet] Port parquet benchmarks to new re

[jira] [Created] (ARROW-10647) [Rust] [Parquet] Port parquet benchmarks to new repo

2020-11-18 Thread Andrew Lamb (Jira)
Andrew Lamb created ARROW-10647: --- Summary: [Rust] [Parquet] Port parquet benchmarks to new repo Key: ARROW-10647 URL: https://issues.apache.org/jira/browse/ARROW-10647 Project: Apache Arrow Iss

[jira] [Updated] (ARROW-10646) [C++][FlightRPC] Disable flaky test

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10646: --- Labels: pull-request-available (was: ) > [C++][FlightRPC] Disable flaky test >

[jira] [Created] (ARROW-10646) [C++][FlightRPC] Disable flaky test

2020-11-18 Thread David Li (Jira)
David Li created ARROW-10646: Summary: [C++][FlightRPC] Disable flaky test Key: ARROW-10646 URL: https://issues.apache.org/jira/browse/ARROW-10646 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-10645) [Rust] [DataFusion] Add support for writing results to Parquet

2020-11-18 Thread Andy Grove (Jira)
Andy Grove created ARROW-10645: -- Summary: [Rust] [DataFusion] Add support for writing results to Parquet Key: ARROW-10645 URL: https://issues.apache.org/jira/browse/ARROW-10645 Project: Apache Arrow

[jira] [Updated] (ARROW-10032) [Documentation] C++ Windows docs are out of date

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10032: --- Labels: pull-request-available (was: ) > [Documentation] C++ Windows docs are out of date >

[jira] [Assigned] (ARROW-10032) [Documentation] C++ Windows docs are out of date

2020-11-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-10032: Assignee: David Li > [Documentation] C++ Windows docs are out of date > -

[jira] [Updated] (ARROW-10464) [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10464: --- Labels: pull-request-available (was: ) > [Rust] Implement utility to convert TPC-H tbl file

[jira] [Updated] (ARROW-10644) [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10644: --- Labels: pull-request-available (was: ) > [Python] Consolidate path/filesystem handling in p

[jira] [Assigned] (ARROW-10464) [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet

2020-11-18 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-10464: -- Assignee: Andy Grove > [Rust] Implement utility to convert TPC-H tbl files to CSV and Parquet

[jira] [Commented] (ARROW-10587) [Ruby] Table#initialize examples are out of date

2020-11-18 Thread fonsan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234777#comment-17234777 ] fonsan commented on ARROW-10587: I will attempt validating the examples this weekend, If

[jira] [Commented] (ARROW-10636) Remove specialisation from Rust parquet

2020-11-18 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234775#comment-17234775 ] Andrew Lamb commented on ARROW-10636: - Related to (or perhaps a dupe) of ARROW-6717

[jira] [Commented] (ARROW-10641) [C++] A "replace" or "map" kernel to replace values in array based on mapping

2020-11-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234767#comment-17234767 ] Neal Richardson commented on ARROW-10641: - There are other "recode" functions ou

[jira] [Updated] (ARROW-10642) [R] as.data.frame.Table crashes R with schema and no record batches

2020-11-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10642: Fix Version/s: 3.0.0 > [R] as.data.frame.Table crashes R with schema and no record batches

[jira] [Updated] (ARROW-10642) [R] as.data.frame.Table crashes R with schema and no record batches

2020-11-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10642: Summary: [R] as.data.frame.Table crashes R with schema and no record batches (was: as.dat

[jira] [Created] (ARROW-10644) [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs

2020-11-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10644: - Summary: [Python] Consolidate path/filesystem handling in pyarrow.dataset and pyarrow.fs Key: ARROW-10644 URL: https://issues.apache.org/jira/browse/ARROW-10644

[jira] [Resolved] (ARROW-10631) [Rust] Equality of fixed-sized binary is incorrect.

2020-11-18 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-10631. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8695 [https:/

[jira] [Resolved] (ARROW-10638) [Rust] Improve tests of boolean kernels

2020-11-18 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-10638. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 8700 [https:/

[jira] [Updated] (ARROW-10635) [C++] ORC reader issue with bool column

2020-11-18 Thread Ramakrishna Prabhu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ramakrishna Prabhu updated ARROW-10635: --- Description: The ORC file contains single column of boolean type, from row number `2

[jira] [Updated] (ARROW-10581) IPC dictionary reference to relevant section

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10581: -- Fix Version/s: 3.0.0 > IPC dictionary reference to relevant section > --

[jira] [Updated] (ARROW-5008) [Python] ORC Reader Core Dumps in PyArrow if `/etc/localtime` does not exist

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5008: - Labels: orc (was: ORC) > [Python] ORC Reader Core Dumps in PyArrow if `/etc/loca

[jira] [Updated] (ARROW-2681) [C++] Use source releases when building ORC instead of using GitHub tag snapshots

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-2681: - Labels: orc (was: ) > [C++] Use source releases when building ORC instead of usi

[jira] [Updated] (ARROW-10635) [C++] ORC reader issue with bool column

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10635: -- Summary: [C++] ORC reader issue with bool column (was: ORC reader issue with

[jira] [Updated] (ARROW-10276) [Python] Armv7 orc and flight not supported for build. Compat error on using with spark

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10276: -- Labels: orc (was: ) > [Python] Armv7 orc and flight not supported for build.

[jira] [Commented] (ARROW-10635) ORC reader issue with bool column.

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234673#comment-17234673 ] Joris Van den Bossche commented on ARROW-10635: --- [~rgsl888] Thanks for the

[jira] [Updated] (ARROW-10635) [C++] ORC reader issue with bool column

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10635: -- Component/s: C++ > [C++] ORC reader issue with bool column > -

[jira] [Updated] (ARROW-8056) [R] Support read and write orc file format

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8056: - Labels: orc (was: ) > [R] Support read and write orc file format > -

[jira] [Updated] (ARROW-9299) [Python] Expose ORC metadata() in Python ORCFile

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9299: - Labels: orc (was: ) > [Python] Expose ORC metadata() in Python ORCFile > ---

[jira] [Updated] (ARROW-7906) [C++][Python] Full functionality for ORC format

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-7906: - Labels: orc pull-request-available (was: pull-request-available) > [C++][Python]

[jira] [Updated] (ARROW-4713) [C++] Improve C++ Orc Adapter performance and memory footprint

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-4713: - Labels: orc pull-request-available (was: pull-request-available) > [C++] Improve

[jira] [Updated] (ARROW-7811) [Python] Re-enable pyarrow.orc in wheel packages

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-7811: - Labels: orc (was: ) > [Python] Re-enable pyarrow.orc in wheel packages > ---

[jira] [Updated] (ARROW-10635) ORC reader issue with bool column.

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10635: -- Labels: orc (was: ) > ORC reader issue with bool column. > --

[jira] [Updated] (ARROW-10122) [Python] Selecting one column of multi-index results in a duplicated value column.

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10122: -- Fix Version/s: 3.0.0 > [Python] Selecting one column of multi-index results in

[jira] [Assigned] (ARROW-10122) [Python] Selecting one column of multi-index results in a duplicated value column.

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-10122: - Assignee: Joris Van den Bossche > [Python] Selecting one column of mult

[jira] [Commented] (ARROW-10617) [Python] RecordBatchStreamReader's iterator doesn't work with python 3.8

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234664#comment-17234664 ] Joris Van den Bossche commented on ARROW-10617: --- If I run this example in

[jira] [Updated] (ARROW-10617) [Python] RecordBatchStreamReader's iterator doesn't work with python 3.8

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10617: -- Description: The following example code doesn't work with python 3.8: {code}

[jira] [Created] (ARROW-10643) [Python] Pandas<->pyarrow roundtrip failing to recreate index for empty dataframe

2020-11-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10643: - Summary: [Python] Pandas<->pyarrow roundtrip failing to recreate index for empty dataframe Key: ARROW-10643 URL: https://issues.apache.org/jira/browse/ARROW-1064

[jira] [Created] (ARROW-10642) as.data.frame.Table crash R with schema and no record batche

2020-11-18 Thread Bruno Tremblay (Jira)
Bruno Tremblay created ARROW-10642: -- Summary: as.data.frame.Table crash R with schema and no record batche Key: ARROW-10642 URL: https://issues.apache.org/jira/browse/ARROW-10642 Project: Apache Arro

[jira] [Commented] (ARROW-10641) [C++] A "replace" or "map" kernel to replace values in array based on mapping

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234609#comment-17234609 ] Joris Van den Bossche commented on ARROW-10641: --- In the R world, something

[jira] [Updated] (ARROW-10641) [C++] A "replace" or "map" kernel to replace values in array based on mapping

2020-11-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10641: -- Description: A "replace" or "map" kernel to replace values in array based on m

[jira] [Created] (ARROW-10641) [C++] A "replace" or "map" kernel to replace values in array based on mapping

2020-11-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10641: - Summary: [C++] A "replace" or "map" kernel to replace values in array based on mapping Key: ARROW-10641 URL: https://issues.apache.org/jira/browse/ARROW-10641

[jira] [Commented] (ARROW-10052) [Python] Incrementally using ParquetWriter keeps data in memory (eventually running out of RAM for large datasets)

2020-11-18 Thread Niklas B (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234575#comment-17234575 ] Niklas B commented on ARROW-10052: -- Closing this, I think the memory usage is OK  > [P

[jira] [Closed] (ARROW-10052) [Python] Incrementally using ParquetWriter keeps data in memory (eventually running out of RAM for large datasets)

2020-11-18 Thread Niklas B (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niklas B closed ARROW-10052. Resolution: Not A Problem > [Python] Incrementally using ParquetWriter keeps data in memory (eventually >

[jira] [Commented] (ARROW-10599) Prebuilt distributions (aka. pyarrow and libarrow-dev) should use the same ABI (with or without the DUAL abi)

2020-11-18 Thread Tao He (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234566#comment-17234566 ] Tao He commented on ARROW-10599: Without `setdlopenflags(RTLD_GLOBAL)` pyarrow won't wor

[jira] [Commented] (ARROW-10640) [C++] A "where" kernel to combine two arrays based on a mask

2020-11-18 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234481#comment-17234481 ] Maarten Breddels commented on ARROW-10640: -- Another idea would be to have a 'ch

[jira] [Commented] (ARROW-10640) [C++] A "where" kernel to combine two arrays based on a mask

2020-11-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234451#comment-17234451 ] Antoine Pitrou commented on ARROW-10640: A boolean entry can be true, false or n

[jira] [Commented] (ARROW-10627) [Rust] Github master does not compile for WASM target

2020-11-18 Thread Junyuan Tan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17234447#comment-17234447 ] Junyuan Tan commented on ARROW-10627: - Sure - you'll need to install the wasm32-unkn

[jira] [Created] (ARROW-10640) [C++] A "where" kernel to combine two arrays based on a mask

2020-11-18 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10640: - Summary: [C++] A "where" kernel to combine two arrays based on a mask Key: ARROW-10640 URL: https://issues.apache.org/jira/browse/ARROW-10640 Projec

[jira] [Updated] (ARROW-10143) [C++] ArrayRangeEquals should accept EqualOptions

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10143: --- Labels: pull-request-available (was: ) > [C++] ArrayRangeEquals should accept EqualOptions

[jira] [Updated] (ARROW-10634) [C#][CI] Change the build version from 2.2 to 3.1 in CI

2020-11-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10634: --- Labels: pull-request-available (was: ) > [C#][CI] Change the build version from 2.2 to 3.1

[jira] [Updated] (ARROW-10634) [C#][CI] Change the build version from 2.2 to 3.1 in CI

2020-11-18 Thread Alexander (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander updated ARROW-10634: -- Description: This change is required to use gRPC and flight in net core. The build version should cha