[jira] [Commented] (ARROW-17107) [Java] JSONFileWriter throws IOOBE writing an empty list

2022-07-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568038#comment-17568038 ] David Li commented on ARROW-17107: -- Note that the JSON writer won't construct JSONL as you might

[jira] [Created] (ARROW-17107) [Java] JSONFileWriter throws IOOBE writing an empty list

2022-07-18 Thread James Henderson (Jira)
James Henderson created ARROW-17107: --- Summary: [Java] JSONFileWriter throws IOOBE writing an empty list Key: ARROW-17107 URL: https://issues.apache.org/jira/browse/ARROW-17107 Project: Apache Arrow

[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-07-18 Thread Pavel Solodovnikov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568021#comment-17568021 ] Pavel Solodovnikov commented on ARROW-8163: --- I plan to start working on this item soon, can you

[jira] [Updated] (ARROW-17087) [C++] Race condition in scanner test

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-17087: -- Labels: Nightly (was: ) > [C++] Race condition in scanner test >

[jira] [Closed] (ARROW-17103) [C++][CI] arrow-dataset-scanner-test has been failing on some nightly cpp builds

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido closed ARROW-17103. - Fix Version/s: (was: 9.0.0) Resolution: Duplicate This is a duplicate > [C++][CI]

[jira] [Commented] (ARROW-17103) [C++][CI] arrow-dataset-scanner-test has been failing on some nightly cpp builds

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567987#comment-17567987 ] Raúl Cumplido commented on ARROW-17103: --- Yep, I'll tag the other as Nightly and will close this

[jira] [Updated] (ARROW-17106) [Python] Move parquet code from __init__.py and expose only API

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-17106: -- Description: As discussed on

[jira] [Updated] (ARROW-17106) [Python] Move parquet code from __init__.py and expose only API

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-17106: -- Description: As discussed on

[jira] [Created] (ARROW-17106) [Python] Move parquet code from __init__.py and expose only API

2022-07-18 Thread Jira
Raúl Cumplido created ARROW-17106: - Summary: [Python] Move parquet code from __init__.py and expose only API Key: ARROW-17106 URL: https://issues.apache.org/jira/browse/ARROW-17106 Project: Apache

[jira] [Created] (ARROW-17105) [Python] test_filesystem_dataset_no_filesystem_interaction segfault on s390x

2022-07-18 Thread David Li (Jira)
David Li created ARROW-17105: Summary: [Python] test_filesystem_dataset_no_filesystem_interaction segfault on s390x Key: ARROW-17105 URL: https://issues.apache.org/jira/browse/ARROW-17105 Project: Apache

[jira] [Commented] (ARROW-17103) [C++][CI] arrow-dataset-scanner-test has been failing on some nightly cpp builds

2022-07-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567958#comment-17567958 ] David Li commented on ARROW-17103: -- Duplicate of ARROW-17087? > [C++][CI] arrow-dataset-scanner-test

[jira] [Commented] (ARROW-17057) [Python] S3FileSystem has no parameter for retry strategy

2022-07-18 Thread Duncan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567952#comment-17567952 ] Duncan commented on ARROW-17057: [~kou]  [https://github.com/apache/arrow/pull/13633] I don't love it,

[jira] [Updated] (ARROW-17104) [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3

2022-07-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17104: --- Labels: pull-request-available (was: ) > [CI][Python] Pyarrow cannot be imported on CI job

[jira] [Updated] (ARROW-17057) [Python] S3FileSystem has no parameter for retry strategy

2022-07-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17057: --- Labels: pull-request-available (was: ) > [Python] S3FileSystem has no parameter for retry

[jira] [Commented] (ARROW-17104) [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567941#comment-17567941 ] Raúl Cumplido commented on ARROW-17104: --- Just to add some more info, the homebrew formulas for

[jira] [Commented] (ARROW-15838) [C++] Key column behavior in joins

2022-07-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567939#comment-17567939 ] Joris Van den Bossche commented on ARROW-15838: --- This is indeed what pyarrow does

[jira] [Assigned] (ARROW-17104) [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-17104: -- Assignee: Antoine Pitrou > [CI][Python] Pyarrow cannot be imported on CI job AMD64

[jira] [Commented] (ARROW-17104) [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567923#comment-17567923 ] Antoine Pitrou commented on ARROW-17104: And ultimately it is caused by the upstream issue

[jira] [Commented] (ARROW-17104) [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567922#comment-17567922 ] Antoine Pitrou commented on ARROW-17104: I think this is exactly the same issue as ARROW-16520,

[jira] [Created] (ARROW-17104) [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3

2022-07-18 Thread Jira
Raúl Cumplido created ARROW-17104: - Summary: [CI][Python] Pyarrow cannot be imported on CI job AMD64 MacOS 10.15 Python 3 Key: ARROW-17104 URL: https://issues.apache.org/jira/browse/ARROW-17104

[jira] [Updated] (ARROW-17102) [R] Test fails on R minimal nightly builds due to Parquet writing

2022-07-18 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-17102: - Summary: [R] Test fails on R minimal nightly builds due to Parquet writing (was: [R] Test

[jira] [Created] (ARROW-17103) [C++][CI] arrow-dataset-scanner-test has been failing on some nightly cpp builds

2022-07-18 Thread Jira
Raúl Cumplido created ARROW-17103: - Summary: [C++][CI] arrow-dataset-scanner-test has been failing on some nightly cpp builds Key: ARROW-17103 URL: https://issues.apache.org/jira/browse/ARROW-17103

[jira] [Comment Edited] (ARROW-16802) [Docs] Improve Acero Documentation

2022-07-18 Thread Kexin Su (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567908#comment-17567908 ] Kexin Su edited comment on ARROW-16802 at 7/18/22 9:49 AM: --- Hi, I am doing an

[jira] [Comment Edited] (ARROW-16802) [Docs] Improve Acero Documentation

2022-07-18 Thread Kexin Su (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567908#comment-17567908 ] Kexin Su edited comment on ARROW-16802 at 7/18/22 9:48 AM: --- Hi, I am doing an

[jira] [Comment Edited] (ARROW-16802) [Docs] Improve Acero Documentation

2022-07-18 Thread Kexin Su (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567908#comment-17567908 ] Kexin Su edited comment on ARROW-16802 at 7/18/22 9:48 AM: --- Hi, I am doing an

[jira] [Commented] (ARROW-16802) [Docs] Improve Acero Documentation

2022-07-18 Thread Kexin Su (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567908#comment-17567908 ] Kexin Su commented on ARROW-16802: -- Hi, I am doing an (academic) project about using GPU to accelerate

[jira] [Updated] (ARROW-9843) [C++][Python] Implement Between ternary kernel

2022-07-18 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-9843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-9843: - Fix Version/s: 10.0.0 (was: 9.0.0) > [C++][Python] Implement Between

[jira] [Assigned] (ARROW-16719) [Python] Add path/URI /+ filesystem handling to parquet.read_metadata

2022-07-18 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alenka Frim reassigned ARROW-16719: --- Assignee: Kshiteej K > [Python] Add path/URI /+ filesystem handling to

[jira] [Updated] (ARROW-16737) [C++] Bump version of bundled zstd library

2022-07-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16737: --- Labels: pull-request-available (was: ) > [C++] Bump version of bundled zstd library >

[jira] [Commented] (ARROW-17096) [C++] Mode kernel incorrect for boolean inputs

2022-07-18 Thread Yibo Cai (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567883#comment-17567883 ] Yibo Cai commented on ARROW-17096: -- Ah, it's indeed a bug in C++. > [C++] Mode kernel incorrect for

[jira] [Commented] (ARROW-17096) [C++] Mode kernel incorrect for boolean inputs

2022-07-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567877#comment-17567877 ] Joris Van den Bossche commented on ARROW-17096: --- bq. Fiddling the buffer directly, looks

[jira] [Updated] (ARROW-17101) [Java] Prepare new protoc-gen-grpc-java for s390x

2022-07-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17101: --- Labels: pull-request-available (was: ) > [Java] Prepare new protoc-gen-grpc-java for s390x

[jira] [Updated] (ARROW-17096) [C++] Mode kernel incorrect for boolean inputs

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-17096: --- Summary: [C++] Mode kernel incorrect for boolean inputs (was: pyarrow.compute.mode for

[jira] [Commented] (ARROW-17096) pyarrow.compute.mode for boolean arrays does not return true when mixed with false

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567874#comment-17567874 ] Antoine Pitrou commented on ARROW-17096: I think this is a C++ bug. In {{aggregate_test.cc}},

[jira] [Commented] (ARROW-17096) pyarrow.compute.mode for boolean arrays does not return true when mixed with false

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567872#comment-17567872 ] Antoine Pitrou commented on ARROW-17096: Hmm, this is weird. PyArrow doesn't do any

[jira] [Commented] (ARROW-17096) pyarrow.compute.mode for boolean arrays does not return true when mixed with false

2022-07-18 Thread Yibo Cai (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567865#comment-17567865 ] Yibo Cai commented on ARROW-17096: -- cc [~jorisvandenbossche] , [~apitrou] for comments. In below test,

[jira] [Commented] (ARROW-17100) [C++][Parquet] Fix backwards compatibility for ParquetV2 data pages written prior to 3.0.0 per ARROW-10353

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567863#comment-17567863 ] Antoine Pitrou commented on ARROW-17100: Note that this issue has existed since 3.0.0, so the

[jira] [Commented] (ARROW-17100) [C++][Parquet] Fix backwards compatibility for ParquetV2 data pages written prior to 3.0.0 per ARROW-10353

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567851#comment-17567851 ] Antoine Pitrou commented on ARROW-17100: That changeset is ARROW-10353, which fixes bugs both in

[jira] [Closed] (ARROW-17098) TypeError: __init__() got an unexpected keyword argument 'invalid_row_handler'

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-17098. -- Resolution: Cannot Reproduce > TypeError: __init__() got an unexpected keyword argument

[jira] [Commented] (ARROW-17098) TypeError: __init__() got an unexpected keyword argument 'invalid_row_handler'

2022-07-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17567846#comment-17567846 ] Antoine Pitrou commented on ARROW-17098: Glad you found the issue :-) > TypeError: __init__()

[jira] [Updated] (ARROW-17102) [R] Test fails on R minimal nightly builds

2022-07-18 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-17102: - Summary: [R] Test fails on R minimal nightly builds (was: [R] Test fails on

[jira] [Updated] (ARROW-17102) [R] Test fails on test-r-offline-minimal nightly build

2022-07-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17102: --- Labels: pull-request-available (was: ) > [R] Test fails on test-r-offline-minimal nightly

[jira] [Created] (ARROW-17102) [R] Test fails on test-r-offline-minimal nightly build

2022-07-18 Thread Nicola Crane (Jira)
Nicola Crane created ARROW-17102: Summary: [R] Test fails on test-r-offline-minimal nightly build Key: ARROW-17102 URL: https://issues.apache.org/jira/browse/ARROW-17102 Project: Apache Arrow

[jira] [Created] (ARROW-17101) [Java] Prepare new protoc-gen-grpc-java for s390x

2022-07-18 Thread Kazuaki Ishizaki (Jira)
Kazuaki Ishizaki created ARROW-17101: Summary: [Java] Prepare new protoc-gen-grpc-java for s390x Key: ARROW-17101 URL: https://issues.apache.org/jira/browse/ARROW-17101 Project: Apache Arrow

<    1   2