[jira] [Commented] (ARROW-10739) [Python] Pickling a sliced array serializes all the buffers

2022-08-05 Thread Clark Zinzow (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576108#comment-17576108 ] Clark Zinzow commented on ARROW-10739: -- Ping on this, [~amol-] [~jcrist] are either

[jira] [Comment Edited] (ARROW-10739) [Python] Pickling a sliced array serializes all the buffers

2022-08-05 Thread Clark Zinzow (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576108#comment-17576108 ] Clark Zinzow edited comment on ARROW-10739 at 8/6/22 1:49 AM:

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576079#comment-17576079 ] Ziheng Wang commented on ARROW-17313: - Ideally we update the Dataset Scanner to be a

[jira] [Closed] (ARROW-17309) [Python] arrow-python-devel-9.0.0-1.el7.x86_64.rpm is missing from Centos 7 repo

2022-08-05 Thread Lei (Eddy) Xu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lei (Eddy) Xu closed ARROW-17309. - Resolution: Information Provided > [Python] arrow-python-devel-9.0.0-1.el7.x86_64.rpm is missing

[jira] [Commented] (ARROW-17309) [Python] arrow-python-devel-9.0.0-1.el7.x86_64.rpm is missing from Centos 7 repo

2022-08-05 Thread Lei (Eddy) Xu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576030#comment-17576030 ] Lei (Eddy) Xu commented on ARROW-17309: --- Got it. Thanks for the explanation [~kou]

[jira] [Commented] (ARROW-17328) [C++] Add hash_mode function

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17576024#comment-17576024 ] David Li commented on ARROW-17328: -- FWIW, I punted on this one last year since it might

[jira] [Created] (ARROW-17328) [C++] Add hash_mode function

2022-08-05 Thread Ian Cook (Jira)
Ian Cook created ARROW-17328: Summary: [C++] Add hash_mode function Key: ARROW-17328 URL: https://issues.apache.org/jira/browse/ARROW-17328 Project: Apache Arrow Issue Type: New Feature

[jira] [Assigned] (ARROW-17057) [Python] S3FileSystem has no parameter for retry strategy

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-17057: Assignee: Duncan (was: David Li) > [Python] S3FileSystem has no parameter for retry strategy > -

[jira] [Assigned] (ARROW-17057) [Python] S3FileSystem has no parameter for retry strategy

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-17057: Assignee: David Li (was: Duncan) > [Python] S3FileSystem has no parameter for retry strategy > -

[jira] [Assigned] (ARROW-17057) [Python] S3FileSystem has no parameter for retry strategy

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-17057: Assignee: Duncan > [Python] S3FileSystem has no parameter for retry strategy > --

[jira] [Created] (ARROW-17327) Parquet should be listed in PyArrow's get_libraries() function

2022-08-05 Thread Steven Silvester (Jira)
Steven Silvester created ARROW-17327: Summary: Parquet should be listed in PyArrow's get_libraries() function Key: ARROW-17327 URL: https://issues.apache.org/jira/browse/ARROW-17327 Project: Apach

[jira] [Updated] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ziheng Wang updated ARROW-17313: Description: Sometimes it's desirable to just read a portion of a CSV. The best way to do that is

[jira] [Updated] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ziheng Wang updated ARROW-17313: Description: Sometimes it's desirable to just read a portion of a CSV. The best way to do that is

[jira] [Resolved] (ARROW-17297) [Java][Docs] Add example of Java to C++ via C Data Interface

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-17297. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13788 [https://github.co

[jira] [Resolved] (ARROW-17323) [Go] Clean up and upgrade dependencies

2022-08-05 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Topol resolved ARROW-17323. --- Resolution: Fixed Issue resolved by pull request 13807 [https://github.com/apache/arrow/pull

[jira] [Closed] (ARROW-17325) AQE should use available column statistics from completed query stages

2022-08-05 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-17325. -- Resolution: Invalid > AQE should use available column statistics from completed query stages > ---

[jira] [Updated] (ARROW-17325) AQE should use available column statistics from completed query stages

2022-08-05 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-17325: --- Description: In QueryStageExec.computeStats we copy partial statistics from materlized query stages

[jira] [Created] (ARROW-17326) [Go][FlightSQL] Add Support for FlightSQL to Go

2022-08-05 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-17326: - Summary: [Go][FlightSQL] Add Support for FlightSQL to Go Key: ARROW-17326 URL: https://issues.apache.org/jira/browse/ARROW-17326 Project: Apache Arrow Issu

[jira] [Created] (ARROW-17325) AQE should use available column statistics from completed query stages

2022-08-05 Thread Andy Grove (Jira)
Andy Grove created ARROW-17325: -- Summary: AQE should use available column statistics from completed query stages Key: ARROW-17325 URL: https://issues.apache.org/jira/browse/ARROW-17325 Project: Apache Ar

[jira] [Updated] (ARROW-17310) [C++] Expose RecordBatchReader::MakeFromIterator

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-17310: - Summary: [C++] Expose RecordBatchReader::MakeFromIterator (was: [C++] Expose SimpleRecordBatchReader pu

[jira] [Resolved] (ARROW-17310) [C++] Expose SimpleRecordBatchReader publicly

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-17310. -- Resolution: Fixed Issue resolved by pull request 13798 [https://github.com/apache/arrow/pull/13798] >

[jira] [Created] (ARROW-17324) [Go][CI] Add new Go CI job with -asan

2022-08-05 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-17324: - Summary: [Go][CI] Add new Go CI job with -asan Key: ARROW-17324 URL: https://issues.apache.org/jira/browse/ARROW-17324 Project: Apache Arrow Issue Type: Im

[jira] [Closed] (ARROW-15953) [CI][Go][Flight] Go Flight integration client seems flaky

2022-08-05 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Topol closed ARROW-15953. - Resolution: Cannot Reproduce > [CI][Go][Flight] Go Flight integration client seems flaky > -

[jira] [Created] (ARROW-17323) [Go] Clean up and upgrade dependencies

2022-08-05 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-17323: - Summary: [Go] Clean up and upgrade dependencies Key: ARROW-17323 URL: https://issues.apache.org/jira/browse/ARROW-17323 Project: Apache Arrow Issue Type: I

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575967#comment-17575967 ] Ziheng Wang commented on ARROW-17313: - Also this will not support compressed formats

[jira] [Updated] (ARROW-17323) [Go] Clean up and upgrade dependencies

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17323: --- Labels: pull-request-available (was: ) > [Go] Clean up and upgrade dependencies > -

[jira] [Updated] (ARROW-17276) [Go][Integration] Implement IPC handling for Union Arrays

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17276: --- Labels: pull-request-available (was: ) > [Go][Integration] Implement IPC handling for Union

[jira] [Resolved] (ARROW-17321) [JS] Update dependencies

2022-08-05 Thread Dominik Moritz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dominik Moritz resolved ARROW-17321. Resolution: Fixed Issue resolved by pull request 13758 [https://github.com/apache/arrow/pu

[jira] [Resolved] (ARROW-10600) [Go] Support Decimal256 type

2022-08-05 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Topol resolved ARROW-10600. --- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13792 [https:/

[jira] [Updated] (ARROW-17322) [Docs] Add issue handling guidance to docs

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17322: --- Labels: pull-request-available (was: ) > [Docs] Add issue handling guidance to docs > -

[jira] [Assigned] (ARROW-17322) [Docs] Add issue handling guidance to docs

2022-08-05 Thread Todd Farmer (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Farmer reassigned ARROW-17322: --- Assignee: Todd Farmer > [Docs] Add issue handling guidance to docs > --

[jira] [Created] (ARROW-17322) [Docs] Add issue handling guidance to docs

2022-08-05 Thread Todd Farmer (Jira)
Todd Farmer created ARROW-17322: --- Summary: [Docs] Add issue handling guidance to docs Key: ARROW-17322 URL: https://issues.apache.org/jira/browse/ARROW-17322 Project: Apache Arrow Issue Type: I

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575937#comment-17575937 ] Weston Pace commented on ARROW-17313: - > We should reject a partial read if newlines

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575928#comment-17575928 ] Antoine Pitrou commented on ARROW-17313: Nothing. The Substrait producer should

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575930#comment-17575930 ] Weston Pace commented on ARROW-17313: - > It's not too late to change the Substrait s

[jira] [Updated] (ARROW-17321) [JS] Update dependencies

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17321: --- Labels: pull-request-available (was: ) > [JS] Update dependencies > ---

[jira] [Updated] (ARROW-17321) [JS] Update dependencies

2022-08-05 Thread Dominik Moritz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dominik Moritz updated ARROW-17321: --- Summary: [JS] Update dependencies (was: Update dependencies) > [JS] Update dependencies > -

[jira] [Created] (ARROW-17321) Update dependencies

2022-08-05 Thread Dominik Moritz (Jira)
Dominik Moritz created ARROW-17321: -- Summary: Update dependencies Key: ARROW-17321 URL: https://issues.apache.org/jira/browse/ARROW-17321 Project: Apache Arrow Issue Type: Task Com

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575916#comment-17575916 ] Ziheng Wang commented on ARROW-17313: - Ah I meant what we should do about the linbre

[jira] [Commented] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575902#comment-17575902 ] Antoine Pitrou commented on ARROW-17313: There's not much to elaborate. {{Random

[jira] [Updated] (ARROW-17313) [C++] Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-17313: --- Summary: [C++] Add Byte Range to CSV Reader ReadOptions (was: Add Byte Range to CSV Reader

[jira] [Comment Edited] (ARROW-17313) Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575600#comment-17575600 ] Antoine Pitrou edited comment on ARROW-17313 at 8/5/22 3:13 PM: --

[jira] [Commented] (ARROW-17319) [Python] pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575901#comment-17575901 ] Kouhei Sutou commented on ARROW-17319: -- Yes. We use v0.10.7 because we use vcpkg 38

[jira] [Updated] (ARROW-17319) [Python] pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-17319: - Summary: [Python] pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 i

[jira] [Commented] (ARROW-17319) pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available

2022-08-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575882#comment-17575882 ] Antoine Pitrou commented on ARROW-17319: Wow, I had no idea the AWS SDK was goin

[jira] [Commented] (ARROW-17319) pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available

2022-08-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575884#comment-17575884 ] Antoine Pitrou commented on ARROW-17319: cc [~kou] > pyarrow seems to set defau

[jira] [Commented] (ARROW-17313) Add Byte Range to CSV Reader ReadOptions

2022-08-05 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17313?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575862#comment-17575862 ] Ziheng Wang commented on ARROW-17313: - [~apitrou] can you elaborate a bit on your wa

[jira] [Updated] (ARROW-17315) [CI] Source Release and Merge Script job failed

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17315: --- Labels: pull-request-available (was: ) > [CI] Source Release and Merge Script job failed >

[jira] [Updated] (ARROW-17320) [Python] Refine pyarrow.parquet API exposure

2022-08-05 Thread Miles Granger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miles Granger updated ARROW-17320: -- Description: Spawning from ARROW-17106, moving code from `pyarrow/parquet/_init_{_}` to `pyar

[jira] [Updated] (ARROW-17320) [Python] Refine pyarrow.parquet API exposure

2022-08-05 Thread Miles Granger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miles Granger updated ARROW-17320: -- Description: Spawning from ARROW-17106, moving code from `pyarrow/parquet/__{_}init__{_}{_}`

[jira] [Closed] (ARROW-17314) [R] Unable to install dev version of arrow: Unable to identify current OS/version

2022-08-05 Thread Neil Currie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neil Currie closed ARROW-17314. --- Resolution: Fixed > [R] Unable to install dev version of arrow: Unable to identify current > OS/ver

[jira] [Updated] (ARROW-17320) [Python] Refine pyarrow.parquet API exposure

2022-08-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-17320: - Summary: [Python] Refine pyarrow.parquet API exposure (was: Refine pyarrow.parquet API exposure) > [Py

[jira] [Updated] (ARROW-17318) [C++][Dataset] Support async streaming interface for getting fragments in Dataset

2022-08-05 Thread Pavel Solodovnikov (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pavel Solodovnikov updated ARROW-17318: --- Component/s: C++ > [C++][Dataset] Support async streaming interface for getting frag

[jira] [Commented] (ARROW-17312) [R] R session aborts when using dplyr::filter after setting as_data_frame = FALSE in arrow::read_csv_arrow

2022-08-05 Thread Neil Currie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575831#comment-17575831 ] Neil Currie commented on ARROW-17312: - Perfect Neal thanks, working now. > [R] R se

[jira] [Updated] (ARROW-17318) [C++][Dataset] Support async streaming interface for getting fragments in Dataset

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17318: --- Labels: c++ dataset pull-request-available (was: c++ dataset) > [C++][Dataset] Support asyn

[jira] [Commented] (ARROW-17312) [R] R session aborts when using dplyr::filter after setting as_data_frame = FALSE in arrow::read_csv_arrow

2022-08-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575818#comment-17575818 ] Neal Richardson commented on ARROW-17312: - Actually it was right there in front

[jira] [Created] (ARROW-17320) Refine pyarrow.parquet API exposure

2022-08-05 Thread Miles Granger (Jira)
Miles Granger created ARROW-17320: - Summary: Refine pyarrow.parquet API exposure Key: ARROW-17320 URL: https://issues.apache.org/jira/browse/ARROW-17320 Project: Apache Arrow Issue Type: Impr

[jira] [Commented] (ARROW-17312) [R] R session aborts when using dplyr::filter after setting as_data_frame = FALSE in arrow::read_csv_arrow

2022-08-05 Thread Neil Currie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575808#comment-17575808 ] Neil Currie commented on ARROW-17312: - Thanks Neal - at the risk of sounding ignoran

[jira] [Commented] (ARROW-17319) pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available

2022-08-05 Thread Mike Gevaert (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575804#comment-17575804 ] Mike Gevaert commented on ARROW-17319: -- Is it possible that an old version of {{aws

[jira] [Commented] (ARROW-17312) [R] R session aborts when using dplyr::filter after setting as_data_frame = FALSE in arrow::read_csv_arrow

2022-08-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575795#comment-17575795 ] Neal Richardson commented on ARROW-17312: - Hard to say without more details, but

[jira] [Created] (ARROW-17319) pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available

2022-08-05 Thread Mike Gevaert (Jira)
Mike Gevaert created ARROW-17319: Summary: pyarrow seems to set default CPU affinity to 0 on shutdown, crashes if CPU 0 is not available Key: ARROW-17319 URL: https://issues.apache.org/jira/browse/ARROW-17319

[jira] [Comment Edited] (ARROW-17312) [R] R session aborts when using dplyr::filter after setting as_data_frame = FALSE in arrow::read_csv_arrow

2022-08-05 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575795#comment-17575795 ] Neal Richardson edited comment on ARROW-17312 at 8/5/22 11:47 AM:

[jira] [Commented] (ARROW-16346) [Python] Add a migration path for external packages due to Python code being moved to PyArrow

2022-08-05 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575709#comment-17575709 ] Alenka Frim commented on ARROW-16346: - No, there seem to be no need to change anythi

[jira] [Created] (ARROW-17318) [C++][Dataset] Support async streaming interface for getting fragments in Dataset

2022-08-05 Thread Pavel Solodovnikov (Jira)
Pavel Solodovnikov created ARROW-17318: -- Summary: [C++][Dataset] Support async streaming interface for getting fragments in Dataset Key: ARROW-17318 URL: https://issues.apache.org/jira/browse/ARROW-17318

[jira] [Commented] (ARROW-17309) [Python] arrow-python-devel-9.0.0-1.el7.x86_64.rpm is missing from Centos 7 repo

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575670#comment-17575670 ] Kouhei Sutou commented on ARROW-17309: -- We dropped support for {{arrow-python-devel

[jira] (ARROW-17265) build python lib failed on both X86 and ARMv8

2022-08-05 Thread chendan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17265 ] chendan deleted comment on ARROW-17265: - was (Author: JIRAUSER283005): [~rokm]  Thanks a lot! Bundled build seems to be a good choice. I make on this configration. Here is the error. Boost_ep d

[jira] [Commented] (ARROW-17265) build python lib failed on both X86 and ARMv8

2022-08-05 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575637#comment-17575637 ] Rok Mihevc commented on ARROW-17265: [~atptour2017] It seems like a boost issue. If

[jira] [Comment Edited] (ARROW-17291) [C++] Build jemalloc_ep source code failed with Apache Arrow 2.0.0 on aarch64 CentOS 7

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575633#comment-17575633 ] Kouhei Sutou edited comment on ARROW-17291 at 8/5/22 8:17 AM:

[jira] [Commented] (ARROW-17291) Build jemalloc_ep source code failed

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17575633#comment-17575633 ] Kouhei Sutou commented on ARROW-17291: -- How about disabling jemalloc by `-DARROW_JE

[jira] [Updated] (ARROW-17314) [R] Unable to install dev version of arrow: Unable to identify current OS/version

2022-08-05 Thread Neil Currie (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neil Currie updated ARROW-17314: Description: Reporting as per the instructions here: [https://arrow.apache.org/docs/r/articles/in

[jira] [Updated] (ARROW-17291) [C++] Build jemalloc_ep source code failed with Apache Arrow 2.0.0 on aarch64 CentOS 7

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-17291: - Summary: [C++] Build jemalloc_ep source code failed with Apache Arrow 2.0.0 on aarch64 CentOS 7

[jira] [Comment Edited] (ARROW-17291) Build jemalloc_ep source code failed

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17574624#comment-17574624 ] Kouhei Sutou edited comment on ARROW-17291 at 8/5/22 8:03 AM:

[jira] [Updated] (ARROW-17291) Build jemalloc_ep source code failed

2022-08-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-17291: - Description: I want to build pyarrow for arm platform.  I follow the steps in [https://arrow.ap

[jira] [Updated] (ARROW-17106) [Python] Move parquet code from __init__.py and expose only API

2022-08-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17106: --- Labels: good-first-issue pull-request-available (was: good-first-issue) > [Python] Move par

[jira] [Assigned] (ARROW-17106) [Python] Move parquet code from __init__.py and expose only API

2022-08-05 Thread Miles Granger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Miles Granger reassigned ARROW-17106: - Assignee: Miles Granger > [Python] Move parquet code from __init__.py and expose only A

[jira] [Created] (ARROW-17317) [Release][Docs] Normalize previous document version directory

2022-08-05 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-17317: Summary: [Release][Docs] Normalize previous document version directory Key: ARROW-17317 URL: https://issues.apache.org/jira/browse/ARROW-17317 Project: Apache Arrow