[jira] [Updated] (ARROW-17891) [Docs][Python] Update and sync Win section of the developers/python page

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17891: --- Labels: pull-request-available (was: ) > [Docs][Python] Update and sync Win section of the

[jira] [Commented] (ARROW-17943) [Python] Coredump when joining big large_strings

2022-10-07 Thread Yibo Cai (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614361#comment-17614361 ] Yibo Cai commented on ARROW-17943: -- The code below triggers same error log. Try it online:

[jira] [Updated] (ARROW-17904) [C++] Parquet support read page with crc32 checking

2022-10-07 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuwei Fu updated ARROW-17904: - Description: Currently, C++'s Parquet support write page with checksum, but `ReadPage` doesn't have

[jira] [Updated] (ARROW-17904) [C++] Parquet support read page with crc32 checking

2022-10-07 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xuwei Fu updated ARROW-17904: - Description: Currently, C++'s Parquet support write page with checksum, but `ReadPage` doesn't have

[jira] [Commented] (ARROW-16211) [C++][Python] Unregister compute functions

2022-10-07 Thread Vibhatha Lakmal Abeykoon (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614337#comment-17614337 ] Vibhatha Lakmal Abeykoon commented on ARROW-16211: -- I am also not against the safety

[jira] [Commented] (ARROW-17904) [C++] Parquet support read page with crc32 checking

2022-10-07 Thread Xuwei Fu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614332#comment-17614332 ] Xuwei Fu commented on ARROW-17904: -- [~apitrou] I'm on holidays last week. Now I found that in

[jira] [Created] (ARROW-17966) [C++] Adjust to new format for Substrait optional arguments

2022-10-07 Thread Weston Pace (Jira)
Weston Pace created ARROW-17966: --- Summary: [C++] Adjust to new format for Substrait optional arguments Key: ARROW-17966 URL: https://issues.apache.org/jira/browse/ARROW-17966 Project: Apache Arrow

[jira] [Updated] (ARROW-17737) [R] Groups before conversion to a Table must not be restored after `collect()`

2022-10-07 Thread SHIMA Tatsuya (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SHIMA Tatsuya updated ARROW-17737: -- Summary: [R] Groups before conversion to a Table must not be restored after `collect()`

[jira] [Commented] (ARROW-16340) [C++][Python] Move all Python related code into PyArrow

2022-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614268#comment-17614268 ] Kouhei Sutou commented on ARROW-16340: -- Yes, {{pyarrow.ipc.RecordBatchReader._import_from_c}} is a

[jira] [Updated] (ARROW-17965) [C++] ExecBatch support for ChunkedArray values

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17965: --- Labels: pull-request-available (was: ) > [C++] ExecBatch support for ChunkedArray values >

[jira] [Commented] (ARROW-17965) [C++] ExecBatch support for ChunkedArray values

2022-10-07 Thread Yaron Gvili (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614229#comment-17614229 ] Yaron Gvili commented on ARROW-17965: - Not sure how much context you are asking for. I bumped into

[jira] [Updated] (ARROW-17965) [C++] ExecBatch support for ChunkedArray values

2022-10-07 Thread Yaron Gvili (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yaron Gvili updated ARROW-17965: Description: Currently, `ExecBatch` does not handle chunked arrays when printing or slicing. The

[jira] [Commented] (ARROW-17965) [C++] ExecBatch support for ChunkedArray values

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614224#comment-17614224 ] David Li commented on ARROW-17965: -- Without context, this seems odd; why wouldn't the ChunkedArray

[jira] [Resolved] (ARROW-17738) [R] dplyr::compute should convert from grouped arrow_dplyr_query to arrow Table

2022-10-07 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dewey Dunnington resolved ARROW-17738. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 14160

[jira] [Created] (ARROW-17965) [C++] ExecBatch support for ChunkedArray values

2022-10-07 Thread Yaron Gvili (Jira)
Yaron Gvili created ARROW-17965: --- Summary: [C++] ExecBatch support for ChunkedArray values Key: ARROW-17965 URL: https://issues.apache.org/jira/browse/ARROW-17965 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-17964) [C++] Range data comparison for struct type may go out of bounds

2022-10-07 Thread Yaron Gvili (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yaron Gvili updated ARROW-17964: Description: When the struct-typed items being compared do not have the same length as the

[jira] [Updated] (ARROW-17964) [C++] Range data comparison for struct type may go out of bounds

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17964: --- Labels: pull-request-available (was: ) > [C++] Range data comparison for struct type may

[jira] [Updated] (ARROW-17962) [Java] Bug in test causes CI failures

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-17962: - Component/s: Java > [Java] Bug in test causes CI failures > - > >

[jira] [Assigned] (ARROW-17964) [C++] Range data comparison for struct type may go out of bounds

2022-10-07 Thread Yaron Gvili (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yaron Gvili reassigned ARROW-17964: --- Assignee: Yaron Gvili > [C++] Range data comparison for struct type may go out of bounds >

[jira] [Created] (ARROW-17964) [C++] Range data comparison for struct type may go out of bounds

2022-10-07 Thread Yaron Gvili (Jira)
Yaron Gvili created ARROW-17964: --- Summary: [C++] Range data comparison for struct type may go out of bounds Key: ARROW-17964 URL: https://issues.apache.org/jira/browse/ARROW-17964 Project: Apache Arrow

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614216#comment-17614216 ] Weston Pace commented on ARROW-17913: - Yes, I think caching will always need to be a higher level

[jira] [Commented] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614195#comment-17614195 ] Antoine Pitrou commented on ARROW-17927: I see, thanks. > [C++] Sporadic crashes in

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614194#comment-17614194 ] David Li commented on ARROW-17913: -- (The 'caching' and 'coalescing' parts of ReadRangeCache also

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614191#comment-17614191 ] Weston Pace commented on ARROW-17913: - I think we could add a ReadRanges method to the filesystem

[jira] [Comment Edited] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614190#comment-17614190 ] Percy Camilo Triveño Aucahuasi edited comment on ARROW-17927 at 10/7/22 5:58 PM:

[jira] [Commented] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614190#comment-17614190 ] Percy Camilo Triveño Aucahuasi commented on ARROW-17927: Using the recent

[jira] [Commented] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614189#comment-17614189 ] Weston Pace commented on ARROW-17961: - I think David's right. If you know you're going to read the

[jira] [Commented] (ARROW-15640) [Docs] UDFs on Cookbook

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614188#comment-17614188 ] Apache Arrow JIRA Bot commented on ARROW-15640: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-17023) [C++] Add initial Acero design documents

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-17023: - Assignee: (was: Weston Pace) > [C++] Add initial Acero design

[jira] [Assigned] (ARROW-16624) [C++] Adding Suffixes support for Substrait-Join

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-16624: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [C++] Adding Suffixes

[jira] [Commented] (ARROW-17023) [C++] Add initial Acero design documents

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614183#comment-17614183 ] Apache Arrow JIRA Bot commented on ARROW-17023: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-16859) [C++] Adding Aggregate Relation ToProto

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-16859: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [C++] Adding Aggregate

[jira] [Assigned] (ARROW-16492) [C++] Improving the Substrait-Python Handler

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-16492: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [C++] Improving the

[jira] [Assigned] (ARROW-17024) [Java] Ensure Flight with native Netty transport is actually being tested

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-17024: - Assignee: (was: David Li) > [Java] Ensure Flight with native Netty

[jira] [Commented] (ARROW-15638) [Docs] UDF Documentation

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614187#comment-17614187 ] Apache Arrow JIRA Bot commented on ARROW-15638: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-15640) [Docs] UDFs on Cookbook

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-15640: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [Docs] UDFs on

[jira] [Assigned] (ARROW-16485) [C++] Improving Substrait-Join Integration

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-16485: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [C++] Improving

[jira] [Commented] (ARROW-16859) [C++] Adding Aggregate Relation ToProto

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614184#comment-17614184 ] Apache Arrow JIRA Bot commented on ARROW-16859: --- This issue was last updated over 90 days

[jira] [Commented] (ARROW-16492) [C++] Improving the Substrait-Python Handler

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614182#comment-17614182 ] Apache Arrow JIRA Bot commented on ARROW-16492: --- This issue was last updated over 90 days

[jira] [Commented] (ARROW-16485) [C++] Improving Substrait-Join Integration

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614185#comment-17614185 ] Apache Arrow JIRA Bot commented on ARROW-16485: --- This issue was last updated over 90 days

[jira] [Commented] (ARROW-16624) [C++] Adding Suffixes support for Substrait-Join

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614186#comment-17614186 ] Apache Arrow JIRA Bot commented on ARROW-16624: --- This issue was last updated over 90 days

[jira] [Commented] (ARROW-17024) [Java] Ensure Flight with native Netty transport is actually being tested

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614181#comment-17614181 ] Apache Arrow JIRA Bot commented on ARROW-17024: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-15638) [Docs] UDF Documentation

2022-10-07 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-15638: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [Docs] UDF

[jira] [Commented] (ARROW-16211) [C++][Python] Unregister compute functions

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614178#comment-17614178 ] Weston Pace commented on ARROW-16211: - I'm +1 on Yaron's points regarding safety. Use cases: *

[jira] [Resolved] (ARROW-17955) [Docs][Java] Add Arrow user documentation for Table

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-17955. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 14344

[jira] [Commented] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614171#comment-17614171 ] Percy Camilo Triveño Aucahuasi commented on ARROW-17927: Definitely, I'll try

[jira] [Comment Edited] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614168#comment-17614168 ] Percy Camilo Triveño Aucahuasi edited comment on ARROW-17927 at 10/7/22 5:20 PM:

[jira] [Commented] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614169#comment-17614169 ] Antoine Pitrou commented on ARROW-17927: [~aucahuasi] This seems to be different issues, can you

[jira] [Commented] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614168#comment-17614168 ] Percy Camilo Triveño Aucahuasi commented on ARROW-17927: Just in case there are

[jira] [Updated] (ARROW-17962) [Java] Bug in test causes CI failures

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17962: --- Labels: pull-request-available (was: ) > [Java] Bug in test causes CI failures >

[jira] [Created] (ARROW-17963) [C++] Implement cast_dictionary for string

2022-10-07 Thread Will Jones (Jira)
Will Jones created ARROW-17963: -- Summary: [C++] Implement cast_dictionary for string Key: ARROW-17963 URL: https://issues.apache.org/jira/browse/ARROW-17963 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614155#comment-17614155 ] David Li commented on ARROW-17913: -- I agree ReadRangeCache is not relevant here, I was trying to make

[jira] [Commented] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614153#comment-17614153 ] David Li commented on ARROW-17961: -- That said assuming that is the cause, I don't think we necessarily

[jira] [Created] (ARROW-17962) [Java] Bug in test causes CI failures

2022-10-07 Thread Larry White (Jira)
Larry White created ARROW-17962: --- Summary: [Java] Bug in test causes CI failures Key: ARROW-17962 URL: https://issues.apache.org/jira/browse/ARROW-17962 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-5033) [C++] JSON table writer

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614150#comment-17614150 ] David Li commented on ARROW-5033: - That's a new can of worms :) There's been some discussion about a way

[jira] [Commented] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614148#comment-17614148 ] David Li commented on ARROW-17961: -- [~westonpace] probably has better context here, but from what I

[jira] [Updated] (ARROW-17953) [Archery] Add a docker info command

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17953: --- Labels: pull-request-available (was: ) > [Archery] Add a docker info command >

[jira] [Assigned] (ARROW-17953) [Archery] Add a docker info command

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido reassigned ARROW-17953: - Assignee: Raúl Cumplido > [Archery] Add a docker info command >

[jira] [Resolved] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-17927. Resolution: Fixed Issue resolved by pull request 14339

[jira] [Commented] (ARROW-16211) [C++][Python] Unregister compute functions

2022-10-07 Thread Yaron Gvili (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614131#comment-17614131 ] Yaron Gvili commented on ARROW-16211: - I'm not against a specific and simple solution for a simple

[jira] [Updated] (ARROW-17955) [Docs][Java] Add Arrow user documentation for Table

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17955: --- Labels: pull-request-available (was: ) > [Docs][Java] Add Arrow user documentation for

[jira] [Comment Edited] (ARROW-16340) [C++][Python] Move all Python related code into PyArrow

2022-10-07 Thread Yue Ni (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614113#comment-17614113 ] Yue Ni edited comment on ARROW-16340 at 10/7/22 2:47 PM: - [~kou]  > // Call the

[jira] [Commented] (ARROW-16340) [C++][Python] Move all Python related code into PyArrow

2022-10-07 Thread Yue Ni (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614113#comment-17614113 ] Yue Ni commented on ARROW-16340: [~kou]  > // Call the following Python code from pybind11 > py_reader

[jira] [Commented] (ARROW-17438) [R] glimpse() errors if there is a UDF

2022-10-07 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614070#comment-17614070 ] Dewey Dunnington commented on ARROW-17438: -- Thanks Will! I just tested on master too...I think

[jira] [Commented] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614053#comment-17614053 ] Jacob Wujciak-Jens commented on ARROW-17961: Things like readahead and metadata caching cc

[jira] [Commented] (ARROW-17953) [Archery] Add a docker info command

2022-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614045#comment-17614045 ] Antoine Pitrou commented on ARROW-17953: That's an interesting idea, I hadn't thought of that.

[jira] [Resolved] (ARROW-17366) [R] Support purrr-style lambda functions in .fns argument to across()

2022-10-07 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane resolved ARROW-17366. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 14327

[jira] [Updated] (ARROW-17863) [Python] Deprecate Plasma Python bindings

2022-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17863: --- Labels: pull-request-available (was: ) > [Python] Deprecate Plasma Python bindings >

[jira] [Commented] (ARROW-17953) [Archery] Add a docker info command

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17614007#comment-17614007 ] Raúl Cumplido commented on ARROW-17953: --- would something that prints the whole expanded yaml for

[jira] [Comment Edited] (ARROW-16340) [C++][Python] Move all Python related code into PyArrow

2022-10-07 Thread Yue Ni (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613979#comment-17613979 ] Yue Ni edited comment on ARROW-16340 at 10/7/22 9:18 AM: - [~kou] thanks so much.

[jira] [Commented] (ARROW-16340) [C++][Python] Move all Python related code into PyArrow

2022-10-07 Thread Yue Ni (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613979#comment-17613979 ] Yue Ni commented on ARROW-16340: [~kou] thanks so much. I will give it a try and get back to you later.

[jira] [Commented] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613978#comment-17613978 ] Antoine Pitrou commented on ARROW-17961: Which "optimization" is that? > Add read/write

[jira] [Created] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread Volker Lorrmann (Jira)
Volker Lorrmann created ARROW-17961: --- Summary: Add read/write optimization for pyarrow.fs.S3FileSystem Key: ARROW-17961 URL: https://issues.apache.org/jira/browse/ARROW-17961 Project: Apache Arrow

[jira] [Created] (ARROW-17960) [C++] Add kernel for slicing list values

2022-10-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-17960: - Summary: [C++] Add kernel for slicing list values Key: ARROW-17960 URL: https://issues.apache.org/jira/browse/ARROW-17960 Project: Apache Arrow

[jira] [Assigned] (ARROW-17863) [Python] Deprecate Plasma Python bindings

2022-10-07 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alenka Frim reassigned ARROW-17863: --- Assignee: Alenka Frim > [Python] Deprecate Plasma Python bindings >

[jira] [Updated] (ARROW-17959) [C++][Dataset] Optimize Parquet column projection for subset of nested field

2022-10-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-17959: -- Summary: [C++][Dataset] Optimize Parquet column projection for subset of

[jira] [Created] (ARROW-17959) [C++][Dataset]

2022-10-07 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-17959: - Summary: [C++][Dataset] Key: ARROW-17959 URL: https://issues.apache.org/jira/browse/ARROW-17959 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-15678) [C++][CI] a crossbow job with MinRelSize enabled

2022-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613939#comment-17613939 ] Kouhei Sutou commented on ARROW-15678: -- We don't have this problem with

[jira] [Comment Edited] (ARROW-15678) [C++][CI] a crossbow job with MinRelSize enabled

2022-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613912#comment-17613912 ] Kouhei Sutou edited comment on ARROW-15678 at 10/7/22 7:54 AM: --- I propose

[jira] [Commented] (ARROW-15678) [C++][CI] a crossbow job with MinRelSize enabled

2022-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613930#comment-17613930 ] Antoine Pitrou commented on ARROW-15678: Well, I don't think we can force the compiler to inline

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613923#comment-17613923 ] Håkon Magne Holmen commented on ARROW-17913: I'll use it for the time being. Thanks for the

[jira] [Commented] (ARROW-15678) [C++][CI] a crossbow job with MinRelSize enabled

2022-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613912#comment-17613912 ] Kouhei Sutou commented on ARROW-15678: -- I propose one more approach for the proposed solution 1.:

[jira] [Commented] (ARROW-15678) [C++][CI] a crossbow job with MinRelSize enabled

2022-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613909#comment-17613909 ] Kouhei Sutou commented on ARROW-15678: -- Summary of this problem: Problem: * Parquet module is

[jira] [Created] (ARROW-17958) [Python] Change the base directory for PyArrow CPP header files

2022-10-07 Thread Alenka Frim (Jira)
Alenka Frim created ARROW-17958: --- Summary: [Python] Change the base directory for PyArrow CPP header files Key: ARROW-17958 URL: https://issues.apache.org/jira/browse/ARROW-17958 Project: Apache Arrow