[jira] [Updated] (ARROW-17923) [C++] Consider dictionary arrays for special fragment fields

2022-10-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17923: --- Labels: pull-request-available (was: ) > [C++] Consider dictionary arrays for special

[jira] [Commented] (ARROW-18149) [C++] Failed to compile join_example without `-DARROW_CSV=ON` option

2022-10-24 Thread Sho Nakatani (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623506#comment-17623506 ] Sho Nakatani commented on ARROW-18149: -- Created a pull-request:

[jira] [Updated] (ARROW-18149) [C++] Failed to compile join_example without `-DARROW_CSV=ON` option

2022-10-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-18149: --- Labels: pull-request-available (was: ) > [C++] Failed to compile join_example without

[jira] [Created] (ARROW-18149) [C++] Failed to compile join_example without `-DARROW_CSV=ON` option

2022-10-24 Thread Sho Nakatani (Jira)
Sho Nakatani created ARROW-18149: Summary: [C++] Failed to compile join_example without `-DARROW_CSV=ON` option Key: ARROW-18149 URL: https://issues.apache.org/jira/browse/ARROW-18149 Project: Apache

[jira] [Updated] (ARROW-18148) [R] Rename read_ipc_file to read_arrow_file & highlight arrow over feather

2022-10-24 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-18148: - Description: Following up from [this mailing list

[jira] [Updated] (ARROW-18147) [Go] Add Scalar Add/Sub for Decimal types

2022-10-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-18147: --- Labels: pull-request-available (was: ) > [Go] Add Scalar Add/Sub for Decimal types >

[jira] [Created] (ARROW-18148) [R] Rename read_ipc_file to read_arrow_file & highlight arrow over feather

2022-10-24 Thread Stephanie Hazlitt (Jira)
Stephanie Hazlitt created ARROW-18148: - Summary: [R] Rename read_ipc_file to read_arrow_file & highlight arrow over feather Key: ARROW-18148 URL: https://issues.apache.org/jira/browse/ARROW-18148

[jira] [Created] (ARROW-18147) [Go] Add Scalar Add/Sub for Decimal types

2022-10-24 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-18147: - Summary: [Go] Add Scalar Add/Sub for Decimal types Key: ARROW-18147 URL: https://issues.apache.org/jira/browse/ARROW-18147 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-14999) [C++] List types with different field names are not equal

2022-10-24 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623449#comment-17623449 ] Will Jones commented on ARROW-14999: So here are the conclusions I've gathered so far: 1. Equality

[jira] [Resolved] (ARROW-18137) [Python][Docs] Allow passing no aggregations to TableGroupBy.aggregate

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18137. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14482

[jira] [Updated] (ARROW-18146) [C++] arrow::UInt64Builder::Reset() doesn't affect the builder's length()

2022-10-24 Thread Edward Nolan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Edward Nolan updated ARROW-18146: - Description: Git hash of Arrow version tested: eb01350b1a6e4588df4653495fb962065409250b Steps

[jira] [Created] (ARROW-18146) [C++] arrow::UInt64Builder::Reset() doesn't affect the builder's length()

2022-10-24 Thread Edward Nolan (Jira)
Edward Nolan created ARROW-18146: Summary: [C++] arrow::UInt64Builder::Reset() doesn't affect the builder's length() Key: ARROW-18146 URL: https://issues.apache.org/jira/browse/ARROW-18146 Project:

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Sasha Krassovsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623421#comment-17623421 ] Sasha Krassovsky commented on ARROW-18115: -- It would mostly meet my needs, but I'd still want

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623420#comment-17623420 ] Antoine Pitrou commented on ARROW-18115: IIRC there isn't much contention on the PR (at least

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-10-24 Thread Sasha Krassovsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623419#comment-17623419 ] Sasha Krassovsky commented on ARROW-18113: -- Yes, io_uring has `io_uring_wait_cqe` which will

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623417#comment-17623417 ] Weston Pace commented on ARROW-18115: - {quote} I'm not sure which PR it is about, but I think

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-10-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623416#comment-17623416 ] David Li commented on ARROW-18113: -- For Parquet, the reader can submit reads for all ColumnChunks at

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623386#comment-17623386 ] Weston Pace commented on ARROW-18113: - {quote} Linking io_uring to the Future API could be a bit

[jira] [Closed] (ARROW-18139) [C++][Release] Verification failures on CentOS7

2022-10-24 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou closed ARROW-18139. Resolution: Not A Problem The {{strptime()}} related failure is a bug in glibc on CentOS 7:

[jira] [Commented] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623366#comment-17623366 ] Jacek Pliszka commented on ARROW-15474: --- I added one more speedup - definitely you can sort it at

[jira] [Comment Edited] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623355#comment-17623355 ] Jacek Pliszka edited comment on ARROW-15474 at 10/24/22 7:33 PM: - Lance

[jira] [Commented] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623362#comment-17623362 ] Lance Dacey commented on ARROW-15474: - Nice - I will give that a shot, thanks. I have been using a

[jira] [Comment Edited] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623355#comment-17623355 ] Jacek Pliszka edited comment on ARROW-15474 at 10/24/22 7:25 PM: - Lance

[jira] [Comment Edited] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623355#comment-17623355 ] Jacek Pliszka edited comment on ARROW-15474 at 10/24/22 7:24 PM: - Lance

[jira] [Comment Edited] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623356#comment-17623356 ] Antoine Pitrou edited comment on ARROW-18115 at 10/24/22 7:23 PM: -- >

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623356#comment-17623356 ] Antoine Pitrou commented on ARROW-18115: > Would this still meet your needs? Would you be

[jira] [Comment Edited] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623355#comment-17623355 ] Jacek Pliszka edited comment on ARROW-15474 at 10/24/22 7:20 PM: - Lance

[jira] [Commented] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623355#comment-17623355 ] Jacek Pliszka commented on ARROW-15474: --- Lance - the code you have posted might not be very

[jira] [Commented] (ARROW-17608) [JS] Implement C Data Interface

2022-10-24 Thread Chang She (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623345#comment-17623345 ] Chang She commented on ARROW-17608: --- I would be very excited about this as well. Would love to be able

[jira] [Commented] (ARROW-18137) [Python][Docs] Allow passing no aggregations to TableGroupBy.aggregate

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623344#comment-17623344 ] Jacek Pliszka commented on ARROW-18137: --- one was added on ARROW-13993 and for more there is

[jira] [Commented] (ARROW-14378) [R] Make custom extension classes for (some) cols with row-level metadata

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623339#comment-17623339 ] Neal Richardson commented on ARROW-14378: - [~jonkeane][~paleolimbot] can this issue and subtasks

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623337#comment-17623337 ] Weston Pace commented on ARROW-18115: - [~sakras] FYI, there was some further discussion on this

[jira] [Commented] (ARROW-12826) [R] [CI] Add caching to revdepchecks

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623336#comment-17623336 ] Neal Richardson commented on ARROW-12826: - Still valid [~assignUser]? > [R] [CI] Add caching to

[jira] [Closed] (ARROW-12844) [R] Implement common date and time functions for dplyr

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-12844. --- Resolution: Fixed > [R] Implement common date and time functions for dplyr >

[jira] [Commented] (ARROW-12282) [R] Refactor collect and compute methods

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623335#comment-17623335 ] Neal Richardson commented on ARROW-12282: - Doing in ARROW-15460. > [R] Refactor collect and

[jira] [Assigned] (ARROW-12282) [R] Refactor collect and compute methods

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-12282: --- Assignee: Neal Richardson > [R] Refactor collect and compute methods >

[jira] [Commented] (ARROW-18114) [R] unify_schemas=FALSE does not improve open_dataset() read times

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623334#comment-17623334 ] Weston Pace commented on ARROW-18114: - Yes, I would expect there to be a difference. I'll try and

[jira] [Closed] (ARROW-11963) [R][C++] Installation issues on Fedora 33 with hardening flags

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-11963. --- Resolution: Cannot Reproduce A lot has changed in the package build system since this issue

[jira] [Commented] (ARROW-16029) [Python] Runaway process with generator in "write_dataset()"

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623330#comment-17623330 ] Weston Pace commented on ARROW-16029: - {quote} Weston Pace that might be related to the fact that

[jira] [Closed] (ARROW-11755) [R] Add tests from dplyr/test-mutate.r

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-11755. --- Resolution: Won't Fix > [R] Add tests from dplyr/test-mutate.r >

[jira] [Closed] (ARROW-11249) [R] installation failure on CentOS 7 with bad/missing unzip

2022-10-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-11249. --- Resolution: Cannot Reproduce A lot has changed in the package build system in the last 2

[jira] [Commented] (ARROW-18140) The metadata info will lost in parquet file schema after writing the parquet file by calling the FileSystemDataset::Write() method.

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623326#comment-17623326 ] Weston Pace commented on ARROW-18140: - This could definitely be improved. The write node, in Acero,

[jira] [Commented] (ARROW-18137) [Python][Docs] Allow passing no aggregations to TableGroupBy.aggregate

2022-10-24 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623317#comment-17623317 ] Jacek Pliszka commented on ARROW-18137: --- I would be more for pandas drop_duplicates() It is an

[jira] [Commented] (ARROW-17116) [C++][Gandiva] Add RepeatStr

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623308#comment-17623308 ] Apache Arrow JIRA Bot commented on ARROW-17116: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-17210) [C++][Docs] Substrait Usage in Acero

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-17210: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [C++][Docs] Substrait

[jira] [Assigned] (ARROW-17116) [C++][Gandiva] Add RepeatStr

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-17116: - Assignee: (was: Sahaj Gupta) > [C++][Gandiva] Add RepeatStr >

[jira] [Commented] (ARROW-17183) [C++] Adding ExecNode with Sort and Fetch capability

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623310#comment-17623310 ] Apache Arrow JIRA Bot commented on ARROW-17183: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-17189) [Python][Docs] Nightly build instructions install release version

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-17189: - Assignee: (was: Todd Farmer) > [Python][Docs] Nightly build

[jira] [Commented] (ARROW-17210) [C++][Docs] Substrait Usage in Acero

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623309#comment-17623309 ] Apache Arrow JIRA Bot commented on ARROW-17210: --- This issue was last updated over 90 days

[jira] [Assigned] (ARROW-17183) [C++] Adding ExecNode with Sort and Fetch capability

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-17183: - Assignee: (was: Vibhatha Lakmal Abeykoon) > [C++] Adding ExecNode

[jira] [Commented] (ARROW-17189) [Python][Docs] Nightly build instructions install release version

2022-10-24 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623311#comment-17623311 ] Apache Arrow JIRA Bot commented on ARROW-17189: --- This issue was last updated over 90 days

[jira] [Updated] (ARROW-17867) [C++][FlightRPC] Expose bulk parameter binding in Flight SQL client

2022-10-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-17867: - Component/s: C++ FlightRPC > [C++][FlightRPC] Expose bulk parameter binding in Flight

[jira] [Created] (ARROW-18145) [C++] Populate Substrait producer version from cmake config variables

2022-10-24 Thread Weston Pace (Jira)
Weston Pace created ARROW-18145: --- Summary: [C++] Populate Substrait producer version from cmake config variables Key: ARROW-18145 URL: https://issues.apache.org/jira/browse/ARROW-18145 Project: Apache

[jira] [Updated] (ARROW-18144) [C++] Improve JSONTypeError error message in testing

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18144: --- Fix Version/s: 11.0.0 > [C++] Improve JSONTypeError error message in testing >

[jira] [Commented] (ARROW-18141) [C++] Alignment not enforced; undefined behavior

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623264#comment-17623264 ] Antoine Pitrou commented on ARROW-18141: Could you try with draft PR

[jira] [Updated] (ARROW-18141) [C++] Alignment not enforced; undefined behavior

2022-10-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-18141: --- Labels: pull-request-available (was: ) > [C++] Alignment not enforced; undefined behavior

[jira] [Commented] (ARROW-17608) [JS] Implement C Data Interface

2022-10-24 Thread Dominik Moritz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623263#comment-17623263 ] Dominik Moritz commented on ARROW-17608: That's fantastic. I very much look forward to continued

[jira] [Updated] (ARROW-18129) get_include() gives wrong directory

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18129: -- Priority: Critical (was: Minor) > get_include() gives wrong directory >

[jira] [Updated] (ARROW-18129) get_include() gives wrong directory

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18129: -- Labels: triaged (was: ) > get_include() gives wrong directory >

[jira] [Updated] (ARROW-18116) [R][Doc] correct paths for the read_parquet examples in cloud storage vignette

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18116: -- Labels: triaged (was: ) > [R][Doc] correct paths for the read_parquet examples in

[jira] [Commented] (ARROW-18114) [R] unify_schemas=FALSE does not improve open_dataset() read times

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623258#comment-17623258 ] Alessandro Molina commented on ARROW-18114: --- [~westonpace] I checked that the R bindings seems

[jira] [Commented] (ARROW-18137) [Python][Docs] Allow passing no aggregations to TableGroupBy.aggregate

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623257#comment-17623257 ] Alessandro Molina commented on ARROW-18137: --- I wonder if we should have a dedicated helper

[jira] [Commented] (ARROW-18141) [C++] Alignment not enforced; undefined behavior

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623251#comment-17623251 ] Antoine Pitrou commented on ARROW-18141: Hmm, this is unfortunate. It's true that we mostly

[jira] [Updated] (ARROW-18114) [R] unify_schemas=FALSE does not improve open_dataset() read times

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18114: -- Description: open_dataset() provides the very helpful optional argument to set

[jira] [Updated] (ARROW-18089) [R] Cannot read_parquet on http URL

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18089: -- Priority: Critical (was: Major) > [R] Cannot read_parquet on http URL >

[jira] [Updated] (ARROW-18089) [R] Cannot read_parquet on http URL

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18089: -- Labels: triaged (was: ) > [R] Cannot read_parquet on http URL >

[jira] [Updated] (ARROW-18123) [Python] Cannot use multi-byte characters in file names in write_table

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18123: -- Summary: [Python] Cannot use multi-byte characters in file names in write_table

[jira] [Commented] (ARROW-18123) [Python] Cannot use multi-byte characters in file names

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623249#comment-17623249 ] Alessandro Molina commented on ARROW-18123: --- Fair point, I got distracted by the ticket

[jira] [Reopened] (ARROW-18123) [Python] Cannot use multi-byte characters in file names

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina reopened ARROW-18123: --- > [Python] Cannot use multi-byte characters in file names >

[jira] [Updated] (ARROW-18123) [Python] Cannot use multi-byte characters in file names in write_table

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18123: -- Priority: Critical (was: Major) > [Python] Cannot use multi-byte characters in file

[jira] [Commented] (ARROW-18099) [Python] Cannot create pandas categorical from table only with nulls

2022-10-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623241#comment-17623241 ] Joris Van den Bossche commented on ARROW-18099: --- The direct cause of this buggy behaviour

[jira] [Commented] (ARROW-18139) [C++][Release] Verification failures on CentOS7

2022-10-24 Thread Raúl Cumplido (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623234#comment-17623234 ] Raúl Cumplido commented on ARROW-18139: --- Based on this email:

[jira] [Commented] (ARROW-18099) [Python] Cannot create pandas categorical from table only with nulls

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623235#comment-17623235 ] Alessandro Molina commented on ARROW-18099: --- [~jorisvandenbossche] what is your thinking on

[jira] [Commented] (ARROW-18139) [C++][Release] Verification failures on CentOS7

2022-10-24 Thread Benson Muite (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623236#comment-17623236 ] Benson Muite commented on ARROW-18139: -- From the mailing list discussion, it seems only the

[jira] [Commented] (ARROW-18090) Dictionary Style array for Keywords or Tags

2022-10-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623229#comment-17623229 ] David Li commented on ARROW-18090: -- I'm not familiar with the Rust APIs, but in Python/C++ it's pretty

[jira] [Commented] (ARROW-18035) [Java] Enable allocator logging in CI

2022-10-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623224#comment-17623224 ] David Li commented on ARROW-18035: -- Basically: if we leak memory in a test, we should print the history

[jira] [Commented] (ARROW-18139) [C++][Release] Verification failures on CentOS7

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623220#comment-17623220 ] Alessandro Molina commented on ARROW-18139: --- [~raulcd] is this something that we should bring

[jira] [Commented] (ARROW-18090) Dictionary Style array for Keywords or Tags

2022-10-24 Thread Sven Cattell (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623219#comment-17623219 ] Sven Cattell commented on ARROW-18090: -- [~lidavidm] I'm not sure how to create that with the Rust

[jira] [Updated] (ARROW-18133) [C++] Update "options" handling for Substrait functions

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18133: -- Labels: triaged (was: ) > [C++] Update "options" handling for Substrait functions >

[jira] [Updated] (ARROW-18134) [C++][CI] Add Substrait integration testing to CI

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18134: -- Labels: triaged (was: ) > [C++][CI] Add Substrait integration testing to CI >

[jira] [Updated] (ARROW-18134) [C++][CI] Add Substrait integration testing to CI

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18134: -- Issue Type: Improvement (was: Bug) > [C++][CI] Add Substrait integration testing to

[jira] [Updated] (ARROW-18100) [C++] Intermittent failure in TestNewScanner.Backpressure

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18100: -- Labels: triaged (was: ) > [C++] Intermittent failure in TestNewScanner.Backpressure

[jira] [Commented] (ARROW-18142) [Java] Compression closes uncompressed buffers

2022-10-24 Thread Michal Zaborec (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623214#comment-17623214 ] Michal Zaborec commented on ARROW-18142: The whole interaction with `ArrowWriter` looks weird.

[jira] [Updated] (ARROW-18140) The metadata info will lost in parquet file schema after writing the parquet file by calling the FileSystemDataset::Write() method.

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18140: -- Component/s: C++ > The metadata info will lost in parquet file schema after writing

[jira] [Updated] (ARROW-18025) [C++][Python] SubstraitSinkConsumer should handle backpressure

2022-10-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-18025: --- Labels: pull-request-available (was: ) > [C++][Python] SubstraitSinkConsumer should handle

[jira] [Updated] (ARROW-18025) [C++][Python] SubstraitSinkConsumer should handle backpressure

2022-10-24 Thread Vibhatha Lakmal Abeykoon (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vibhatha Lakmal Abeykoon updated ARROW-18025: - Summary: [C++][Python] SubstraitSinkConsumer should handle backpressure

[jira] [Commented] (ARROW-17608) [JS] Implement C Data Interface

2022-10-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623196#comment-17623196 ] Joris Van den Bossche commented on ARROW-17608: --- [~kylebarron2] I am not familiar with the

[jira] [Commented] (ARROW-18123) [Python] Cannot use multi-byte characters in file names

2022-10-24 Thread SHIMA Tatsuya (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623177#comment-17623177 ] SHIMA Tatsuya commented on ARROW-18123: --- Thanks for your comment. But does it explain that

[jira] [Closed] (ARROW-18123) [Python] Cannot use multi-byte characters in file names

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina closed ARROW-18123. - Resolution: Not A Bug > [Python] Cannot use multi-byte characters in file names >

[jira] [Commented] (ARROW-18123) [Python] Cannot use multi-byte characters in file names

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623170#comment-17623170 ] Alessandro Molina commented on ARROW-18123: --- The documentation states {code:java} the argument

[jira] [Resolved] (ARROW-18131) [R] Correctly handle .data pronoun in group_by()

2022-10-24 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane resolved ARROW-18131. -- Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14484

[jira] [Commented] (ARROW-6981) [R] Implement HDFS file-system interface in R

2022-10-24 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623158#comment-17623158 ] Nicola Crane commented on ARROW-6981: - More user interest:

[jira] [Updated] (ARROW-18144) [C++] Improve JSONTypeError error message in testing

2022-10-24 Thread Jin Shang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jin Shang updated ARROW-18144: -- Description: If there is a type error, ArrayFromJSON returns an error message like "Invalid: Expected

[jira] [Updated] (ARROW-18144) [C++] Improve JSONTypeError error message in testing

2022-10-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-18144: --- Component/s: C++ > [C++] Improve JSONTypeError error message in testing >

[jira] [Assigned] (ARROW-18144) [C++] Improve JSONTypeError error message in testing

2022-10-24 Thread Jin Shang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jin Shang reassigned ARROW-18144: - Assignee: Jin Shang > [C++] Improve JSONTypeError error message in testing >

[jira] [Updated] (ARROW-18144) [C++] Improve JSONTypeError error message in testing

2022-10-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-18144: --- Labels: pull-request-available (was: ) > [C++] Improve JSONTypeError error message in

[jira] [Created] (ARROW-18144) [C++] Improve JSONTypeError error message in testing

2022-10-24 Thread Jin Shang (Jira)
Jin Shang created ARROW-18144: - Summary: [C++] Improve JSONTypeError error message in testing Key: ARROW-18144 URL: https://issues.apache.org/jira/browse/ARROW-18144 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-18143) [Java] Allow to set Compression Codec in Arrow Writer

2022-10-24 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alenka Frim updated ARROW-18143: Summary: [Java] Allow to set Compression Codec in Arrow Writer (was: Allow to set Compression

[jira] [Created] (ARROW-18143) Allow to set Compression Codec in Arrow Writer

2022-10-24 Thread Michal Zaborec (Jira)
Michal Zaborec created ARROW-18143: -- Summary: Allow to set Compression Codec in Arrow Writer Key: ARROW-18143 URL: https://issues.apache.org/jira/browse/ARROW-18143 Project: Apache Arrow

[jira] [Updated] (ARROW-18123) [Python] Cannot use multi-byte characters in file names

2022-10-24 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-18123: -- Issue Type: Bug (was: Improvement) > [Python] Cannot use multi-byte characters in
