[jira] [Commented] (ARROW-15081) [R][C++] Arrow crashes (OOM) on R client with large remote parquet files

2022-05-03 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531476#comment-17531476 ] Weston Pace commented on ARROW-15081: - One mystery solved, a few more remained, I ma

[jira] [Updated] (ARROW-16452) [R] After dataset scan, some RAM is left consumed until a garbage collection pass

2022-05-03 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-16452: Description: This might be "not a bug" but I wonder if we can do something better here. When I c

[jira] [Created] (ARROW-16452) [R] After dataset scan, some RAM is left consumed until a garbage collection pass

2022-05-03 Thread Weston Pace (Jira)
Weston Pace created ARROW-16452: --- Summary: [R] After dataset scan, some RAM is left consumed until a garbage collection pass Key: ARROW-16452 URL: https://issues.apache.org/jira/browse/ARROW-16452 Proje

[jira] [Assigned] (ARROW-5409) [C++] Improvement for IsIn Kernel when right array is small

2022-05-03 Thread Alvin Chunga Mamani (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alvin Chunga Mamani reassigned ARROW-5409: -- Assignee: Alvin Chunga Mamani > [C++] Improvement for IsIn Kernel when right a

[jira] [Created] (ARROW-16451) [C++] ParquetFileFragment caches parquet file metadata and there is no way to disable this

2022-05-03 Thread Weston Pace (Jira)
Weston Pace created ARROW-16451: --- Summary: [C++] ParquetFileFragment caches parquet file metadata and there is no way to disable this Key: ARROW-16451 URL: https://issues.apache.org/jira/browse/ARROW-16451

[jira] [Updated] (ARROW-16243) [C++][Python] Remove Parquet ReadSchemaField method

2022-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16243: --- Labels: good-first-issue pull-request-available (was: good-first-issue) > [C++][Python] Rem

[jira] [Assigned] (ARROW-16423) [R] arrow/dplyr: simple join and collect crashes session

2022-05-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-16423: -- Assignee: Will Jones > [R] arrow/dplyr: simple join and collect crashes session > ---

[jira] [Closed] (ARROW-15730) [R] Memory usage in R blows up

2022-05-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones closed ARROW-15730. -- Resolution: Cannot Reproduce > [R] Memory usage in R blows up > -- > >

[jira] [Updated] (ARROW-10739) [Python] Pickling a sliced array serializes all the buffers

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10739: -- Priority: Critical (was: Major) > [Python] Pickling a sliced array serializes

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531411#comment-17531411 ] Weston Pace commented on ARROW-16421: - Right now the destruction of the record batch

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531410#comment-17531410 ] Weston Pace commented on ARROW-16421: - It's rather spread out and not at all obvious

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531409#comment-17531409 ] Will Jones commented on ARROW-16421: {quote}Windows is notoriously stubborn about de

[jira] [Updated] (ARROW-16450) [Go][Docs] Include error handling in csv examples

2022-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16450: --- Labels: pull-request-available (was: ) > [Go][Docs] Include error handling in csv examples

[jira] [Updated] (ARROW-16450) [Go][Docs] Include error handling in csv examples

2022-05-03 Thread Mark Wolfe (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Wolfe updated ARROW-16450: --- Summary: [Go][Docs] Include error handling in csv examples (was: [Go][Docs] Include error handling

[jira] [Created] (ARROW-16450) [Go][Docs] Include error handling in csv reader/writer examples

2022-05-03 Thread Mark Wolfe (Jira)
Mark Wolfe created ARROW-16450: -- Summary: [Go][Docs] Include error handling in csv reader/writer examples Key: ARROW-16450 URL: https://issues.apache.org/jira/browse/ARROW-16450 Project: Apache Arrow

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531402#comment-17531402 ] Will Jones commented on ARROW-16421: It seems like this issue isn't resolved by {{rm

[jira] [Commented] (ARROW-10739) [Python] Pickling a sliced array serializes all the buffers

2022-05-03 Thread Jim Crist (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531399#comment-17531399 ] Jim Crist commented on ARROW-10739: --- We're running into this in Dask right now when at

[jira] [Resolved] (ARROW-16276) [R] Release News

2022-05-03 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-16276. - Fix Version/s: 8.0.0 Resolution: Fixed Issue resolved by pull request 13005 [http

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531364#comment-17531364 ] Will Jones commented on ARROW-16421: Yeah this was inspired by an issue I was told a

[jira] [Resolved] (ARROW-15959) [Java][Docs] Fix IntelliJ IDE setup instructions

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-15959. -- Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13017 [https://github.com

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531338#comment-17531338 ] Weston Pace commented on ARROW-16421: - The scanner does need to close its files. It

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531311#comment-17531311 ] Matthew Topol commented on ARROW-16441: --- Fair enough. I've put up the PR and linke

[jira] [Updated] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16441: --- Labels: pull-request-available (was: ) > [Release][Integration] Integration test primitive_

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531309#comment-17531309 ] Krisztian Szucs commented on ARROW-16441: - Just merge to master then I can cherr

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531306#comment-17531306 ] Matthew Topol commented on ARROW-16441: --- [~lidavidm] I can give that a try. You're

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531304#comment-17531304 ] David Li commented on ARROW-16441: -- Judging from https://grpc.io/docs/languages/go/bas

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531303#comment-17531303 ] David Li commented on ARROW-16441: -- [~zeroshade] possibly a race condition - does the G

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531302#comment-17531302 ] Matthew Topol commented on ARROW-16441: --- [~raulcd] [~lidavidm] Judging from the er

[jira] [Updated] (ARROW-16434) [R] [CI] Nightly crossbow job for windows + devdocs

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-16434: Fix Version/s: 9.0.0 (was: 8.0.0) > [R] [CI] Nightly crossbow job f

[jira] [Updated] (ARROW-15639) [C++][Python] UDF Scalar Function Implementation

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-15639: Fix Version/s: 9.0.0 (was: 8.0.0) > [C++][Python] UDF Scalar Functi

[jira] [Updated] (ARROW-16035) [Java] Arrow to JDBC ArrowVectorIterator with does not terminate with empty result set

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-16035: Fix Version/s: 8.0.0 (was: 9.0.0) > [Java] Arrow to JDBC ArrowVecto

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531298#comment-17531298 ] Krisztian Szucs commented on ARROW-16441: - Thanks [~zeroshade] for investigating

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531296#comment-17531296 ] Matthew Topol commented on ARROW-16441: --- [~kszucs] I'm currently unable to reprodu

[jira] [Resolved] (ARROW-16413) [Python] FileFormat::GetReaderAsync hangs with an fsspec filesystem

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-16413. - Resolution: Fixed Issue resolved by pull request 13033 [https://github.com/apache/arrow/

[jira] [Commented] (ARROW-16449) [Java] java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531277#comment-17531277 ] Daniel Glöckner commented on ARROW-16449: - Thanks [~lidavidm] and [~raulcd]. I w

[jira] [Updated] (ARROW-16267) [Java] Support Java 18

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-16267: -- Component/s: Java > [Java] Support Java 18 > -- > > Key: A

[jira] [Commented] (ARROW-16449) [Java] java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531271#comment-17531271 ] Raúl Cumplido commented on ARROW-16449: --- Hi Daniel, Thanks for your report. We ar

[jira] [Updated] (ARROW-16449) [Java] java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-16449: - Summary: [Java] java.lang.reflect.InaccessibleObjectException on Java 18 (was: java.lang.reflect.Inacce

[jira] [Commented] (ARROW-16449) [Java] java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531270#comment-17531270 ] David Li commented on ARROW-16449: -- CC [~dsusanibara] > [Java] java.lang.reflect.Inacc

[jira] [Commented] (ARROW-16449) java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531269#comment-17531269 ] David Li commented on ARROW-16449: -- Can you test with Arrow 8.0.0 once it releases (in

[jira] [Updated] (ARROW-16449) java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Glöckner updated ARROW-16449: Description: Getting the following stack trace when running on Java 18. {{BaseAllocator}}

[jira] [Commented] (ARROW-16399) [R][C++] datetime locale support on Windows MINGW / R

2022-05-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531255#comment-17531255 ] Will Jones commented on ARROW-16399: Good question. IIRC, this will error in C++: {

[jira] [Updated] (ARROW-16449) java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Glöckner updated ARROW-16449: Description: Getting the following stack trace when running on Java 18 {code:java} Caused

[jira] [Created] (ARROW-16449) java.lang.reflect.InaccessibleObjectException on Java 18

2022-05-03 Thread Jira
Daniel Glöckner created ARROW-16449: --- Summary: java.lang.reflect.InaccessibleObjectException on Java 18 Key: ARROW-16449 URL: https://issues.apache.org/jira/browse/ARROW-16449 Project: Apache Arrow

[jira] [Comment Edited] (ARROW-16438) [Python] pyarrow dataset API fails to read s3 directory

2022-05-03 Thread Prem Sagar Gali (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531248#comment-17531248 ] Prem Sagar Gali edited comment on ARROW-16438 at 5/3/22 3:17 PM: -

[jira] [Commented] (ARROW-16438) [Python] pyarrow dataset API fails to read s3 directory

2022-05-03 Thread Prem Sagar Gali (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531248#comment-17531248 ] Prem Sagar Gali commented on ARROW-16438: - [~jorisvandenbossche] Yup, that worke

[jira] [Updated] (ARROW-16448) [Archery][CI] Refactor EmailReport to be a JinjaReport and move Report to be a utility class

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-16448: -- Parent: ARROW-16333 Issue Type: Sub-task (was: Improvement) > [Archery][CI] Refactor

[jira] [Updated] (ARROW-16448) [Archery][CI] Refactor EmailReport to be a JinjaReport and move Report to be a utility class

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raúl Cumplido updated ARROW-16448: -- Description: This ticket is a follow up from the conversation on [https://github.com/apache/a

[jira] [Created] (ARROW-16448) [Archery][CI] Refactor EmailReport to be a JinjaReport and move Report to be a utility class

2022-05-03 Thread Jira
Raúl Cumplido created ARROW-16448: - Summary: [Archery][CI] Refactor EmailReport to be a JinjaReport and move Report to be a utility class Key: ARROW-16448 URL: https://issues.apache.org/jira/browse/ARROW-16448

[jira] [Created] (ARROW-16447) [R] Integer overflow causes error - (in dplyr we get an NA with a warning)

2022-05-03 Thread Nicola Crane (Jira)
Nicola Crane created ARROW-16447: Summary: [R] Integer overflow causes error - (in dplyr we get an NA with a warning) Key: ARROW-16447 URL: https://issues.apache.org/jira/browse/ARROW-16447 Project: A

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531239#comment-17531239 ] Raúl Cumplido commented on ARROW-16441: --- I was trying to add this minor logging im

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531234#comment-17531234 ] Matthew Topol commented on ARROW-16441: --- [~kszucs] That looks like the Go consumer

[jira] [Commented] (ARROW-15702) [Docs][Java] Dataset Javadocs are not being published

2022-05-03 Thread Todd Farmer (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531229#comment-17531229 ] Todd Farmer commented on ARROW-15702: - I naively attempted to add dataset to the li

[jira] [Commented] (ARROW-16190) [CI][R] Implement CI on Apple M1 for R

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531215#comment-17531215 ] Dewey Dunnington commented on ARROW-16190: -- I know this is probably on the list

[jira] [Updated] (ARROW-16445) [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article

2022-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16445: --- Labels: pull-request-available (was: ) > [R] [Doc] Add a short summary for the Installing t

[jira] [Commented] (ARROW-16310) [R] test-fedora-r-clang-sanitizer job fails - possible tzdb installation issue

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531213#comment-17531213 ] Dewey Dunnington commented on ARROW-16310: -- I can confirm that {{docker run --r

[jira] [Created] (ARROW-16446) [R] Update parse_date_time to accept a string with no separators

2022-05-03 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-16446: Summary: [R] Update parse_date_time to accept a string with no separators Key: ARROW-16446 URL: https://issues.apache.org/jira/browse/ARROW-16446

[jira] [Assigned] (ARROW-16445) [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dragoș Moldovan-Grünfeld reassigned ARROW-16445: Assignee: Dragoș Moldovan-Grünfeld > [R] [Doc] Add a short summar

[jira] [Created] (ARROW-16445) [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article

2022-05-03 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-16445: Summary: [R] [Doc] Add a short summary for the Installing the Arrow package on Linux article Key: ARROW-16445 URL: https://issues.apache.org/jira/browse/AR

[jira] [Commented] (ARROW-16310) [R] test-fedora-r-clang-sanitizer job fails - possible tzdb installation issue

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531208#comment-17531208 ] Dewey Dunnington commented on ARROW-16310: -- It's now floating back to me that t

[jira] [Created] (ARROW-16444) [R] Implement user-defined scalar functions in R bindings

2022-05-03 Thread Dewey Dunnington (Jira)
Dewey Dunnington created ARROW-16444: Summary: [R] Implement user-defined scalar functions in R bindings Key: ARROW-16444 URL: https://issues.apache.org/jira/browse/ARROW-16444 Project: Apache Arro

[jira] [Closed] (ARROW-16330) [Release][C++] Windows source verification compiles on a single thread

2022-05-03 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-16330. -- Fix Version/s: (was: 9.0.0) Assignee: (was: Jacob Wujciak-Jens) Resolut

[jira] [Commented] (ARROW-16437) [Python] Mocking S3 tests with moto not currently feasible

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531206#comment-17531206 ] Joris Van den Bossche commented on ARROW-16437: --- I don't think you can use

[jira] [Commented] (ARROW-16330) [Release][C++] Windows source verification compiles on a single thread

2022-05-03 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531204#comment-17531204 ] Antoine Pitrou commented on ARROW-16330: Oops, I'm confused. For some reason my

[jira] [Assigned] (ARROW-14575) [R] Allow functions with {{pkg::}} prefixes

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dewey Dunnington reassigned ARROW-14575: Assignee: Dragoș Moldovan-Grünfeld (was: Dewey Dunnington) > [R] Allow functions

[jira] [Assigned] (ARROW-14071) [R] Try to arrow_eval user-defined functions

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dewey Dunnington reassigned ARROW-14071: Assignee: Dragoș Moldovan-Grünfeld (was: Dewey Dunnington) > [R] Try to arrow_ev

[jira] [Updated] (ARROW-16372) [Python] Tests failing on s390x because they use Parquet

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16372: --- Component/s: Continuous Integration > [Python] Tests failing on s390x because they u

[jira] [Assigned] (ARROW-15804) [R] Update as.Date() to support several tryFormats

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dragoș Moldovan-Grünfeld reassigned ARROW-15804: Assignee: Dragoș Moldovan-Grünfeld > [R] Update as.Date() to supp

[jira] [Updated] (ARROW-16253) [R] Helper function for casting from float to duration via int64()

2022-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16253: --- Labels: pull-request-available (was: ) > [R] Helper function for casting from float to dura

[jira] [Updated] (ARROW-16335) [Release][C++] Windows source verification runs C++ tests on a single thread

2022-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-16335: --- Labels: pull-request-available (was: ) > [Release][C++] Windows source verification runs C+

[jira] [Commented] (ARROW-16310) [R] test-fedora-r-clang-sanitizer job fails - possible tzdb installation issue

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16310?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531178#comment-17531178 ] Dewey Dunnington commented on ARROW-16310: -- Hmm...it looks like the {{-I../inst

[jira] [Assigned] (ARROW-16253) [R] Helper function for casting from float to duration via int64()

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dragoș Moldovan-Grünfeld reassigned ARROW-16253: Assignee: Dragoș Moldovan-Grünfeld > [R] Helper function for cast

[jira] [Commented] (ARROW-16253) [R] Helper function for casting from float to duration via int64()

2022-05-03 Thread Jira

[jira] [Commented] (ARROW-16318) [R]Timezone is not supported by to_duckdb()

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531174#comment-17531174 ] Jacob Wujciak-Jens commented on ARROW-16318: cc [~dragosmg]  > [R]Timezone

[jira] [Updated] (ARROW-16318) [R]Timezone is not supported by to_duckdb()

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16318: --- Summary: [R]Timezone is not supported by to_duckdb() (was: Timezone is not supporte

[jira] [Updated] (ARROW-16320) [R] Dataset re-partitioning consumes considerable amount of memory

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16320: --- Summary: [R] Dataset re-partitioning consumes considerable amount of memory (was: D

[jira] [Updated] (ARROW-16318) Timezone is not supported by to_duckdb()

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16318: --- Component/s: R > Timezone is not supported by to_duckdb() >

[jira] [Updated] (ARROW-16320) Dataset re-partitioning consumes considerable amount of memory

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16320: --- Component/s: R > Dataset re-partitioning consumes considerable amount of memory > --

[jira] [Commented] (ARROW-16399) [R][C++] datetime locale support on Windows MINGW / R

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531171#comment-17531171 ] Dewey Dunnington commented on ARROW-16399: -- I know there's some of this in the

[jira] [Commented] (ARROW-16421) [R] Permission error on Windows when deleting file in dataset

2022-05-03 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531169#comment-17531169 ] Dewey Dunnington commented on ARROW-16421: -- There doesn't seem to be a {{close_

[jira] [Assigned] (ARROW-16330) [Release][C++] Windows source verification compiles on a single thread

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens reassigned ARROW-16330: -- Assignee: Jacob Wujciak-Jens > [Release][C++] Windows source verification com

[jira] [Assigned] (ARROW-16335) [Release][C++] Windows source verification runs C++ tests on a single thread

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens reassigned ARROW-16335: -- Assignee: Jacob Wujciak-Jens > [Release][C++] Windows source verification run

[jira] [Updated] (ARROW-16348) [Python] ParquetWriter use_compliant_nested_type=True does not preserve ExtensionArray when reading back

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16348: --- Summary: [Python] ParquetWriter use_compliant_nested_type=True does not preserve Ext

[jira] [Updated] (ARROW-16386) Simple example arrow script fails on ubuntu:latest docker container

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16386: --- Component/s: Python > Simple example arrow script fails on ubuntu:latest docker cont

[jira] [Resolved] (ARROW-16434) [R] [CI] Nightly crossbow job for windows + devdocs

2022-05-03 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane resolved ARROW-16434. -- Fix Version/s: 8.0.0 (was: 9.0.0) Resolution: Fixed Issue resolv

[jira] [Resolved] (ARROW-16035) [Java] Arrow to JDBC ArrowVectorIterator with does not terminate with empty result set

2022-05-03 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-16035. -- Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13049 [https://github.com

[jira] [Updated] (ARROW-16391) pd.read_parquet using filters consumes too much memory

2022-05-03 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacob Wujciak-Jens updated ARROW-16391: --- Component/s: Python > pd.read_parquet using filters consumes too much memory > -

[jira] [Resolved] (ARROW-16442) [Python] The fragments for ORC dataset return base Fragment instead of FileFragment

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-16442. - Fix Version/s: 8.0.0 (was: 9.0.0) Resolution: Fixed Issue

[jira] [Assigned] (ARROW-16413) [Python] FileFormat::GetReaderAsync hangs with an fsspec filesystem

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-16413: --- Assignee: Joris Van den Bossche (was: Krisztian Szucs) > [Python] FileFormat::GetR

[jira] [Assigned] (ARROW-16413) [Python] FileFormat::GetReaderAsync hangs with an fsspec filesystem

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-16413: --- Assignee: Krisztian Szucs > [Python] FileFormat::GetReaderAsync hangs with an fsspe

[jira] [Commented] (ARROW-16441) [Release][Integration] Integration test primitive_no_batches Java producing,  Go consuming fails on verify-release-candidate.sh

2022-05-03 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531149#comment-17531149 ] Krisztian Szucs commented on ARROW-16441: - cc [~zeroshade] > [Release][Integrat

[jira] [Commented] (ARROW-16443) [C++][R] strptime fails to parse with %b or %B on Windows

2022-05-03 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-16443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531143#comment-17531143 ] Dragoș Moldovan-Grünfeld commented on ARROW-16443: -- The {{test-dplyr-fu

[jira] [Commented] (ARROW-16443) [C++][R] strptime fails to parse with %b or %B on Windows

2022-05-03 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531140#comment-17531140 ] Rok Mihevc commented on ARROW-16443: This could be a locale issue. We also noticed t

[jira] [Commented] (ARROW-16443) [C++][R] strptime fails to parse with %b or %B on Windows

2022-05-03 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531133#comment-17531133 ] Rok Mihevc commented on ARROW-16443: A relevant c++ test would in [scalar_string_te

[jira] [Created] (ARROW-16443) [C++][R] strptime fails to parse with %b or %B on Windows

2022-05-03 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-16443: Summary: [C++][R] strptime fails to parse with %b or %B on Windows Key: ARROW-16443 URL: https://issues.apache.org/jira/browse/ARROW-16443 Proj

[jira] [Commented] (ARROW-16420) [Python] pq.write_to_dataset always ignores partitioning

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531130#comment-17531130 ] Joris Van den Bossche commented on ARROW-16420: --- Ah, good catch, that's a

[jira] [Updated] (ARROW-16431) [C++][Parquet] Improve error message in append_row_groups() when appending disjoint metadata

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-16431: -- Component/s: C++ Parquet > [C++][Parquet] Improve error messa

[jira] [Commented] (ARROW-16431) [C++][Parquet] Improve error message in append_row_groups() when appending disjoint metadata

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17531128#comment-17531128 ] Joris Van den Bossche commented on ARROW-16431: --- [~multimeric] thanks for

[jira] [Updated] (ARROW-16431) [C++][Parquet] Improve error message in append_row_groups() when appending disjoint metadata

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-16431: -- Summary: [C++][Parquet] Improve error message in append_row_groups() when appe

[jira] [Updated] (ARROW-16438) [Python] pyarrow dataset API fails to read s3 directory

2022-05-03 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-16438: -- Summary: [Python] pyarrow dataset API fails to read s3 directory (was: pyarro

  1   2   >