[jira] [Updated] (ARROW-14204) [C++] Fails to compile Arrow without RE2 due to missing ifdef guard

2021-10-01 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eduardo Ponce updated ARROW-14204: -- Description: [*RegexSubstringMatcher* is available only when RE2 is enabled as it is guarded

[jira] [Commented] (ARROW-14204) [C++] Fails to compile Arrow without RE2 due to missing ifdef guard

2021-10-01 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423467#comment-17423467 ] Eduardo Ponce commented on ARROW-14204: --- Should we consider this bug as motivation to have at

[jira] [Created] (ARROW-14204) [C++] Fails to compile Arrow without RE2 due to missing ifdef guard

2021-10-01 Thread Eduardo Ponce (Jira)
Eduardo Ponce created ARROW-14204: - Summary: [C++] Fails to compile Arrow without RE2 due to missing ifdef guard Key: ARROW-14204 URL: https://issues.apache.org/jira/browse/ARROW-14204 Project:

[jira] [Commented] (ARROW-14188) link error on ubuntu

2021-10-01 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423464#comment-17423464 ] Kouhei Sutou commented on ARROW-14188: -- Thanks but could you show all log? > link error on ubuntu

[jira] [Updated] (ARROW-14180) [Packaging] Add support for AlmaLinux 8

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14180: --- Labels: pull-request-available (was: ) > [Packaging] Add support for AlmaLinux 8 >

[jira] [Updated] (ARROW-14180) [Packaging] Add support for AlmaLinux 8

2021-10-01 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-14180: - Fix Version/s: (was: 7.0.0) > [Packaging] Add support for AlmaLinux 8 >

[jira] [Updated] (ARROW-14180) [Packaging] Add support for AlmaLinux 8

2021-10-01 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-14180: - Fix Version/s: 6.0.0 > [Packaging] Add support for AlmaLinux 8 >

[jira] [Created] (ARROW-14203) [C++] Fix description of ExecBatch.length for Scalars in aggregate kernels

2021-10-01 Thread Eduardo Ponce (Jira)
Eduardo Ponce created ARROW-14203: - Summary: [C++] Fix description of ExecBatch.length for Scalars in aggregate kernels Key: ARROW-14203 URL: https://issues.apache.org/jira/browse/ARROW-14203

[jira] [Comment Edited] (ARROW-13879) [C++] Mixed support for binary types in regex functions

2021-10-01 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423460#comment-17423460 ] Eduardo Ponce edited comment on ARROW-13879 at 10/2/21, 4:03 AM: - Well,

[jira] [Updated] (ARROW-13879) [C++] Mixed support for binary types in regex functions

2021-10-01 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eduardo Ponce updated ARROW-13879: -- Description: The functions count_substring, count_substring_regex, find_substring, and

[jira] [Commented] (ARROW-13879) [C++] Mixed support for binary types in regex functions

2021-10-01 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423460#comment-17423460 ] Eduardo Ponce commented on ARROW-13879: --- Well, the issue is that `string_view` is used to

[jira] [Updated] (ARROW-14192) [C++][Dataset] Backpressure broken on ordered scans

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14192: --- Labels: pull-request-available query-engine (was: query-engine) > [C++][Dataset]

[jira] [Commented] (ARROW-13879) [C++] Mixed support for binary types in regex functions

2021-10-01 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423456#comment-17423456 ] David Li commented on ARROW-13879: -- std::string can contain null. Some place is constructing it from a

[jira] [Commented] (ARROW-13887) [R] Capture error produced when reading in CSV file with headers and using a schema, and add suggestion

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423445#comment-17423445 ] Weston Pace commented on ARROW-13887: - I'm happy with "as well as". If someone takes this JIRA and

[jira] [Commented] (ARROW-13887) [R] Capture error produced when reading in CSV file with headers and using a schema, and add suggestion

2021-10-01 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423443#comment-17423443 ] Nicola Crane commented on ARROW-13887: -- [~westonpace] How about "as well as" rather than "instead"?

[jira] [Commented] (ARROW-14014) FlightClient.ClientStreamListener not notified on error when parsing invalid trailers

2021-10-01 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423442#comment-17423442 ] Bryan Cutler commented on ARROW-14014: -- I've run into this a couple times [~manudebouc], but can't

[jira] [Commented] (ARROW-14200) [R] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423429#comment-17423429 ] Rok Mihevc commented on ARROW-14200: Yeah, what Weston said :). The casting happens on L717 and

[jira] [Updated] (ARROW-14202) [C++] A more RAM-efficient top-k sink node

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-14202: --- Summary: [C++] A more RAM-efficient top-k sink node (was: A more RAM-efficient top-k sink

[jira] [Commented] (ARROW-14200) [R] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423428#comment-17423428 ] Jonathan Keane commented on ARROW-14200: Thanks for digging (I hadn't had a chance to yet), I've

[jira] [Updated] (ARROW-14200) [R] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-14200: --- Component/s: (was: C++) > [R] strftime on a date should not use or be confused by

[jira] [Updated] (ARROW-14200) [R] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-14200: --- Summary: [R] strftime on a date should not use or be confused by timezones (was: [R] [C++]

[jira] [Commented] (ARROW-14200) [R] [C++] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423421#comment-17423421 ] Weston Pace commented on ARROW-14200: - I'm pretty sure this is in the R bindings. It is working

[jira] [Created] (ARROW-14202) A more RAM-efficient top-k sink node

2021-10-01 Thread Alexander Ocsa (Jira)
Alexander Ocsa created ARROW-14202: -- Summary: A more RAM-efficient top-k sink node Key: ARROW-14202 URL: https://issues.apache.org/jira/browse/ARROW-14202 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-14201) RAM-efficient topk sink node

2021-10-01 Thread Alexander Ocsa (Jira)
Alexander Ocsa created ARROW-14201: -- Summary: RAM-efficient topk sink node Key: ARROW-14201 URL: https://issues.apache.org/jira/browse/ARROW-14201 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-14200) [R] [C++] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-14200: --- Issue Type: Bug (was: New Feature) > [R] [C++] strftime on a date should not use or be

[jira] [Created] (ARROW-14200) [R] [C++] strftime on a date should not use or be confused by timezones

2021-10-01 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-14200: -- Summary: [R] [C++] strftime on a date should not use or be confused by timezones Key: ARROW-14200 URL: https://issues.apache.org/jira/browse/ARROW-14200 Project:

[jira] [Updated] (ARROW-13751) [Doc][Cookbook] Searching for values matching a predicate in Arrays - Python

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13751: --- Labels: github-pullrequest pull-request-available (was: github-pullrequest) >

[jira] [Updated] (ARROW-13732) [Doc][Cookbook] Manipulating and analyze Arrow data with dplyr verbs - R

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13732: --- Labels: pull-request-available (was: ) > [Doc][Cookbook] Manipulating and analyze Arrow

[jira] [Commented] (ARROW-13887) [R] Capture error produced when reading in CSV file with headers and using a schema, and add suggestion

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13887?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423401#comment-17423401 ] Weston Pace commented on ARROW-13887: - Can we fix the error message in C++ instead? The criteria

[jira] [Closed] (ARROW-13722) [Doc][Cookbook] Specifying Schemas - R

2021-10-01 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane closed ARROW-13722. Resolution: Fixed > [Doc][Cookbook] Specifying Schemas - R >

[jira] [Updated] (ARROW-13893) [R] Make head/tail lazy on datasets and queries

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-13893: Labels: query-engine (was: ) > [R] Make head/tail lazy on datasets and queries >

[jira] [Updated] (ARROW-14063) [R] open_dataset() does not work on CSVs without header rows

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14063: Fix Version/s: 6.0.0 > [R] open_dataset() does not work on CSVs without header rows >

[jira] [Updated] (ARROW-14198) [Java] Upgrade Netty and gRPC dependencies

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14198: --- Labels: pull-request-available (was: ) > [Java] Upgrade Netty and gRPC dependencies >

[jira] [Created] (ARROW-14199) [R] bindings for format where possible

2021-10-01 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-14199: -- Summary: [R] bindings for format where possible Key: ARROW-14199 URL: https://issues.apache.org/jira/browse/ARROW-14199 Project: Apache Arrow Issue

[jira] [Created] (ARROW-14198) [Java] Upgrade Netty and gRPC dependencies

2021-10-01 Thread Bryan Cutler (Jira)
Bryan Cutler created ARROW-14198: Summary: [Java] Upgrade Netty and gRPC dependencies Key: ARROW-14198 URL: https://issues.apache.org/jira/browse/ARROW-14198 Project: Apache Arrow Issue

[jira] [Resolved] (ARROW-9647) [Java] Cannot install arrow-memory 1.0.0 from maven central

2021-10-01 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved ARROW-9647. - Resolution: Not A Problem > [Java] Cannot install arrow-memory 1.0.0 from maven central >

[jira] [Resolved] (ARROW-13973) [C++] Add a SelectKSinkNode

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-13973. - Resolution: Fixed Issue resolved by pull request 11274

[jira] [Updated] (ARROW-12763) [R] Optimize dplyr queries that use head/tail after arrange

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12763: Labels: query-engine (was: ) > [R] Optimize dplyr queries that use head/tail after

[jira] [Updated] (ARROW-12763) [R] Optimize dplyr queries that use head/tail after arrange

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-12763: Fix Version/s: 6.0.0 > [R] Optimize dplyr queries that use head/tail after arrange >

[jira] [Updated] (ARROW-14181) [C++][Compute] Hash Join support for dictionary

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14181: Fix Version/s: (was: 7.0.0) 6.0.0 > [C++][Compute] Hash Join

[jira] [Resolved] (ARROW-13890) [R] Split up test-dataset.R and test-dplyr.R

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-13890. - Resolution: Fixed Issue resolved by pull request 11292

[jira] [Resolved] (ARROW-13634) [R] Update distro() in nixlibs.R to map from "bookworm" to 12

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-13634. - Fix Version/s: 6.0.0 Resolution: Fixed Issue resolved by pull request 10939

[jira] [Resolved] (ARROW-14195) [R] Fix ExecPlan binding annotations

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-14195. - Resolution: Fixed Issue resolved by pull request 11290

[jira] [Updated] (ARROW-14159) [R] Re-allow some multithreading on Windows

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14159: Description: Followup to ARROW-8379, which set use_threads = FALSE on Windows. See

[jira] [Updated] (ARROW-14159) [R] Re-allow some multithreading on Windows

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14159: Description: Followup to ARROW-8379, which set use_threads = FALSE on Windows. See

[jira] [Commented] (ARROW-14184) [C++] allow joins where the keys include new columns on the left

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423338#comment-17423338 ] Jonathan Keane commented on ARROW-14184: Turns out this is part of TPC-H query number 8. I've

[jira] [Commented] (ARROW-14137) PyArrow - “The kernel appears to have died”- Segmentation fault

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1742#comment-1742 ] Joris Van den Bossche commented on ARROW-14137: --- Would you be able to provide a

[jira] [Issue Comment Deleted] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Judah (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Judah updated ARROW-14196: -- Comment: was deleted (was: [~trucnguyenlam] Is this something that could be flipped now?

[jira] [Assigned] (ARROW-14036) [R] Binding for n_distinct() with no grouping

2021-10-01 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook reassigned ARROW-14036: Assignee: Percy Camilo Triveño Aucahuasi (was: Ian Cook) > [R] Binding for n_distinct() with no

[jira] [Commented] (ARROW-14036) [R] Binding for n_distinct() with no grouping

2021-10-01 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423331#comment-17423331 ] Ian Cook commented on ARROW-14036: -- [#11257|https://github.com/apache/arrow/pull/11257] (ARROW-14035)

[jira] [Updated] (ARROW-6607) [Python] Support for set/list columns when converting from Pandas

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6607: - Fix Version/s: (was: 7.0.0) 6.0.0 > [Python] Support for

[jira] [Resolved] (ARROW-6607) [Python] Support for set/list columns when converting from Pandas

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-6607. -- Resolution: Fixed > [Python] Support for set/list columns when converting from

[jira] [Assigned] (ARROW-6607) [Python] Support for set/list columns when converting from Pandas

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-6607: Assignee: Alessandro Molina (was: Krisztian Szucs) > [Python] Support

[jira] [Commented] (ARROW-6607) [Python] Support for set/list columns when converting from Pandas

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423330#comment-17423330 ] Joris Van den Bossche commented on ARROW-6607: -- Indeed, the snippet above works now. This is

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423328#comment-17423328 ] Joris Van den Bossche commented on ARROW-14196: --- > I'm also surprised that a non-existing

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423323#comment-17423323 ] Antoine Pitrou commented on ARROW-14196: Is it really common to read only a list's child column

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423324#comment-17423324 ] Antoine Pitrou commented on ARROW-14196: I'm also surprised that a non-existing column name

[jira] [Comment Edited] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423310#comment-17423310 ] Joris Van den Bossche edited comment on ARROW-14196 at 10/1/21, 2:31 PM:

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423310#comment-17423310 ] Joris Van den Bossche commented on ARROW-14196: --- I wrote this with the latest pyarrow

[jira] [Updated] (ARROW-13890) [R] Split up test-dataset.R and test-dplyr.R

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13890: --- Labels: pull-request-available (was: ) > [R] Split up test-dataset.R and test-dplyr.R >

[jira] [Updated] (ARROW-13890) [R] Split up test-dataset.R and test-dplyr.R

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-13890: Summary: [R] Split up test-dataset.R and test-dplyr.R (was: [R] Split up test-dataset.R)

[jira] [Assigned] (ARROW-13890) [R] Split up test-dataset.R

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-13890: --- Assignee: Neal Richardson > [R] Split up test-dataset.R >

[jira] [Commented] (ARROW-14190) [R] Should unify_schemas() allow change of type?

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423308#comment-17423308 ] Joris Van den Bossche commented on ARROW-14190: --- Ah, sorry, my bad. I thought we already

[jira] [Updated] (ARROW-13887) [R] Capture error produced when reading in CSV file with headers and using a schema, and add suggestion

2021-10-01 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-13887: - Fix Version/s: 6.0.0 > [R] Capture error produced when reading in CSV file with headers and

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423307#comment-17423307 ] Antoine Pitrou commented on ARROW-14196: One possible investigation would be to write two

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423302#comment-17423302 ] Joris Van den Bossche commented on ARROW-14196: --- Some relevant quotes from the

[jira] [Commented] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Judah (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423299#comment-17423299 ] Judah commented on ARROW-14196: --- [~trucnguyenlam] Is this something that could be flipped now?

[jira] [Commented] (ARROW-14197) [C++] Hashjoin + datasets hanging

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14197?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423296#comment-17423296 ] Jonathan Keane commented on ARROW-14197: I've attached a sample of the R process while it's hung

[jira] [Updated] (ARROW-14197) [C++] Hashjoin + datasets hanging

2021-10-01 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-14197: --- Attachment: sample-while-hung.out.txt > [C++] Hashjoin + datasets hanging >

[jira] [Created] (ARROW-14197) [C++] Hashjoin + datasets hanging

2021-10-01 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-14197: -- Summary: [C++] Hashjoin + datasets hanging Key: ARROW-14197 URL: https://issues.apache.org/jira/browse/ARROW-14197 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2021-10-01 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14196: - Summary: [C++][Parquet] Default to compliant nested types in Parquet writer Key: ARROW-14196 URL: https://issues.apache.org/jira/browse/ARROW-14196

[jira] [Updated] (ARROW-14195) [R] Fix ExecPlan binding annotations

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14195: --- Labels: pull-request-available (was: ) > [R] Fix ExecPlan binding annotations >

[jira] [Created] (ARROW-14195) [R] Fix ExecPlan binding annotations

2021-10-01 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-14195: --- Summary: [R] Fix ExecPlan binding annotations Key: ARROW-14195 URL: https://issues.apache.org/jira/browse/ARROW-14195 Project: Apache Arrow Issue

[jira] [Closed] (ARROW-14190) [R] Should unify_schemas() allow change of type?

2021-10-01 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane closed ARROW-14190. Resolution: Not A Problem > [R] Should unify_schemas() allow change of type? >

[jira] [Commented] (ARROW-14190) [R] Should unify_schemas() allow change of type?

2021-10-01 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423280#comment-17423280 ] Nicola Crane commented on ARROW-14190: -- Ah, I misread the code, thanks for explaining that [~npr]!

[jira] [Commented] (ARROW-14190) [R] Should unify_schemas() allow change of type?

2021-10-01 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423277#comment-17423277 ] Neal Richardson commented on ARROW-14190: - open_dataset isn't (by default) trying to unify

[jira] [Updated] (ARROW-13887) [R] Capture error produced when reading in CSV file with headers and using a schema, and add suggestion

2021-10-01 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-13887: - Description: When reading in a CSV with headers, and also using a schema, we get an error as

[jira] [Updated] (ARROW-13472) [R] Remove .engine = "duckdb" argument

2021-10-01 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13472: -- Fix Version/s: (was: 7.0.0) 6.0.0 > [R] Remove .engine

[jira] [Resolved] (ARROW-13727) [Doc][Cookbook] Appending Tables to an existing Table - Python

2021-10-01 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina resolved ARROW-13727. --- Resolution: Fixed > [Doc][Cookbook] Appending Tables to an existing Table - Python

[jira] [Resolved] (ARROW-13725) [Doc][Cookbook] Combining and Harmonizing Schemas - Python

2021-10-01 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina resolved ARROW-13725. --- Resolution: Fixed > [Doc][Cookbook] Combining and Harmonizing Schemas - Python >

[jira] [Updated] (ARROW-14188) link error on ubuntu

2021-10-01 Thread Amir Ghamarian (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amir Ghamarian updated ARROW-14188: --- Description: I used vcpkg to install arrow versions 4 and 5, trying to build my code that

[jira] [Assigned] (ARROW-5530) [C++] Add options to ValueCount/Unique/DictEncode kernel to toggle null behavior

2021-10-01 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc reassigned ARROW-5530: - Assignee: (was: Rok Mihevc) > [C++] Add options to ValueCount/Unique/DictEncode kernel to

[jira] [Updated] (ARROW-14188) link error on ubuntu

2021-10-01 Thread Amir Ghamarian (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amir Ghamarian updated ARROW-14188: --- Description: I used vcpkg to install arrow versions 4 and 5, trying to build my code that

[jira] [Updated] (ARROW-14188) link error on ubuntu

2021-10-01 Thread Amir Ghamarian (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Amir Ghamarian updated ARROW-14188: --- Description: I used vcpkg to install arrow versions 4 and 5, trying to build my code that

[jira] [Updated] (ARROW-14194) [Docs] Improve vertical spacing in the sphinx API docs

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14194: --- Labels: pull-request-available (was: ) > [Docs] Improve vertical spacing in the sphinx API

[jira] [Created] (ARROW-14194) [Docs] Improve vertical spacing in the sphinx API docs

2021-10-01 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14194: - Summary: [Docs] Improve vertical spacing in the sphinx API docs Key: ARROW-14194 URL: https://issues.apache.org/jira/browse/ARROW-14194 Project:

[jira] [Resolved] (ARROW-13685) [C++] Cannot write dataset to S3FileSystem if bucket already exists

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-13685. - Fix Version/s: 6.0.0 Resolution: Fixed Issue resolved by pull request 11136

[jira] [Updated] (ARROW-14191) [C++][Dataset] Dataset writes should respect backpressure

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14191: Labels: pull-request-available query-engine (was: kernel pull-request-available query-engine) >

[jira] [Updated] (ARROW-14191) [C++][Dataset] Dataset writes should respect backpressure

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14191: --- Labels: kernel pull-request-available query-engine (was: kernel query-engine) >

[jira] [Created] (ARROW-14193) [C++][Gandiva] Implement INSTR function

2021-10-01 Thread Augusto Alves Silva (Jira)
Augusto Alves Silva created ARROW-14193: --- Summary: [C++][Gandiva] Implement INSTR function Key: ARROW-14193 URL: https://issues.apache.org/jira/browse/ARROW-14193 Project: Apache Arrow

[jira] [Commented] (ARROW-13611) [C++] Scanning datasets does not enforce back pressure

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423248#comment-17423248 ] Weston Pace commented on ARROW-13611: - That is actually what I've been working on today (shame on me

[jira] [Assigned] (ARROW-13611) [C++] Scanning datasets does not enforce back pressure

2021-10-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-13611: --- Assignee: Weston Pace > [C++] Scanning datasets does not enforce back pressure >

[jira] [Updated] (ARROW-13611) [C++] Scanning datasets does not enforce back pressure

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13611: --- Labels: pull-request-available query-engine (was: query-engine) > [C++] Scanning datasets

[jira] [Created] (ARROW-14192) [C++][Dataset] Backpressure broken on ordered scans

2021-10-01 Thread Weston Pace (Jira)
Weston Pace created ARROW-14192: --- Summary: [C++][Dataset] Backpressure broken on ordered scans Key: ARROW-14192 URL: https://issues.apache.org/jira/browse/ARROW-14192 Project: Apache Arrow

[jira] [Created] (ARROW-14191) [C++][Dataset] Dataset writes should respect backpressure

2021-10-01 Thread Weston Pace (Jira)
Weston Pace created ARROW-14191: --- Summary: [C++][Dataset] Dataset writes should respect backpressure Key: ARROW-14191 URL: https://issues.apache.org/jira/browse/ARROW-14191 Project: Apache Arrow

[jira] [Updated] (ARROW-14187) [Python] File reading regression

2021-10-01 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-14187: -- Component/s: Python > [Python] File reading regression >

[jira] [Assigned] (ARROW-14187) [Python] File reading regression

2021-10-01 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina reassigned ARROW-14187: - Assignee: Alessandro Molina > [Python] File reading regression >

[jira] [Updated] (ARROW-14187) [Python] File reading regression

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14187: --- Labels: pull-request-available (was: ) > [Python] File reading regression >

[jira] [Created] (ARROW-14190) [R] Should unify_schemas() allow change of type?

2021-10-01 Thread Nicola Crane (Jira)
Nicola Crane created ARROW-14190: Summary: [R] Should unify_schemas() allow change of type? Key: ARROW-14190 URL: https://issues.apache.org/jira/browse/ARROW-14190 Project: Apache Arrow

[jira] [Updated] (ARROW-14189) [Docs] Add version dropdown to the sphinx docs

2021-10-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14189: --- Labels: pull-request-available (was: ) > [Docs] Add version dropdown to the sphinx docs >

  1   2   3   4   >