[jira] [Updated] (ARROW-14895) [C++] Vcpkg install error for abseil on windows when building Arrow C++

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14895: --- Labels: pull-request-available (was: ) > [C++] Vcpkg install error for abseil on windows

[jira] [Updated] (ARROW-14969) [C++][Python] Un-deprecate FileSystem::OpenAppendStream

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14969: --- Labels: good-first-issue pull-request-available (was: good-first-issue) > [C++][Python]

[jira] [Resolved] (ARROW-13376) [C++][Gandiva] Implement FACTORIAL function on Gandiva

2021-12-02 Thread Pindikura Ravindra (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pindikura Ravindra resolved ARROW-13376. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request

[jira] [Commented] (ARROW-14965) [Python][C++] Contention when reading Parquet files with multi-threading

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452707#comment-17452707 ] Weston Pace commented on ARROW-14965: - > It looks like pyarrow._dataset.Scanner.to_reader doesn't

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452716#comment-17452716 ] Weston Pace commented on ARROW-12358: - If "delete_matching" is not creating the base directory or

[jira] [Commented] (ARROW-14965) [Python][C++] Contention when reading Parquet files with multi-threading

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452712#comment-17452712 ] Weston Pace commented on ARROW-14965: - Adding a filter (~50% selectivity) didn't seem to have much

[jira] [Comment Edited] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2021-12-02 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17450796#comment-17450796 ] Lance Dacey edited comment on ARROW-12358 at 12/3/21, 3:04 AM: --- I was not

[jira] [Commented] (ARROW-14974) [C++] Dataset scanning, in async mode, is running parquet reads on the CPU thread pool

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452704#comment-17452704 ] Weston Pace commented on ARROW-14974: - Note: The inverse is often true. We will sometimes do some

[jira] [Created] (ARROW-14974) [C++] Dataset scanning, in async mode, is running parquet reads on the CPU thread pool

2021-12-02 Thread Weston Pace (Jira)
Weston Pace created ARROW-14974: --- Summary: [C++] Dataset scanning, in async mode, is running parquet reads on the CPU thread pool Key: ARROW-14974 URL: https://issues.apache.org/jira/browse/ARROW-14974

[jira] [Updated] (ARROW-14965) [Python][C++] Contention when reading Parquet files with multi-threading

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14965: Component/s: C++ > [Python][C++] Contention when reading Parquet files with multi-threading >

[jira] [Updated] (ARROW-14965) [Python][C++] Contention when reading Parquet files with multi-threading

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14965: Summary: [Python][C++] Contention when reading Parquet files with multi-threading (was:

[jira] [Updated] (ARROW-14973) [python][CI] How to build wheels depending on PyArrow that are compatible with multiple PyArrow versions

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14973: Component/s: Continuous Integration Python > [python][CI] How to build wheels

[jira] [Updated] (ARROW-14973) [python][CI]How to build wheels depending on PyArrow that are compatible with multiple PyArrow versions

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14973: Summary: [python][CI]How to build wheels depending on PyArrow that are compatible with multiple

[jira] [Updated] (ARROW-14973) [python][CI] How to build wheels depending on PyArrow that are compatible with multiple PyArrow versions

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14973: Summary: [python][CI] How to build wheels depending on PyArrow that are compatible with multiple

[jira] [Commented] (ARROW-14970) [C++][Compute] Replace ExecNode::InputReceived with ::MakeTask (Part 2)

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452675#comment-17452675 ] Weston Pace commented on ARROW-14970: - We talked about this offline but I'll add it as a comment

[jira] [Commented] (ARROW-14971) [C++] Decide if we implement GcsFileSystem::Move

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452674#comment-17452674 ] Weston Pace commented on ARROW-14971: - As far as I can tell Arrow does not rely on this operation

[jira] [Created] (ARROW-14973) How to build wheels depending on PyArrow that are compatible with multiple PyArrow versions

2021-12-02 Thread Julius (Jira)
Julius created ARROW-14973: -- Summary: How to build wheels depending on PyArrow that are compatible with multiple PyArrow versions Key: ARROW-14973 URL: https://issues.apache.org/jira/browse/ARROW-14973

[jira] [Updated] (ARROW-13579) Expose Create EmptyArray, EmptyRecordBatch and EmptyTable utility functions.

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13579: --- Labels: pull-request-available (was: ) > Expose Create EmptyArray, EmptyRecordBatch and

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2021-12-02 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452649#comment-17452649 ] Lance Dacey commented on ARROW-12358: - Any thoughts on "delete_matching" creating the partition if

[jira] [Commented] (ARROW-6407) [C++] Consolidate thirdparty bundle URLs, version bumping logic, etc

2021-12-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452619#comment-17452619 ] Ian Cook commented on ARROW-6407: - There has been some recent discussion in of moving the bundled

[jira] [Assigned] (ARROW-14971) [C++] Decide if we implement GcsFileSystem::Move

2021-12-02 Thread Carlos O'Ryan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos O'Ryan reassigned ARROW-14971: - Assignee: Carlos O'Ryan > [C++] Decide if we implement GcsFileSystem::Move >

[jira] [Resolved] (ARROW-14931) [Python] csv/orc format strings missing from some dataset docs

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-14931. - Resolution: Fixed Issue resolved by pull request 11814

[jira] [Updated] (ARROW-14972) [Python][Doc] Document automatic partitioning discovery

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14972: --- Labels: pull-request-available (was: ) > [Python][Doc] Document automatic partitioning

[jira] [Created] (ARROW-14972) [Python][Doc] Document automatic partitioning discovery

2021-12-02 Thread Weston Pace (Jira)
Weston Pace created ARROW-14972: --- Summary: [Python][Doc] Document automatic partitioning discovery Key: ARROW-14972 URL: https://issues.apache.org/jira/browse/ARROW-14972 Project: Apache Arrow

[jira] [Created] (ARROW-14971) [C++] Decide if we implement GcsFileSystem::Move

2021-12-02 Thread Carlos O'Ryan (Jira)
Carlos O'Ryan created ARROW-14971: - Summary: [C++] Decide if we implement GcsFileSystem::Move Key: ARROW-14971 URL: https://issues.apache.org/jira/browse/ARROW-14971 Project: Apache Arrow

[jira] [Updated] (ARROW-4975) [C++] Support concatenation of UnionArrays

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4975: -- Labels: good-first-issue good-second-issue pull-request-available (was: good-first-issue

[jira] [Assigned] (ARROW-14917) [C++] Implement GcsFileSystem::CreateDir

2021-12-02 Thread Carlos O'Ryan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Carlos O'Ryan reassigned ARROW-14917: - Assignee: Carlos O'Ryan > [C++] Implement GcsFileSystem::CreateDir >

[jira] [Updated] (ARROW-14917) [C++] Implement GcsFileSystem::CreateDir

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14917: --- Labels: pull-request-available (was: ) > [C++] Implement GcsFileSystem::CreateDir >

[jira] [Commented] (ARROW-14961) [C++] Bump version on Google Benchmark

2021-12-02 Thread Sasha Krassovsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452598#comment-17452598 ] Sasha Krassovsky commented on ARROW-14961: -- benchmark::CreateDenseRange and

[jira] [Comment Edited] (ARROW-2034) [C++] Filesystem implementation for Azure Blob Storage

2021-12-02 Thread Tom Augspurger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452547#comment-17452547 ] Tom Augspurger edited comment on ARROW-2034 at 12/2/21, 8:02 PM: - Does

[jira] [Commented] (ARROW-13035) [C++] Create a compute function returning indices of non-zero values

2021-12-02 Thread Niranda Perera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452584#comment-17452584 ] Niranda Perera commented on ARROW-13035: [~amol-] is this still open? or are you working on

[jira] [Resolved] (ARROW-14779) [C++] Add other common round mode names to RoundMode docs

2021-12-02 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-14779. -- Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11838

[jira] [Comment Edited] (ARROW-14168) [R] Warn only once about arrow function differences

2021-12-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452577#comment-17452577 ] Ian Cook edited comment on ARROW-14168 at 12/2/21, 7:42 PM: [~dragosmg] yes,

[jira] [Commented] (ARROW-14168) [R] Warn only once about arrow function differences

2021-12-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452577#comment-17452577 ] Ian Cook commented on ARROW-14168: -- [~dragosmg] yes, now that {{{}if_else{}}}, {{{}case_when{}}}, and

[jira] [Updated] (ARROW-14168) [R] Warn only once about arrow function differences

2021-12-02 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-14168: - Description: When someone calls median or quantile, we warn them that it is approximate. -When someone

[jira] [Updated] (ARROW-13923) [C++] Improve CSV chunker with SIMD

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13923: --- Labels: pull-request-available (was: ) > [C++] Improve CSV chunker with SIMD >

[jira] [Commented] (ARROW-14965) Contention when reading Parquet files with multi-threading

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14965?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452563#comment-17452563 ] Weston Pace commented on ARROW-14965: - I'm not sure that contention is the right word here. ReadAll

[jira] [Commented] (ARROW-14781) Improved Tooling/Documentation on Constructing Larger than Memory Parquet

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452561#comment-17452561 ] Weston Pace commented on ARROW-14781: - Ah, also ARROW-13703 > Improved Tooling/Documentation on

[jira] [Commented] (ARROW-14781) Improved Tooling/Documentation on Constructing Larger than Memory Parquet

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452560#comment-17452560 ] Weston Pace commented on ARROW-14781: - As part of 7.0.0 I am working on ARROW-14426 / ARROW-14427

[jira] [Commented] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452558#comment-17452558 ] Weston Pace commented on ARROW-14904: - I'm a little bit torn here. Append is definitely something

[jira] [Commented] (ARROW-2034) [C++] Filesystem implementation for Azure Blob Storage

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452557#comment-17452557 ] Antoine Pitrou commented on ARROW-2034: --- We require only C++11 in the codebase. We might add

[jira] [Commented] (ARROW-2034) [C++] Filesystem implementation for Azure Blob Storage

2021-12-02 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452556#comment-17452556 ] Neal Richardson commented on ARROW-2034: I'm not an expert here, but I think we could use C++14

[jira] [Commented] (ARROW-2034) [C++] Filesystem implementation for Azure Blob Storage

2021-12-02 Thread Tom Augspurger (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452547#comment-17452547 ] Tom Augspurger commented on ARROW-2034: --- Does Arrow support C++14 features now (or more

[jira] [Comment Edited] (ARROW-7594) [C++] Implement HTTP and FTP file systems

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17343421#comment-17343421 ] Antoine Pitrou edited comment on ARROW-7594 at 12/2/21, 5:42 PM: - We

[jira] [Resolved] (ARROW-13553) [Doc] Add guidelines for code reviews

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-13553. Resolution: Fixed Issue resolved by pull request 11757

[jira] [Comment Edited] (ARROW-14930) [Python] FileNotFound when using bucket+folders in S3 + partitioned parquet

2021-12-02 Thread Luis Morales (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452514#comment-17452514 ] Luis Morales edited comment on ARROW-14930 at 12/2/21, 5:13 PM: Tried

[jira] [Updated] (ARROW-14853) [C++][Python] Cryptic error message when required compute options missing

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14853: --- Fix Version/s: 7.0.0 > [C++][Python] Cryptic error message when required compute options

[jira] [Commented] (ARROW-14930) [Python] FileNotFound when using bucket+folders in S3 + partitioned parquet

2021-12-02 Thread Luis Morales (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452516#comment-17452516 ] Luis Morales commented on ARROW-14930: -- (they were four tests sorry, not three) :) > [Python]

[jira] [Commented] (ARROW-14930) [Python] FileNotFound when using bucket+folders in S3 + partitioned parquet

2021-12-02 Thread Luis Morales (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452514#comment-17452514 ] Luis Morales commented on ARROW-14930: -- Tried three things:   (the bucket as a FileSelector

[jira] [Commented] (ARROW-12629) [C++] Configurable read-ahead in CSV and JSON readers

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452498#comment-17452498 ] Antoine Pitrou commented on ARROW-12629: {{use_readahead = true}} would sound good to me. >

[jira] [Commented] (ARROW-12629) [C++] Configurable read-ahead in CSV and JSON readers

2021-12-02 Thread Supun Kamburugamuva (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452497#comment-17452497 ] Supun Kamburugamuva commented on ARROW-12629: - What would be a good option name for this? 

[jira] [Commented] (ARROW-14946) [C++][Python] An operator for finding indices of a value

2021-12-02 Thread Niranda Perera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452496#comment-17452496 ] Niranda Perera commented on ARROW-14946: [~jorisvandenbossche] actually my initial use case was

[jira] [Assigned] (ARROW-14970) [C++][Compute] Replace ExecNode::InputReceived with ::MakeTask (Part 2)

2021-12-02 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-14970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Percy Camilo Triveño Aucahuasi reassigned ARROW-14970: -- Assignee: Percy Camilo Triveño Aucahuasi >

[jira] [Created] (ARROW-14970) [C++][Compute] Replace ExecNode::InputReceived with ::MakeTask (Part 2)

2021-12-02 Thread Jira
Percy Camilo Triveño Aucahuasi created ARROW-14970: -- Summary: [C++][Compute] Replace ExecNode::InputReceived with ::MakeTask (Part 2) Key: ARROW-14970 URL:

[jira] [Resolved] (ARROW-14966) [R][CI] Add redundancy to CRAN mirrors for dependency installation

2021-12-02 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane resolved ARROW-14966. Resolution: Fixed Issue resolved by pull request 11839

[jira] [Assigned] (ARROW-14969) [C++][Python] Un-deprecate FileSystem::OpenAppendStream

2021-12-02 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina reassigned ARROW-14969: - Assignee: Akhil > [C++][Python] Un-deprecate FileSystem::OpenAppendStream >

[jira] [Assigned] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina reassigned ARROW-14904: - Assignee: Alessandro Molina > [C++] Enable CSV Writer to append / overwrite

[jira] [Assigned] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina reassigned ARROW-14904: - Assignee: (was: Alessandro Molina) > [C++] Enable CSV Writer to append /

[jira] [Assigned] (ARROW-14242) [Python] Array.to_string exposes confusing "indent" parameter

2021-12-02 Thread Marlene (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marlene reassigned ARROW-14242: --- Assignee: Marlene > [Python] Array.to_string exposes confusing "indent" parameter >

[jira] [Updated] (ARROW-14969) [C++][Python] Un-deprecate FileSystem::OpenAppendStream

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14969: --- Labels: good-first-issue (was: ) > [C++][Python] Un-deprecate FileSystem::OpenAppendStream

[jira] [Created] (ARROW-14969) [C++][Python] Un-deprecate FileSystem::OpenAppendStream

2021-12-02 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-14969: -- Summary: [C++][Python] Un-deprecate FileSystem::OpenAppendStream Key: ARROW-14969 URL: https://issues.apache.org/jira/browse/ARROW-14969 Project: Apache Arrow

[jira] [Updated] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-14904: -- Labels: good-first-issue (was: ) > [C++] Enable CSV Writer to append / overwrite

[jira] [Commented] (ARROW-14930) [Python] FileNotFound when using bucket+folders in S3 + partitioned parquet

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452453#comment-17452453 ] Joris Van den Bossche commented on ARROW-14930: --- A few questions to help diagnose the

[jira] [Updated] (ARROW-14930) [Python] FileNotFound when using bucket+folders in S3 + partitioned parquet

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-14930: -- Summary: [Python] FileNotFound when using bucket+folders in S3 + partitioned

[jira] [Updated] (ARROW-14959) [Python] Reading Hive-style partitioned parquet files from GCS

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-14959: -- Summary: [Python] Reading Hive-style partitioned parquet files from GCS

[jira] [Updated] (ARROW-14959) Reading Hive-style partitioned parquet files from GCS

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-14959: -- Component/s: Python > Reading Hive-style partitioned parquet files from GCS >

[jira] [Commented] (ARROW-14959) Reading Hive-style partitioned parquet files from GCS

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452448#comment-17452448 ] Joris Van den Bossche commented on ARROW-14959: --- Just to confirm, if you read a specific

[jira] [Resolved] (ARROW-14476) [CI] Crossbow should comment cause of failure

2021-12-02 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-14476. - Resolution: Fixed Issue resolved by pull request 11571

[jira] [Assigned] (ARROW-14968) [Python] Pin numpy build dependency using oldest-supported-numpy

2021-12-02 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-14968: --- Assignee: Krisztian Szucs > [Python] Pin numpy build dependency using

[jira] [Updated] (ARROW-14968) [Python] Pin numpy build dependency using oldest-supported-numpy

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14968: --- Labels: pull-request-available (was: ) > [Python] Pin numpy build dependency using

[jira] [Created] (ARROW-14968) [Python] Pin numpy build dependency using oldest-supported-numpy

2021-12-02 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14968: --- Summary: [Python] Pin numpy build dependency using oldest-supported-numpy Key: ARROW-14968 URL: https://issues.apache.org/jira/browse/ARROW-14968 Project:

[jira] [Commented] (ARROW-14946) [C++][Python] An operator for finding indices of a value

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452435#comment-17452435 ] Joris Van den Bossche commented on ARROW-14946: --- This is also related to numpy's

[jira] [Updated] (ARROW-13848) [C++] and() in a dataset filter

2021-12-02 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13848: --- Labels: beginner good-first-issue performance (was: beginner good-first-issue) > [C++]

[jira] [Commented] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452419#comment-17452419 ] Neal Richardson commented on ARROW-14904: - I think we just identified a use :) > [C++] Enable

[jira] [Created] (ARROW-14967) [CI][Python] Ability to include pip packages in the conda environments

2021-12-02 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-14967: - Summary: [CI][Python] Ability to include pip packages in the conda environments Key: ARROW-14967 URL: https://issues.apache.org/jira/browse/ARROW-14967

[jira] [Updated] (ARROW-14966) [R][CI] Add redundancy to CRAN mirrors for dependency installation

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14966: --- Labels: pull-request-available (was: ) > [R][CI] Add redundancy to CRAN mirrors for

[jira] [Created] (ARROW-14966) [R][CI] Add redundancy to CRAN mirrors for dependency installation

2021-12-02 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-14966: --- Summary: [R][CI] Add redundancy to CRAN mirrors for dependency installation Key: ARROW-14966 URL: https://issues.apache.org/jira/browse/ARROW-14966 Project:

[jira] [Updated] (ARROW-14956) [R] Implement lubridate's int_standardize

2021-12-02 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-14956: Summary: [R] Implement lubridate's int_standardize (was: [R] Implement lubirdate's

[jira] [Updated] (ARROW-14779) [C++] Add other common round mode names to RoundMode docs

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14779: --- Labels: beginner good-first-issue kernel pull-request-available (was: beginner

[jira] [Commented] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452417#comment-17452417 ] Antoine Pitrou commented on ARROW-14904: bq. Apologies if I'm reopening a debate that's been

[jira] [Commented] (ARROW-14904) [C++] Enable CSV Writer to append / overwrite existing file

2021-12-02 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452411#comment-17452411 ] Neal Richardson commented on ARROW-14904: - Apologies if I'm reopening a debate that's been

[jira] [Updated] (ARROW-14760) [Doc] Steps in making your first PR - PR life cycle

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14760: --- Labels: pull-request-available (was: ) > [Doc] Steps in making your first PR - PR life

[jira] [Updated] (ARROW-14965) Contention when reading Parquet files with multi-threading

2021-12-02 Thread Nick Gates (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14965?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Gates updated ARROW-14965: --- Description: I'm attempting to read a table from multiple Parquet files where I already know which

[jira] [Resolved] (ARROW-3699) [C++] Dockerfile for testing 32-bit C++ build

2021-12-02 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-3699. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 10865

[jira] [Assigned] (ARROW-14505) [CI][Docs] Exercise documentation builds on the main branch

2021-12-02 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-14505: --- Assignee: Krisztian Szucs > [CI][Docs] Exercise documentation builds on the main

[jira] [Resolved] (ARROW-14505) [CI][Docs] Exercise documentation builds on the main branch

2021-12-02 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-14505. - Resolution: Fixed Issue resolved by pull request 11567

[jira] [Updated] (ARROW-14964) [C++] Restructure internal compute/kernel utilities

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14964: --- Description: There is a diversity of utilities for writing kernels in

[jira] [Updated] (ARROW-14964) [C++] Restructure internal compute/kernel utilities

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14964: --- Labels: good-second-issue (was: ) > [C++] Restructure internal compute/kernel utilities >

[jira] [Created] (ARROW-14964) [C++] Restructure internal compute/kernel utilities

2021-12-02 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-14964: -- Summary: [C++] Restructure internal compute/kernel utilities Key: ARROW-14964 URL: https://issues.apache.org/jira/browse/ARROW-14964 Project: Apache Arrow

[jira] [Updated] (ARROW-14903) [C++] Enable CSV Writer to control string to be used for missing data

2021-12-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14903: --- Labels: good-first-issue pull-request-available (was: good-first-issue) > [C++] Enable CSV

[jira] [Commented] (ARROW-14963) [Doc] Add copy button extension to code-blocks

2021-12-02 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452363#comment-17452363 ] Alenka Frim commented on ARROW-14963: - Thanks Joris for the update! > [Doc] Add copy button

[jira] [Updated] (ARROW-14963) [Doc] Add copy button extension to code-blocks

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-14963: -- Summary: [Doc] Add copy button extension to code-blocks (was: [Doc] Add

[jira] [Comment Edited] (ARROW-14963) [Doc] Add extensions to code-blocks

2021-12-02 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452351#comment-17452351 ] Joris Van den Bossche edited comment on ARROW-14963 at 12/2/21, 11:37 AM:

[jira] [Resolved] (ARROW-14914) [C++] Implement GcsFileSystem::DeleteRootDirContents

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-14914. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11829

[jira] [Updated] (ARROW-14961) [C++] Bump version on Google Benchmark

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14961: --- Issue Type: Task (was: Bug) > [C++] Bump version on Google Benchmark >

[jira] [Updated] (ARROW-14961) [C++] Bump version on Google Benchmark

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14961: --- Summary: [C++] Bump version on Google Benchmark (was: Bump version on Google Benchmark )

[jira] [Updated] (ARROW-14961) [C++] Bump version on Google Benchmark

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14961: --- Priority: Minor (was: Trivial) > [C++] Bump version on Google Benchmark >

[jira] [Updated] (ARROW-14961) [C++] Bump version on Google Benchmark

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14961: --- Component/s: C++ > [C++] Bump version on Google Benchmark >

[jira] [Commented] (ARROW-14961) Bump version on Google Benchmark

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17452338#comment-17452338 ] Antoine Pitrou commented on ARROW-14961: Which functions are those? > Bump version on Google

[jira] [Resolved] (ARROW-13536) [C++] Use decimal point-aware conversion from fast_float

2021-12-02 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-13536. Resolution: Fixed Issue resolved by pull request 11817

  1   2   >