[jira] [Updated] (ARROW-15053) [Python] Attribute nbytes of slice will return the value corresponding to the whole structure

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-15053: -- Summary: [Python] Attribute nbytes of slice will return the value correspondin

[jira] [Updated] (ARROW-15053) [Python] Attribute nbytes of slice will return the value corresponding to the whole structure

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-15053: -- Fix Version/s: 7.0.0 > [Python] Attribute nbytes of slice will return the valu

[jira] [Commented] (ARROW-15053) Attribute nbytes of slice will return the value corresponding to the whole structure

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458226#comment-17458226 ] Joris Van den Bossche commented on ARROW-15053: --- I think we should conside

[jira] [Updated] (ARROW-15050) [Python] pyarrow.scalar doesn't accept nested pyarrow values

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-15050: -- Summary: [Python] pyarrow.scalar doesn't accept nested pyarrow values (was: p

[jira] [Commented] (ARROW-15050) [Python] pyarrow.scalar doesn't accept nested pyarrow values

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458230#comment-17458230 ] Joris Van den Bossche commented on ARROW-15050: --- [~vladfi] thanks for the

[jira] [Updated] (ARROW-5295) [Python] accept pyarrow values / scalars in constructor functions ?

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5295: - Labels: python-conversion (was: ) > [Python] accept pyarrow values / scalars in

[jira] [Updated] (ARROW-15050) [Python] pyarrow.scalar doesn't accept nested pyarrow values

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-15050: -- Labels: python-conversion (was: ) > [Python] pyarrow.scalar doesn't accept ne

[jira] [Commented] (ARROW-15045) PyArrow SIGSEGV error when using UnionDatasets

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458240#comment-17458240 ] Joris Van den Bossche commented on ARROW-15045: --- bq. At the time I am wri

[jira] [Updated] (ARROW-15045) PyArrow SIGSEGV error when using UnionDatasets

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-15045: -- Labels: dataset (was: ) > PyArrow SIGSEGV error when using UnionDatasets > --

[jira] [Assigned] (ARROW-14762) [Doc] Additional info and resources

2021-12-13 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alenka Frim reassigned ARROW-14762: --- Assignee: Alenka Frim > [Doc] Additional info and resources > -

[jira] [Resolved] (ARROW-15036) [C++] Automatically configure S3 SDK configuration parameter "maxConnections"

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-15036. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11929 [https:

[jira] [Assigned] (ARROW-15061) [C++] Add logging for kernel functions and exec plan nodes

2021-12-13 Thread Matthijs Brobbel (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthijs Brobbel reassigned ARROW-15061: Assignee: Matthijs Brobbel > [C++] Add logging for kernel functions and exec plan

[jira] [Commented] (ARROW-15073) [C++][Parquet][Python] LZ4- and zstd- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458247#comment-17458247 ] Antoine Pitrou commented on ARROW-15073: The Spark error message is weird. We do

[jira] [Commented] (ARROW-15074) [C++] Support multiple frames in LZ4?

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458249#comment-17458249 ] Antoine Pitrou commented on ARROW-15074: Hmm, I think we should make the spec mo

[jira] [Updated] (ARROW-15074) [C++] Support multiple frames in LZ4?

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-15074: --- Component/s: C++ > [C++] Support multiple frames in LZ4? > -

[jira] [Commented] (ARROW-14930) [C++][Python] FileNotFound with Scality accessed through S3 APIs

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458257#comment-17458257 ] Antoine Pitrou commented on ARROW-14930: I tried reproducing by creating a singl

[jira] [Comment Edited] (ARROW-15073) [C++][Parquet][Python] LZ4- and zstd- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458302#comment-17458302 ] Micah Kornfield edited comment on ARROW-15073 at 12/13/21, 10:56 AM: -

[jira] [Commented] (ARROW-15073) [C++][Parquet][Python] LZ4- and zstd- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458302#comment-17458302 ] Micah Kornfield commented on ARROW-15073: - If LZ4 gets translated to LZ4_RAW dep

[jira] [Created] (ARROW-15076) [C++][Gandiva][CI] Test failure/crash on fedora-cpp

2021-12-13 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-15076: -- Summary: [C++][Gandiva][CI] Test failure/crash on fedora-cpp Key: ARROW-15076 URL: https://issues.apache.org/jira/browse/ARROW-15076 Project: Apache Arrow

[jira] [Commented] (ARROW-14431) [C++][Gandiva] Implement AES ENCRYPT and AES DECRYPT functions

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458319#comment-17458319 ] Antoine Pitrou commented on ARROW-14431: [~pravindra] Next time, can you assign

[jira] [Commented] (ARROW-15076) [C++][Gandiva][CI] Test failure/crash on fedora-cpp

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458320#comment-17458320 ] Antoine Pitrou commented on ARROW-15076: cc [~pravindra] [~augustoasilva] Can on

[jira] [Assigned] (ARROW-14431) [C++][Gandiva] Implement AES ENCRYPT and AES DECRYPT functions

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-14431: -- Assignee: Augusto Alves Silva > [C++][Gandiva] Implement AES ENCRYPT and AES DECRYPT

[jira] [Commented] (ARROW-13237) [C++] S3 FileSystem doesn't seem to handle redirects

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458330#comment-17458330 ] Antoine Pitrou commented on ARROW-13237: Ok, it seems you can use (on recent AWS

[jira] [Updated] (ARROW-12535) [C++] Enable metadata writing in the ORCWriter

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-12535: --- Summary: [C++] Enable metadata writing in the ORCWriter (was: Enable metadata writing in th

[jira] [Updated] (ARROW-602) [C++] Provide iterator access to primitive elements inside a Column/ChunkedArray

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-602: - Labels: beginner good-first-issue newbie (was: beginner newbie) > [C++] Provide iterator access t

[jira] [Commented] (ARROW-602) [C++] Provide iterator access to primitive elements inside a Column/ChunkedArray

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-602?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458338#comment-17458338 ] Antoine Pitrou commented on ARROW-602: -- There will be an API and code organization as

[jira] [Commented] (ARROW-15045) PyArrow SIGSEGV error when using UnionDatasets

2021-12-13 Thread Thomas Cercato (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458340#comment-17458340 ] Thomas Cercato commented on ARROW-15045: This is an example tree of my data fold

[jira] [Commented] (ARROW-1888) [C++] Implement casts from one struct type to another (with same field names and number of fields)

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458346#comment-17458346 ] Antoine Pitrou commented on ARROW-1888: --- [~diegodfrf] Are you still planning to wor

[jira] [Updated] (ARROW-4729) [C++] Improve buffer symbolic index

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-4729: -- Fix Version/s: (was: 7.0.0) > [C++] Improve buffer symbolic index > ---

[jira] [Updated] (ARROW-4729) [C++] Improve buffer symbolic index

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-4729: -- Fix Version/s: 8.0.0 > [C++] Improve buffer symbolic index > --

[jira] [Assigned] (ARROW-5338) [Format][Integration] Define how to test for delta dictionary support in the JSON integration test data format

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-5338: - Assignee: (was: Antoine Pitrou) > [Format][Integration] Define how to test for delta

[jira] [Updated] (ARROW-5338) [Format][Integration] Define how to test for delta dictionary support in the JSON integration test data format

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-5338: -- Fix Version/s: 8.0.0 (was: 7.0.0) > [Format][Integration] Define how to

[jira] [Updated] (ARROW-10222) [C++] Add FileSystem::MakeUri() to serialize file locations to URIs

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-10222: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Add FileSystem::MakeUri() t

[jira] [Updated] (ARROW-6033) [C++] Provide an initialization and/or compatibility check function

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6033: -- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Provide an initialization and/

[jira] [Updated] (ARROW-11003) [C++][Dataset] Schema evolution in Dataset scanning

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-11003: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Dataset] Schema evolution i

[jira] [Updated] (ARROW-11419) [C++] DICTIONARY_REPLACEMENT neither read nor written

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-11419: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] DICTIONARY_REPLACEMENT neit

[jira] [Updated] (ARROW-11465) [C++] Parquet file writer snapshot API and proper ColumnChunk.file_path utilization

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-11465: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Parquet file writer snapsho

[jira] [Updated] (ARROW-11749) [C++][Dataset] Support projections between children of UnionDatasets

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-11749: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Dataset] Support projection

[jira] [Updated] (ARROW-12046) [C++] Support string collation for sorting

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-12046: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Support string collation fo

[jira] [Commented] (ARROW-12203) [C++][Python] Switch default Parquet version to 2.4

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458349#comment-17458349 ] Antoine Pitrou commented on ARROW-12203: [~emkornfield]  Did you have the opport

[jira] [Updated] (ARROW-12535) [C++] Enable metadata writing in the ORCWriter

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-12535: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Enable metadata writing in

[jira] [Resolved] (ARROW-8340) [Documentation] Sphinx documentation does not build with just-released Sphinx 3.0.0

2021-12-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-8340. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11878 [https:/

[jira] [Updated] (ARROW-15049) [R] arrowExports.cpp generation changed with glue package 1.5.1

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15049: --- Labels: pull-request-available (was: ) > [R] arrowExports.cpp generation changed with glue

[jira] [Commented] (ARROW-15045) PyArrow SIGSEGV error when using UnionDatasets

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458366#comment-17458366 ] Joris Van den Bossche commented on ARROW-15045: --- Thanks for that clarifica

[jira] [Comment Edited] (ARROW-15045) PyArrow SIGSEGV error when using UnionDatasets

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458366#comment-17458366 ] Joris Van den Bossche edited comment on ARROW-15045 at 12/13/21, 12:54 PM: ---

[jira] [Commented] (ARROW-15049) [R] arrowExports.cpp generation changed with glue package 1.5.1

2021-12-13 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458367#comment-17458367 ] Dewey Dunnington commented on ARROW-15049: -- Done! https://github.com/tidyverse/

[jira] [Updated] (ARROW-15075) [C++][Dataset] Implement Dataset for JSON format

2021-12-13 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-15075: - Labels: dataset (was: ) > [C++][Dataset] Implement Dataset for JSON format > --

[jira] [Commented] (ARROW-15045) PyArrow SIGSEGV error when using UnionDatasets

2021-12-13 Thread Thomas Cercato (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458380#comment-17458380 ] Thomas Cercato commented on ARROW-15045: Currently the workstation is running a

[jira] [Updated] (ARROW-1569) [C++] Kernel functions for determining monotonicity (ascending or descending) for well-ordered types

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1569: -- Labels: Analytics pull-request-available (was: Analytics) > [C++] Kernel functions for determi

[jira] [Resolved] (ARROW-14625) [Python][CI] Enable CI for Python on s390x

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-14625. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11688 [https:

[jira] [Resolved] (ARROW-13756) [Python] Error in pandas conversion for datetimetz column index

2021-12-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-13756. --- Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request

[jira] [Created] (ARROW-15077) [Python] Move Expression class from _dataset to _compute cython module

2021-12-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-15077: - Summary: [Python] Move Expression class from _dataset to _compute cython module Key: ARROW-15077 URL: https://issues.apache.org/jira/browse/ARROW-15077

[jira] [Updated] (ARROW-15077) [Python] Move Expression class from _dataset to _compute cython module

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15077: --- Labels: pull-request-available (was: ) > [Python] Move Expression class from _dataset to _c

[jira] [Closed] (ARROW-14689) [R] Tutorial on using tidyverse and base R functions in Arrow

2021-12-13 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina closed ARROW-14689. - Resolution: Done > [R] Tutorial on using tidyverse and base R functions in Arrow > -

[jira] [Created] (ARROW-15078) [C++] Silence CMake error "includes non-existent path" with bundled OpenTelemetry

2021-12-13 Thread David Li (Jira)
David Li created ARROW-15078: Summary: [C++] Silence CMake error "includes non-existent path" with bundled OpenTelemetry Key: ARROW-15078 URL: https://issues.apache.org/jira/browse/ARROW-15078 Project: Ap

[jira] [Assigned] (ARROW-15078) [C++] Silence CMake error "includes non-existent path" with bundled OpenTelemetry

2021-12-13 Thread Matthijs Brobbel (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthijs Brobbel reassigned ARROW-15078: Assignee: Matthijs Brobbel > [C++] Silence CMake error "includes non-existent pat

[jira] [Resolved] (ARROW-14737) [C++][Dataset] Support URI-decoding partition keys

2021-12-13 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-14737. -- Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11858 [https://github.com

[jira] [Updated] (ARROW-15078) [C++] Silence CMake error "includes non-existent path" with bundled OpenTelemetry

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15078: --- Labels: pull-request-available (was: ) > [C++] Silence CMake error "includes non-existent p

[jira] [Assigned] (ARROW-12768) [C++] Add sign bit checks to floating-point arithmetic kernels tests

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-12768: -- Assignee: Antoine Pitrou > [C++] Add sign bit checks to floating-point arithmetic ker

[jira] [Commented] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets

2021-12-13 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458503#comment-17458503 ] Will Jones commented on ARROW-15072: When you install the arrow R package on linux,

[jira] [Commented] (ARROW-15072) [R] Error: This build of the arrow package does not support Datasets

2021-12-13 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458505#comment-17458505 ] Jonathan Keane commented on ARROW-15072: Seconding [~willjones127]'s suggestions

[jira] [Updated] (ARROW-15073) [C++][Parquet][Python] LZ4- and compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão updated ARROW-15073: - Summary: [C++][Parquet][Python] LZ4- and compressed parquet files are unreadable by (py)spark (

[jira] [Commented] (ARROW-15073) [C++][Parquet][Python] LZ4- and zstd- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458523#comment-17458523 ] Jorge Leitão commented on ARROW-15073: -- the ZSTD I tested it myself as I wanted to

[jira] [Updated] (ARROW-15073) [C++][Parquet][Python] LZ4- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão updated ARROW-15073: - Summary: [C++][Parquet][Python] LZ4- compressed parquet files are unreadable by (py)spark (was:

[jira] [Commented] (ARROW-15074) [C++] Support multiple frames in LZ4?

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458525#comment-17458525 ] Jorge Leitão commented on ARROW-15074: -- I do find a bit odd to restrict which LZ4 w

[jira] [Updated] (ARROW-12768) [C++] Add sign bit checks to floating-point arithmetic kernels tests

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-12768: --- Labels: pull-request-available (was: ) > [C++] Add sign bit checks to floating-point arithm

[jira] [Commented] (ARROW-15073) [C++][Parquet][Python] LZ4- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458531#comment-17458531 ] Antoine Pitrou commented on ARROW-15073: Hmm, so is there anything to do on the

[jira] [Commented] (ARROW-15074) [C++] Support multiple frames in LZ4?

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458532#comment-17458532 ] Antoine Pitrou commented on ARROW-15074: Well, it doesn't seem to be useful on t

[jira] [Commented] (ARROW-15073) [C++][Parquet][Python] LZ4- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458534#comment-17458534 ] Jorge Leitão commented on ARROW-15073: -- No, closed. Thank you for your input! > [

[jira] [Closed] (ARROW-15073) [C++][Parquet][Python] LZ4- compressed parquet files are unreadable by (py)spark

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão closed ARROW-15073. Resolution: Not A Problem > [C++][Parquet][Python] LZ4- compressed parquet files are unreadable by

[jira] [Resolved] (ARROW-15056) [C++] Speed up GcsFileSystem tests

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-15056. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11933 [https:

[jira] [Created] (ARROW-15079) [C++] Add scheduler to constrain memory of exec plans

2021-12-13 Thread Weston Pace (Jira)
Weston Pace created ARROW-15079: --- Summary: [C++] Add scheduler to constrain memory of exec plans Key: ARROW-15079 URL: https://issues.apache.org/jira/browse/ARROW-15079 Project: Apache Arrow Is

[jira] [Updated] (ARROW-15074) [C++] Support multiple frames in LZ4?

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15074: --- Labels: pull-request-available (was: ) > [C++] Support multiple frames in LZ4? > --

[jira] [Updated] (ARROW-15079) [C++] Add scheduler to constrain memory of exec plans

2021-12-13 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-15079: Description: This is a high-level JIRA that will probably consist of a number of subtasks. Curre

[jira] [Comment Edited] (ARROW-13663) [C++] RecordBatchReader should support STL-like iteration

2021-12-13 Thread Dhruv Vats (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456692#comment-17456692 ] Dhruv Vats edited comment on ARROW-13663 at 12/13/21, 5:11 PM: ---

[jira] [Assigned] (ARROW-15074) [C++] Support multiple frames in LZ4?

2021-12-13 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-15074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão reassigned ARROW-15074: Assignee: Jorge Leitão > [C++] Support multiple frames in LZ4? >

[jira] [Commented] (ARROW-12960) [C++][R] Option for is_nan(null) to evaluate to false or true

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458559#comment-17458559 ] Antoine Pitrou commented on ARROW-12960: What is the status on this? > [C++][R]

[jira] [Resolved] (ARROW-15078) [C++] Silence CMake error "includes non-existent path" with bundled OpenTelemetry

2021-12-13 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-15078. -- Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11939 [https://github.com

[jira] [Commented] (ARROW-13663) [C++] RecordBatchReader should support STL-like iteration

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458574#comment-17458574 ] Antoine Pitrou commented on ARROW-13663: Given the typical usage of STL iterator

[jira] [Updated] (ARROW-13969) [C++][Compute] Implement SelectKStable

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13969: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Compute] Implement SelectKS

[jira] [Updated] (ARROW-13970) [C++][Compute] Implement streaming version for SelectK

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13970: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Compute] Implement streamin

[jira] [Updated] (ARROW-14126) [C++] Add locale support for relevant string compute functions

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14126: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Add locale support for rele

[jira] [Created] (ARROW-15080) [Python] Allow creation of month_day_nano interval from tuple

2021-12-13 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-15080: -- Summary: [Python] Allow creation of month_day_nano interval from tuple Key: ARROW-15080 URL: https://issues.apache.org/jira/browse/ARROW-15080 Project: Apache Arr

[jira] [Updated] (ARROW-15080) [Python] Allow creation of month_day_nano interval from tuple

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-15080: --- Description: This should ideally be allowed but isn't: {code:python} >>> a = pa.array([(3, 2

[jira] [Commented] (ARROW-15080) [Python] Allow creation of month_day_nano interval from tuple

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458579#comment-17458579 ] Antoine Pitrou commented on ARROW-15080: cc [~emkornfield] > [Python] Allow cre

[jira] [Updated] (ARROW-14762) [Doc] Additional info and resources

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14762: --- Labels: pull-request-available (was: ) > [Doc] Additional info and resources >

[jira] [Updated] (ARROW-15080) [Python] Allow creation of month_day_nano interval from tuple

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-15080: --- Priority: Minor (was: Major) > [Python] Allow creation of month_day_nano interval from tupl

[jira] [Assigned] (ARROW-15069) [R] open_dataset very slow on heavily partitioned parquet dataset

2021-12-13 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-15069: --- Assignee: Weston Pace > [R] open_dataset very slow on heavily partitioned parquet dataset >

[jira] [Commented] (ARROW-15069) [R] open_dataset very slow on heavily partitioned parquet dataset

2021-12-13 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458583#comment-17458583 ] Weston Pace commented on ARROW-15069: - I'm going to try and take a look at this soon

[jira] [Updated] (ARROW-14254) [C++] Return a random sample of rows from a query

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14254: --- Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Return a random sample of r

[jira] [Commented] (ARROW-14332) [C++] Rename type traits utilities to improve semantic consistency

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458584#comment-17458584 ] Antoine Pitrou commented on ARROW-14332: [~edponce] Do you want to propose a PR

[jira] [Commented] (ARROW-14130) [C++] Reuse original offsets buffer in ASCII string kernels

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458585#comment-17458585 ] Antoine Pitrou commented on ARROW-14130: Do you want to submit a PR for this [~e

[jira] [Commented] (ARROW-14075) [C++][CI] Add an appveyor CI job for VisualStudio 2019, non-conda

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458587#comment-17458587 ] Antoine Pitrou commented on ARROW-14075: There is a VS 2019 + Windows 2019 job o

[jira] [Commented] (ARROW-14133) [C++] Simplify registration of scalar arithmetic/string functions

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458588#comment-17458588 ] Antoine Pitrou commented on ARROW-14133: [~edponce] Do you want to submit a PR h

jira@arrow.apache.org

2021-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-14918: --- Labels: pull-request-available (was: ) > [C++] Implement GcsFileSystem::GetFileInfo(const F

[jira] [Commented] (ARROW-14334) [C++][Compute] Add UTF-8 non-ASCII tests to scalar string kernels

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17458593#comment-17458593 ] Antoine Pitrou commented on ARROW-14334: [~edponce] Which functions are you thin

[jira] [Updated] (ARROW-14333) [C++][Compute] Add binary and LargeStringType tests to comparison kernels

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14333: --- Labels: good-first-issue good-second-issue (was: test type) > [C++][Compute] Add binary and

[jira] [Resolved] (ARROW-15049) [R] arrowExports.cpp generation changed with glue package 1.5.1

2021-12-13 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane resolved ARROW-15049. Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11936 [https:

[jira] [Updated] (ARROW-14587) [CI][Crossbow] Fetch a single crossbow branch instead of the full repo on Azure

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-14587: --- Priority: Minor (was: Major) > [CI][Crossbow] Fetch a single crossbow branch instead of the

[jira] [Updated] (ARROW-9111) [Python] csv.read_csv progress bar

2021-12-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9111: -- Fix Version/s: 8.0.0 (was: 7.0.0) > [Python] csv.read_csv progress bar >

  1   2   >