[jira] [Resolved] (ARROW-13511) [CI][R] Fail in the docker build step if R deps don't install

2021-08-06 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane resolved ARROW-13511. Resolution: Fixed Issue resolved by pull request 10841 [https://github.com/apache/arrow/pu

[jira] [Commented] (ARROW-12873) [C++][Compute] Support tagging ExecBatches with arbitrary extra information

2021-08-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17395006#comment-17395006 ] Weston Pace commented on ARROW-12873: - I thought the original proposal was tagging r

[jira] [Commented] (ARROW-12873) [C++][Compute] Support tagging ExecBatches with arbitrary extra information

2021-08-06 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17395002#comment-17395002 ] David Li commented on ARROW-12873: -- I think that's just a different way of encoding wha

[jira] [Commented] (ARROW-12873) [C++][Compute] Support tagging ExecBatches with arbitrary extra information

2021-08-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17395001#comment-17395001 ] Weston Pace commented on ARROW-12873: - With the caveat that I haven't been following

[jira] [Updated] (ARROW-13345) [C++] Implement logN compute function

2021-08-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13345: --- Labels: pull-request-available (was: ) > [C++] Implement logN compute function > --

[jira] [Updated] (ARROW-13580) [C++] quoted_strings_can_be_null only applied to string columns

2021-08-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13580: --- Labels: pull-request-available (was: ) > [C++] quoted_strings_can_be_null only applied to s

[jira] [Comment Edited] (ARROW-12873) [C++][Compute] Support tagging ExecBatches with arbitrary extra information

2021-08-06 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394991#comment-17394991 ] David Li edited comment on ARROW-12873 at 8/6/21, 9:20 PM: --- So

[jira] [Commented] (ARROW-12873) [C++][Compute] Support tagging ExecBatches with arbitrary extra information

2021-08-06 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394991#comment-17394991 ] David Li commented on ARROW-12873: -- So I tried implementing an arg_min_max node as part

[jira] [Created] (ARROW-13581) pyarrow array equals return False if there's nan

2021-08-06 Thread David Zhang (Jira)
David Zhang created ARROW-13581: --- Summary: pyarrow array equals return False if there's nan Key: ARROW-13581 URL: https://issues.apache.org/jira/browse/ARROW-13581 Project: Apache Arrow Issue

[jira] [Assigned] (ARROW-13580) [C++] quoted_strings_can_be_null only applied to string columns

2021-08-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-13580: --- Assignee: Weston Pace > [C++] quoted_strings_can_be_null only applied to string columns > -

[jira] [Created] (ARROW-13580) [C++] quoted_strings_can_be_null only applied to string columns

2021-08-06 Thread Weston Pace (Jira)
Weston Pace created ARROW-13580: --- Summary: [C++] quoted_strings_can_be_null only applied to string columns Key: ARROW-13580 URL: https://issues.apache.org/jira/browse/ARROW-13580 Project: Apache Arrow

[jira] [Closed] (ARROW-13567) [C++] ConvertOptions::Defaults leaves `timestamp_parsers` uninitialized

2021-08-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-13567. --- Resolution: Not A Bug > [C++] ConvertOptions::Defaults leaves `timestamp_parsers` uninitialized > --

[jira] [Commented] (ARROW-13567) [C++] ConvertOptions::Defaults leaves `timestamp_parsers` uninitialized

2021-08-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394969#comment-17394969 ] Weston Pace commented on ARROW-13567: - I ended up getting to the bottom of this. My

[jira] [Updated] (ARROW-12959) [C++][R] Option for is_null(NaN) to evaluate to true

2021-08-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-12959: --- Labels: pull-request-available (was: ) > [C++][R] Option for is_null(NaN) to evaluate to tr

[jira] [Assigned] (ARROW-13540) [C++][Compute] Add OrderByNode for ordering of rows in an ExecPlan

2021-08-06 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-13540: --- Assignee: Neal Richardson (was: David Li) > [C++][Compute] Add OrderByNode for ord

[jira] [Assigned] (ARROW-13540) [C++][Compute] Add OrderByNode for ordering of rows in an ExecPlan

2021-08-06 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-13540: --- Assignee: David Li (was: Neal Richardson) > [C++][Compute] Add OrderByNode for ord

[jira] [Updated] (ARROW-13574) [C++] Implement "count(*)" hash aggregate kernel

2021-08-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13574: --- Labels: pull-request-available (was: ) > [C++] Implement "count(*)" hash aggregate kernel >

[jira] [Assigned] (ARROW-7051) [C++] Improve MakeArrayOfNull to support creation of multiple arrays

2021-08-06 Thread Alexander (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander reassigned ARROW-7051: Assignee: Alexander > [C++] Improve MakeArrayOfNull to support creation of multiple arrays > -

[jira] [Updated] (ARROW-13578) Inconsistent handling of integer-valued partitions in dataset filters API

2021-08-06 Thread Matt Nizol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Nizol updated ARROW-13578: --- Description: When creating a partitioned data set via the pandas.to_parquet() method, partition col

[jira] [Created] (ARROW-13578) Inconsistent handling of integer-valued partitions in dataset filters API

2021-08-06 Thread Matt Nizol (Jira)
Matt Nizol created ARROW-13578: -- Summary: Inconsistent handling of integer-valued partitions in dataset filters API Key: ARROW-13578 URL: https://issues.apache.org/jira/browse/ARROW-13578 Project: Apache

[jira] [Updated] (ARROW-13579) Expose Create EmptyArray, EmptyRecordBatch and EmptyTable utility functions.

2021-08-06 Thread Alexander (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander updated ARROW-13579: -- Description:   Expose Create EmptyArray, EmptyRecordBatch and EmptyTable utility functions.   [http

[jira] [Created] (ARROW-13579) Expose Create EmptyArray, EmptyRecordBatch and EmptyTable utility functions.

2021-08-06 Thread Alexander (Jira)
Alexander created ARROW-13579: - Summary: Expose Create EmptyArray, EmptyRecordBatch and EmptyTable utility functions. Key: ARROW-13579 URL: https://issues.apache.org/jira/browse/ARROW-13579 Project: Apac

[jira] [Updated] (ARROW-13578) Inconsistent handling of integer-valued partitions in dataset filters API

2021-08-06 Thread Matt Nizol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Nizol updated ARROW-13578: --- Description: When creating a partitioned data set via the pandas.to_parquet() method, partition col

[jira] [Commented] (ARROW-13474) [C++][Python] PyArrow crash when filter/take empty Extension array

2021-08-06 Thread Paul Balanca (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17394771#comment-17394771 ] Paul Balanca commented on ARROW-13474: -- Thanks [~jorisvandenbossche] I don't want t

[jira] [Updated] (ARROW-13577) [Python][FlightRPC] pyarrow client do_put close method after write_table did not throw flight error

2021-08-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13577: --- Labels: pull-request-available (was: ) > [Python][FlightRPC] pyarrow client do_put close me

[jira] [Created] (ARROW-13577) [Python][FlightRPC] pyarrow client do_put close method after write_table did not throw flight error

2021-08-06 Thread lixiang li (Jira)
lixiang li created ARROW-13577: -- Summary: [Python][FlightRPC] pyarrow client do_put close method after write_table did not throw flight error Key: ARROW-13577 URL: https://issues.apache.org/jira/browse/ARROW-13577

[jira] [Updated] (ARROW-13268) [C++][Compute] Add ExecNode for semi and anti-semi join

2021-08-06 Thread Michal Nowakiewicz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Nowakiewicz updated ARROW-13268: --- Labels: pull-request-available query-engine (was: pull-request-available) > [C++][C

[jira] [Updated] (ARROW-13540) [C++][Compute] Add OrderByNode for ordering of rows in an ExecPlan

2021-08-06 Thread Michal Nowakiewicz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Nowakiewicz updated ARROW-13540: --- Fix Version/s: 6.0.0 Labels: pull-request-available query-engine (was: p

[jira] [Updated] (ARROW-13532) [C++][Compute] Join: add set membership test method to the grouper

2021-08-06 Thread Michal Nowakiewicz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Nowakiewicz updated ARROW-13532: --- Labels: pull-request-available query-engine (was: pull-request-available) > [C++][C

[jira] [Updated] (ARROW-12727) [C++][Compute] GroupBy: support more than 2^32 groups

2021-08-06 Thread Michal Nowakiewicz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Nowakiewicz updated ARROW-12727: --- Labels: query-engine (was: ) > [C++][Compute] GroupBy: support more than 2^32 group

[jira] [Updated] (ARROW-1565) [C++][Compute] Implement TopK/BottomK streaming execution nodes

2021-08-06 Thread Michal Nowakiewicz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michal Nowakiewicz updated ARROW-1565: -- Fix Version/s: 6.0.0 > [C++][Compute] Implement TopK/BottomK streaming execution nodes