[jira] [Assigned] (ARROW-13467) [C++] Support delta dictionaries in the IPC file format

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-13467: --- Assignee: Weston Pace > [C++] Support delta dictionaries in the IPC file format > -

[jira] [Updated] (ARROW-13467) [C++] Support delta dictionaries in the IPC file format

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13467: --- Labels: pull-request-available (was: ) > [C++] Support delta dictionaries in the IPC file f

[jira] [Resolved] (ARROW-14577) [C++] Enable fine grained IO for async IPC reader

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-14577. - Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 11616 [https://gith

[jira] [Updated] (ARROW-15327) [R] Update news for 7.0.0

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15327: --- Labels: pull-request-available (was: ) > [R] Update news for 7.0.0 > --

[jira] [Updated] (ARROW-15327) [R] Update news for 7.0.0

2022-01-14 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-15327: --- Summary: [R] Update news for 7.0.0 (was: [R] Update news) > [R] Update news for 7.0.0 > ---

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476475#comment-17476475 ] Weston Pace commented on ARROW-12358: - > or perhaps just specifying that DeleteDirCo

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476472#comment-17476472 ] David Li commented on ARROW-12358: -- I think we use KeyError for such things (or else we

[jira] [Comment Edited] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476470#comment-17476470 ] Weston Pace edited comment on ARROW-12358 at 1/14/22, 11:20 PM: --

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476470#comment-17476470 ] Weston Pace commented on ARROW-12358: - The "not found" error is thrown from python a

[jira] [Assigned] (ARROW-15327) [R] Update news

2022-01-14 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane reassigned ARROW-15327: -- Assignee: Jonathan Keane > [R] Update news > --- > > Key:

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476468#comment-17476468 ] David Li commented on ARROW-12358: -- Hmm, what about calling DeleteDirContents and swall

[jira] [Closed] (ARROW-15341) [Go] ipc.Reader Record Array leak

2022-01-14 Thread Christopher Wolff (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christopher Wolff closed ARROW-15341. - Resolution: Fixed > [Go] ipc.Reader Record Array leak >

[jira] [Commented] (ARROW-15341) [Go] ipc.Reader Record Array leak

2022-01-14 Thread Christopher Wolff (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476463#comment-17476463 ] Christopher Wolff commented on ARROW-15341: --- This seems to be fixed in the lat

[jira] [Created] (ARROW-15341) [Go] ipc.Reader Record Array leak

2022-01-14 Thread Christopher Wolff (Jira)
Christopher Wolff created ARROW-15341: - Summary: [Go] ipc.Reader Record Array leak Key: ARROW-15341 URL: https://issues.apache.org/jira/browse/ARROW-15341 Project: Apache Arrow Issue Type

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476451#comment-17476451 ] Weston Pace commented on ARROW-12358: - This (https://github.com/apache/arrow/compar

[jira] [Closed] (ARROW-15330) Relaxing `grpc-cpp` requirements to avoid package inconsistency

2022-01-14 Thread Prem Sagar Gali (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prem Sagar Gali closed ARROW-15330. --- Resolution: Invalid > Relaxing `grpc-cpp` requirements to avoid package inconsistency >

[jira] [Commented] (ARROW-15330) Relaxing `grpc-cpp` requirements to avoid package inconsistency

2022-01-14 Thread Prem Sagar Gali (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476449#comment-17476449 ] Prem Sagar Gali commented on ARROW-15330: - Thanks [~keith.j.kraus] ! I was under

[jira] [Commented] (ARROW-15340) [C++] Try and fetch IPC footer in one read instead of two

2022-01-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476448#comment-17476448 ] David Li commented on ARROW-15340: -- And just for reference, the Parquet reader already

[jira] [Commented] (ARROW-14908) [R] join on dataset crashes on Windows

2022-01-14 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476446#comment-17476446 ] Will Jones commented on ARROW-14908: [~jclark] thanks for that example repro. I get

[jira] [Created] (ARROW-15340) [C++] Try and fetch IPC footer in one read instead of two

2022-01-14 Thread Weston Pace (Jira)
Weston Pace created ARROW-15340: --- Summary: [C++] Try and fetch IPC footer in one read instead of two Key: ARROW-15340 URL: https://issues.apache.org/jira/browse/ARROW-15340 Project: Apache Arrow

[jira] [Commented] (ARROW-13340) [C++][Dataset] Simplify ScanOptions after complexity has moved to ScanNode

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476424#comment-17476424 ] Weston Pace commented on ARROW-13340: - I think this is a good idea (some of the disc

[jira] [Closed] (ARROW-13328) [C++][Dataset] Use an ExecPlan for synchronous scans or drop synchronous scans

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-13328. --- Fix Version/s: 7.0.0 (was: 8.0.0) Resolution: Fixed ARROW-13554 remove

[jira] [Closed] (ARROW-13338) [C++][Dataset] Make async Scanner the default

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-13338. --- Fix Version/s: 7.0.0 (was: 8.0.0) Resolution: Fixed Fixed with ARROW-1

[jira] [Commented] (ARROW-13338) [C++][Dataset] Make async Scanner the default

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476407#comment-17476407 ] Weston Pace commented on ARROW-13338: - Yes. I will close this. > [C++][Dataset] Ma

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476405#comment-17476405 ] Weston Pace commented on ARROW-12358: - Ah, I think I see. We call something like...

[jira] [Commented] (ARROW-13467) [C++] Support delta dictionaries in the IPC file format

2022-01-14 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476403#comment-17476403 ] Ian Cook commented on ARROW-13467: -- See this thread from the user@ list for more detail

[jira] [Resolved] (ARROW-15332) [C++] Add new cases and fix issues in IPC read/write benchmark

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-15332. - Fix Version/s: 7.0.0 Resolution: Fixed Issue resolved by pull request 12150 [https://gith

[jira] [Commented] (ARROW-14911) [C++] arrow-compute-hash-join-node-test failed

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476361#comment-17476361 ] Weston Pace commented on ARROW-14911: - I ran tests against the 6.0.1 tag without any

[jira] [Updated] (ARROW-14911) [C++] arrow-compute-hash-join-node-test failed

2022-01-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-14911: Priority: Major (was: Blocker) > [C++] arrow-compute-hash-join-node-test failed > ---

[jira] [Commented] (ARROW-14047) [C++] [Parquet] FileReader returns inconsistent results on repeat reads

2022-01-14 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476352#comment-17476352 ] Will Jones commented on ARROW-14047: Update: I've managed to reproduce this issue in

[jira] [Commented] (ARROW-15253) [Python] Error in to_pandas for empty dataframe with pd.interval_range index

2022-01-14 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476338#comment-17476338 ] Alenka Frim commented on ARROW-15253: - Question: how could I check for `ArrowInterva

[jira] [Updated] (ARROW-3039) [Go] add support for DictionaryArray

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3039: -- Labels: pull-request-available (was: ) > [Go] add support for DictionaryArray > --

[jira] [Updated] (ARROW-12590) [C++][R] Update copies of Homebrew files to reflect recent updates

2022-01-14 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-12590: --- Fix Version/s: (was: 7.0.0) > [C++][R] Update copies of Homebrew files to reflect recent

[jira] [Updated] (ARROW-12590) [C++][R] Update copies of Homebrew files to reflect recent updates

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-12590: --- Labels: pull-request-available (was: ) > [C++][R] Update copies of Homebrew files to reflec

[jira] [Updated] (ARROW-15337) [Doc] New contributors guide updates

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-15337: - Description: index - do we need the hyperlink to the Arrow homepage on the first mention of Arro

[jira] [Created] (ARROW-15339) [Website] Add Skyhook blog post

2022-01-14 Thread David Li (Jira)
David Li created ARROW-15339: Summary: [Website] Add Skyhook blog post Key: ARROW-15339 URL: https://issues.apache.org/jira/browse/ARROW-15339 Project: Apache Arrow Issue Type: Improvement

[jira] [Comment Edited] (ARROW-14908) [R] join on dataset crashes on Windows

2022-01-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17475807#comment-17475807 ] Neal Richardson edited comment on ARROW-14908 at 1/14/22, 3:09 PM: ---

[jira] [Commented] (ARROW-14908) [R] join on dataset crashes on Windows

2022-01-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476195#comment-17476195 ] Neal Richardson commented on ARROW-14908: - Other angles [~jclark] and I explored

[jira] [Commented] (ARROW-13514) [JS] Update flatbuffers

2022-01-14 Thread Dominik Moritz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476190#comment-17476190 ] Dominik Moritz commented on ARROW-13514: Already done in https://github.com/apac

[jira] [Updated] (ARROW-11502) [C++] Optimize Arrow ByteStreamSplitDecode with Neon

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-11502: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Optimize Arrow ByteStream

[jira] [Updated] (ARROW-11776) [Java][Dataset] Support writing to files within dataset scanner via JNI

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-11776: Fix Version/s: 8.0.0 (was: 7.0.0) > [Java][Dataset] Support writing

[jira] [Commented] (ARROW-11776) [Java][Dataset] Support writing to files within dataset scanner via JNI

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11776?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476173#comment-17476173 ] Krisztian Szucs commented on ARROW-11776: - Postponing to 8.0 since the PR is in-

[jira] [Commented] (ARROW-13338) [C++][Dataset] Make async Scanner the default

2022-01-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476171#comment-17476171 ] David Li commented on ARROW-13338: -- ARROW-13554 effectively resolved this, right? [~wes

[jira] [Resolved] (ARROW-15325) [R] Fix CRAN comment on map_batches collect

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-15325. - Assignee: Neal Richardson (was: Will Jones) Resolution: Fixed > [R] Fix CRAN comm

[jira] [Updated] (ARROW-14679) [R] [C++] Handle suffix argument in joins

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-14679: Fix Version/s: 8.0.0 (was: 7.0.0) > [R] [C++] Handle suffix argumen

[jira] [Updated] (ARROW-15212) [C++] Handle suffix argument in joins

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-15212: Fix Version/s: 8.0.0 > [C++] Handle suffix argument in joins > ---

[jira] [Commented] (ARROW-14679) [R] [C++] Handle suffix argument in joins

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14679?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476166#comment-17476166 ] Krisztian Szucs commented on ARROW-14679: - Postponing it to 8.0 > [R] [C++] Han

[jira] [Commented] (ARROW-13514) [JS] Update flatbuffers

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476164#comment-17476164 ] Krisztian Szucs commented on ARROW-13514: - [~domoritz] Do you plan to resolve th

[jira] [Commented] (ARROW-12724) [C++] Add documentation for authoring compute kernels

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12724?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476163#comment-17476163 ] Krisztian Szucs commented on ARROW-12724: - Postponing to 8.0 > [C++] Add docume

[jira] [Updated] (ARROW-12724) [C++] Add documentation for authoring compute kernels

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-12724: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Add documentation for aut

[jira] [Commented] (ARROW-12723) [C++][Compute] GroupBy: add unittests for individual components of hash group by

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476162#comment-17476162 ] Krisztian Szucs commented on ARROW-12723: - [~michalno] is this still valid? > [

[jira] [Updated] (ARROW-12723) [C++][Compute] GroupBy: add unittests for individual components of hash group by

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-12723: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Compute] GroupBy: add uni

[jira] [Updated] (ARROW-12755) [C++][Compute] Add quotient and modulo kernels

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-12755: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Compute] Add quotient and

[jira] [Commented] (ARROW-12755) [C++][Compute] Add quotient and modulo kernels

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476161#comment-17476161 ] Krisztian Szucs commented on ARROW-12755: - Since the PR is in draft, I'm postpon

[jira] [Updated] (ARROW-13338) [C++][Dataset] Make async Scanner the default

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-13338: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Dataset] Make async Scann

[jira] [Commented] (ARROW-13338) [C++][Dataset] Make async Scanner the default

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476159#comment-17476159 ] Krisztian Szucs commented on ARROW-13338: - Postponing to 8.0. > [C++][Dataset]

[jira] [Assigned] (ARROW-12515) [Dev][Wiki][Release] Fix and update Windows RC verify script

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-12515: --- Assignee: Krisztian Szucs (was: Balazs Jeszenszky) > [Dev][Wiki][Release] Fix and

[jira] [Assigned] (ARROW-12515) [Dev][Wiki][Release] Fix and update Windows RC verify script

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-12515: --- Assignee: Ian Cook (was: Krisztian Szucs) > [Dev][Wiki][Release] Fix and update Wi

[jira] [Commented] (ARROW-10726) [Python] Reading multiple parquet files with different index column dtype (originating pandas) reads wrong data

2022-01-14 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476153#comment-17476153 ] Alenka Frim commented on ARROW-10726: - One thing I noticed was that _pq.read_table_

[jira] [Updated] (ARROW-14775) [JS] Embrace ESM in main arrow package

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-14775: Fix Version/s: 8.0.0 (was: 7.0.0) > [JS] Embrace ESM in main arrow

[jira] [Commented] (ARROW-14775) [JS] Embrace ESM in main arrow package

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476145#comment-17476145 ] Krisztian Szucs commented on ARROW-14775: - [~domoritz] Postponing it to 8.0, fee

[jira] [Updated] (ARROW-14725) [C++][Compute] Extract Expression simplification passes to an extensible registry

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-14725: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Compute] Extract Expressi

[jira] [Commented] (ARROW-14725) [C++][Compute] Extract Expression simplification passes to an extensible registry

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476143#comment-17476143 ] Krisztian Szucs commented on ARROW-14725: - Postponing to 8.0 since the PR is sti

[jira] [Updated] (ARROW-8221) [Python][Dataset] Expose schema inference / validation options in the factory

2022-01-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8221: - Fix Version/s: 8.0.0 > [Python][Dataset] Expose schema inference / validation opt

[jira] [Updated] (ARROW-8221) [Python][Dataset] Expose schema inference / validation options in the factory

2022-01-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8221: - Fix Version/s: (was: 7.0.0) > [Python][Dataset] Expose schema inference / val

[jira] [Resolved] (ARROW-15077) [Python] Move Expression class from _dataset to _compute cython module

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-15077. - Resolution: Fixed Issue resolved by pull request 11938 [https://github.com/apache/arrow/

[jira] [Assigned] (ARROW-13087) [R] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-13087: --- Assignee: Krisztian Szucs (was: Dewey Dunnington) > [R] Expose Parquet ArrowReader

[jira] [Assigned] (ARROW-13087) [R] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-13087: --- Assignee: Dewey Dunnington (was: Krisztian Szucs) > [R] Expose Parquet ArrowReader

[jira] [Updated] (ARROW-14233) [C++] Improve ExecPlan::ToString

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-14233: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++] Improve ExecPlan::ToStrin

[jira] [Commented] (ARROW-14233) [C++] Improve ExecPlan::ToString

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476131#comment-17476131 ] Krisztian Szucs commented on ARROW-14233: - Postponing it to 8.0 > [C++] Improve

[jira] [Assigned] (ARROW-12060) [Python] Enable calling compute functions on Expressions

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-12060: --- Assignee: Joris Van den Bossche (was: Krisztian Szucs) > [Python] Enable calling c

[jira] [Assigned] (ARROW-12060) [Python] Enable calling compute functions on Expressions

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-12060: --- Assignee: Krisztian Szucs (was: Joris Van den Bossche) > [Python] Enable calling c

[jira] [Commented] (ARROW-12060) [Python] Enable calling compute functions on Expressions

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476126#comment-17476126 ] Krisztian Szucs commented on ARROW-12060: - Postponing this to 8.0 > [Python] En

[jira] [Updated] (ARROW-12060) [Python] Enable calling compute functions on Expressions

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-12060: Fix Version/s: 8.0.0 (was: 7.0.0) > [Python] Enable calling compute

[jira] [Assigned] (ARROW-12480) [Java][Dataset] FileSystemDataset: Support reading from a directory

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-12480: --- Assignee: Krisztian Szucs (was: Hongze Zhang) > [Java][Dataset] FileSystemDataset:

[jira] [Assigned] (ARROW-12480) [Java][Dataset] FileSystemDataset: Support reading from a directory

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-12480: --- Assignee: Hongze Zhang (was: Krisztian Szucs) > [Java][Dataset] FileSystemDataset:

[jira] [Commented] (ARROW-15089) [C++] Add compute kernel to get MapArray value for given key

2022-01-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476125#comment-17476125 ] David Li commented on ARROW-15089: -- * MapArray is of type MapType. Types are not values

[jira] [Resolved] (ARROW-15076) [C++][Gandiva][CI] Test failure/crash on fedora-cpp

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-15076. - Resolution: Fixed Issue resolved by pull request 12146 [https://github.com/apache/arrow/

[jira] [Commented] (ARROW-12358) [C++][Python][R][Dataset] Control overwriting vs appending when writing to existing dataset

2022-01-14 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476120#comment-17476120 ] Lance Dacey commented on ARROW-12358: - Ah, so it must be related to the filesystem.

[jira] [Resolved] (ARROW-15326) [CI][Gandiva] Ubuntu release build is failing with failing Gandiva tests

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-15326. - Resolution: Fixed Issue resolved by pull request 12145 [https://github.com/apache/arrow/

[jira] [Comment Edited] (ARROW-15259) [Java] [Benchmarking] Large stdout when running benchmarks

2022-01-14 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476112#comment-17476112 ] Liya Fan edited comment on ARROW-15259 at 1/14/22, 12:40 PM: -

[jira] [Commented] (ARROW-15259) [Java] [Benchmarking] Large stdout when running benchmarks

2022-01-14 Thread Liya Fan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476112#comment-17476112 ] Liya Fan commented on ARROW-15259: -- Sorry [~el...@ursacomputing.com] I am having some p

[jira] [Updated] (ARROW-15095) [Dev][Website] Changelog generation should use commit messages

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15095: --- Labels: pull-request-available (was: ) > [Dev][Website] Changelog generation should use com

[jira] [Commented] (ARROW-15239) [C++][Compute] Introduce Bloom filters to hash join

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476100#comment-17476100 ] Krisztian Szucs commented on ARROW-15239: - Since the PR is in draft I'm postponi

[jira] [Updated] (ARROW-15239) [C++][Compute] Introduce Bloom filters to hash join

2022-01-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-15239: Fix Version/s: 8.0.0 (was: 7.0.0) > [C++][Compute] Introduce Bloom

[jira] [Commented] (ARROW-14821) [R] Implement bindings for lubridate's floor_date, ceiling_date, and round_date

2022-01-14 Thread Danielle Navarro (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476086#comment-17476086 ] Danielle Navarro commented on ARROW-14821: -- Okay there's a first pass at this h

[jira] [Comment Edited] (ARROW-15089) [C++] Add compute kernel to get MapArray value for given key

2022-01-14 Thread Dhruv Vats (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476066#comment-17476066 ] Dhruv Vats edited comment on ARROW-15089 at 1/14/22, 11:18 AM: ---

[jira] [Commented] (ARROW-15089) [C++] Add compute kernel to get MapArray value for given key

2022-01-14 Thread Dhruv Vats (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476066#comment-17476066 ] Dhruv Vats commented on ARROW-15089: I thought to ask this before I got too confused

[jira] [Comment Edited] (ARROW-15089) [C++] Add compute kernel to get MapArray value for given key

2022-01-14 Thread Dhruv Vats (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476066#comment-17476066 ] Dhruv Vats edited comment on ARROW-15089 at 1/14/22, 10:45 AM: ---

[jira] [Assigned] (ARROW-15338) Add `pyarrow.orc.read_table` API

2022-01-14 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned ARROW-15338: - Assignee: Dongjoon Hyun > Add `pyarrow.orc.read_table` API > --

[jira] [Updated] (ARROW-15338) Add `pyarrow.orc.read_table` API

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15338: --- Labels: pull-request-available (was: ) > Add `pyarrow.orc.read_table` API > ---

[jira] [Created] (ARROW-15338) Add `pyarrow.orc.read_table` API

2022-01-14 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created ARROW-15338: - Summary: Add `pyarrow.orc.read_table` API Key: ARROW-15338 URL: https://issues.apache.org/jira/browse/ARROW-15338 Project: Apache Arrow Issue Type: Improve

[jira] [Comment Edited] (ARROW-15123) [R] CSV dataset file header read in as data

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476001#comment-17476001 ] Nicola Crane edited comment on ARROW-15123 at 1/14/22, 9:24 AM: --

[jira] [Comment Edited] (ARROW-15123) [R] CSV dataset file header read in as data

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476025#comment-17476025 ] Nicola Crane edited comment on ARROW-15123 at 1/14/22, 9:23 AM: --

[jira] [Updated] (ARROW-15337) [Doc] New contributors guide updates

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-15337: - Component/s: Documentation > [Doc] New contributors guide updates >

[jira] [Created] (ARROW-15337) [Doc] New contributors guide updates

2022-01-14 Thread Nicola Crane (Jira)
Nicola Crane created ARROW-15337: Summary: [Doc] New contributors guide updates Key: ARROW-15337 URL: https://issues.apache.org/jira/browse/ARROW-15337 Project: Apache Arrow Issue Type: Sub-t

[jira] [Updated] (ARROW-15123) [R] CSV dataset file header read in as data

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-15123: - Summary: [R] CSV dataset file header read in as data (was: [R] Dataset file header read in as d

[jira] [Updated] (ARROW-15123) [R] Dataset file header read in as data

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicola Crane updated ARROW-15123: - Summary: [R] Dataset file header read in as data (was: [R] Schema order not respected and file

[jira] [Commented] (ARROW-15123) [R] Schema order not respected and file header ignored

2022-01-14 Thread Nicola Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17476025#comment-17476025 ] Nicola Crane commented on ARROW-15123: -- [~ndefriesJIRA] In the short-term, you coul

[jira] [Updated] (ARROW-15123) [R] Schema order not respected and file header ignored

2022-01-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15123: --- Labels: pull-request-available schema (was: schema) > [R] Schema order not respected and fi

  1   2   >