[jira] [Updated] (ARROW-11084) [Rust] Clippy failing in master

2020-12-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11084: --- Labels: pull-request-available (was: ) > [Rust] Clippy failing in master >

[jira] [Created] (ARROW-11084) [Rust] Clippy failing in master

2020-12-30 Thread Jorge Leitão (Jira)
Jorge Leitão created ARROW-11084: Summary: [Rust] Clippy failing in master Key: ARROW-11084 URL: https://issues.apache.org/jira/browse/ARROW-11084 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-11083) [CI] Build "Source Release and Merge Script" is broken

2020-12-30 Thread Jorge Leitão (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão updated ARROW-11083: - Priority: Major (was: Minor) > [CI] Build "Source Release and Merge Script" is broken >

[jira] [Updated] (ARROW-11083) [CI] Build "Source Release and Merge Script" is broken

2020-12-30 Thread Jorge Leitão (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão updated ARROW-11083: - Issue Type: Bug (was: Task) > [CI] Build "Source Release and Merge Script" is broken >

[jira] [Created] (ARROW-11083) [CI] Build "Source Release and Merge Script" is broken

2020-12-30 Thread Jorge Leitão (Jira)
Jorge Leitão created ARROW-11083: Summary: [CI] Build "Source Release and Merge Script" is broken Key: ARROW-11083 URL: https://issues.apache.org/jira/browse/ARROW-11083 Project: Apache Arrow

[jira] [Commented] (ARROW-6697) [Rust] [DataFusion] Validate that all parquet partitions have the same schema

2020-12-30 Thread Ruihang Xia (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256866#comment-17256866 ] Ruihang Xia commented on ARROW-6697: No problem. I have looked at that PR, your implementation is

[jira] [Updated] (ARROW-11082) [Rust] Add FFI for LargeUtf8

2020-12-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11082: --- Labels: pull-request-available (was: ) > [Rust] Add FFI for LargeUtf8 >

[jira] [Created] (ARROW-11082) [Rust] Add FFI for LargeUtf8

2020-12-30 Thread Jorge Leitão (Jira)
Jorge Leitão created ARROW-11082: Summary: [Rust] Add FFI for LargeUtf8 Key: ARROW-11082 URL: https://issues.apache.org/jira/browse/ARROW-11082 Project: Apache Arrow Issue Type: New Feature

[jira] [Created] (ARROW-11081) [Java] Make IPC option immutable

2020-12-30 Thread Liya Fan (Jira)
Liya Fan created ARROW-11081: Summary: [Java] Make IPC option immutable Key: ARROW-11081 URL: https://issues.apache.org/jira/browse/ARROW-11081 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-11044) [C++] Add "replace" kernel

2020-12-30 Thread Bruno LE HYARIC (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruno LE HYARIC updated ARROW-11044: Description: Purpose a "replace" compute kernel which could fulfil ARROW-10641 - [C++] A

[jira] [Updated] (ARROW-10668) [R] Filtering does not work with .data pronoun

2020-12-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10668: --- Labels: pull-request-available (was: ) > [R] Filtering does not work with .data pronoun >

[jira] [Assigned] (ARROW-9572) [CI][Homebrew] Properly enable Gandiva and improve testing

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9572: -- Assignee: (was: Neal Richardson) > [CI][Homebrew] Properly enable Gandiva and

[jira] [Resolved] (ARROW-11079) [R] Catch up on changelog since 2.0

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11079. - Resolution: Fixed Issue resolved by pull request 9050

[jira] [Created] (ARROW-11080) [C++][Dataset] Improvements to implicit casting

2020-12-30 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11080: --- Summary: [C++][Dataset] Improvements to implicit casting Key: ARROW-11080 URL: https://issues.apache.org/jira/browse/ARROW-11080 Project: Apache Arrow

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256729#comment-17256729 ] Weston Pace commented on ARROW-11067: - That appears to have fixed the issue. > [R] read_csv_arrow

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256727#comment-17256727 ] Weston Pace commented on ARROW-11067: - Yes, I will try that real quick. > [R] read_csv_arrow

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256724#comment-17256724 ] Antoine Pitrou commented on ARROW-11067: Nevermind, I've found a good candidate for the culprit:

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256720#comment-17256720 ] Antoine Pitrou commented on ARROW-11067: Ok, can you play with the various conversion options

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256719#comment-17256719 ] Neal Richardson commented on ARROW-11067: - C++ pretty print. Also {{is.na}} dispatches to the

[jira] [Commented] (ARROW-10773) [R] parallel as.data.frame.Table hangs indefinitely on Windows

2020-12-30 Thread Bruno Tremblay (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256716#comment-17256716 ] Bruno Tremblay commented on ARROW-10773: I've disabled threading on windows for the moment. I do

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256717#comment-17256717 ] Antoine Pitrou commented on ARROW-11067: To be clear, printing calls the C++ Arrow pretty-print

[jira] [Updated] (ARROW-10773) [R] parallel as.data.frame.Table hangs indefinitely on Windows

2020-12-30 Thread Bruno Tremblay (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruno Tremblay updated ARROW-10773: --- Description: On Windows only Tested on 2 machines, mingw.  Reprex {code:java}

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256715#comment-17256715 ] Neal Richardson commented on ARROW-11067: - The print method for Table doesn't print the values.

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256707#comment-17256707 ] Antoine Pitrou commented on ARROW-11067: Also, if you display {{tab}} directly without going

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256703#comment-17256703 ] Antoine Pitrou commented on ARROW-11067: Do you get the same results when you pass the

[jira] [Commented] (ARROW-11006) [Python] Array to_numpy slow compared to Numpy.view

2020-12-30 Thread Paul Balanca (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256676#comment-17256676 ] Paul Balanca commented on ARROW-11006: -- Thanks, I wanted to make sure any work I start would not

[jira] [Updated] (ARROW-11079) [R] Catch up on changelog since 2.0

2020-12-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11079: --- Labels: pull-request-available (was: ) > [R] Catch up on changelog since 2.0 >

[jira] [Created] (ARROW-11079) [R] Catch up on changelog since 2.0

2020-12-30 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-11079: --- Summary: [R] Catch up on changelog since 2.0 Key: ARROW-11079 URL: https://issues.apache.org/jira/browse/ARROW-11079 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-10372) [C++][Dataset] Read compressed CSVs

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10372: Fix Version/s: (was: 3.0.0) 4.0.0 > [C++][Dataset] Read compressed

[jira] [Assigned] (ARROW-10834) [R] Print method fails for SubTreeFileSystem

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-10834: --- Assignee: Ian Cook (was: Jonathan Keane) > [R] Print method fails for

[jira] [Updated] (ARROW-8748) [R] Add bindings to ConcatenateTables

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8748: --- Fix Version/s: (was: 3.0.0) > [R] Add bindings to ConcatenateTables >

[jira] [Updated] (ARROW-11029) [Rust] [DataFusion] Document why join order optimization does not work with filter pushdown

2020-12-30 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-11029: --- Issue Type: Improvement (was: Bug) > [Rust] [DataFusion] Document why join order optimization does

[jira] [Updated] (ARROW-11029) [Rust] [DataFusion] Document why join order optimization does not work with filter pushdown

2020-12-30 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-11029: --- Summary: [Rust] [DataFusion] Document why join order optimization does not work with filter

[jira] [Closed] (ARROW-11053) [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches

2020-12-30 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-11053. -- Resolution: Won't Fix > [Rust] [DataFusion] Optimize joins with dynamic capacity for output batches >

[jira] [Commented] (ARROW-11058) [Rust] [DataFusion] Implement "coalesce batches" operator

2020-12-30 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256640#comment-17256640 ] Andy Grove commented on ARROW-11058: I can see your argument [~jorgecarleitao] I can see how we

[jira] [Created] (ARROW-11078) [C++] Implement autocasting in arithmetic compute kernels

2020-12-30 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-11078: -- Summary: [C++] Implement autocasting in arithmetic compute kernels Key: ARROW-11078 URL: https://issues.apache.org/jira/browse/ARROW-11078 Project: Apache Arrow

[jira] [Updated] (ARROW-11069) Parquet writer incorrect data being written when data type is dictionary

2020-12-30 Thread Palash Goel (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Palash Goel updated ARROW-11069: Description: When writing a dict column using pyarrow.    {code:python} import pandas as pd

[jira] [Commented] (ARROW-10773) [R] parallel as.data.frame.Table hangs indefinitely on Windows

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256627#comment-17256627 ] Neal Richardson commented on ARROW-10773: - [~meztez] is this still an issue for you? > [R]

[jira] [Closed] (ARROW-10730) [R] Installation fails on Fedora 32

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-10730. --- Assignee: Neal Richardson Resolution: Cannot Reproduce We've made some improvements

[jira] [Resolved] (ARROW-9903) [R] open_dataset freezes opening feather files on Windows

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-9903. Fix Version/s: 3.0.0 Assignee: Ben Kietzman Resolution: Fixed > [R]

[jira] [Assigned] (ARROW-10463) [R] Better messaging for currently unsupported CSV options in open_dataset

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-10463: --- Fix Version/s: 3.0.0 Assignee: Ian Cook Summary: [R] Better

[jira] [Updated] (ARROW-10470) [R] Fix missing file error causing NYC taxi example to fail

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10470: Fix Version/s: 3.0.0 > [R] Fix missing file error causing NYC taxi example to fail >

[jira] [Assigned] (ARROW-10470) [R] Fix missing file error causing NYC taxi example to fail

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-10470: --- Assignee: Ian Cook > [R] Fix missing file error causing NYC taxi example to fail

[jira] [Resolved] (ARROW-10733) [R] Improvements to Linux installation troubleshooting

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-10733. - Resolution: Fixed Issue resolved by pull request 9034

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] HashJoinExec slow with many batches

2020-12-30 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256609#comment-17256609 ] Daniël Heres commented on ARROW-11030: -- [~andygrove], interesting take, will check it out! Doesn't

[jira] [Commented] (ARROW-11077) [Rust] ParquetFileArrowReader panicks when trying to read nested list

2020-12-30 Thread Ben Sully (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256607#comment-17256607 ] Ben Sully commented on ARROW-11077: --- I've attached a small Parquet file to allow for a repro. This

[jira] [Updated] (ARROW-11077) [Rust] ParquetFileArrowReader panicks when trying to read nested list

2020-12-30 Thread Ben Sully (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Sully updated ARROW-11077: -- Attachment: small-nested-lists.parquet > [Rust] ParquetFileArrowReader panicks when trying to read

[jira] [Updated] (ARROW-11077) [Rust] ParquetFileArrowReader panicks when trying to read nested list

2020-12-30 Thread Ben Sully (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Sully updated ARROW-11077: -- Summary: [Rust] ParquetFileArrowReader panicks when trying to read nested list (was:

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256605#comment-17256605 ] Neal Richardson commented on ARROW-11067: - I can reproduce this on macOS 10.14 / R 3.6, as well

[jira] [Resolved] (ARROW-11058) [Rust] [DataFusion] Implement "coalesce batches" operator

2020-12-30 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-11058. Resolution: Fixed Issue resolved by pull request 9043 [https://github.com/apache/arrow/pull/9043]

[jira] [Created] (ARROW-11077) ParquetFileArrowReader panicks when trying to read nested list

2020-12-30 Thread Ben Sully (Jira)
Ben Sully created ARROW-11077: - Summary: ParquetFileArrowReader panicks when trying to read nested list Key: ARROW-11077 URL: https://issues.apache.org/jira/browse/ARROW-11077 Project: Apache Arrow

[jira] [Resolved] (ARROW-10416) [R] Support Tables in Flight

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-10416. - Resolution: Fixed Issue resolved by pull request 9039

[jira] [Resolved] (ARROW-11050) [R] Handle RecordBatch in write_parquet

2020-12-30 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11050. - Resolution: Fixed Issue resolved by pull request 9033

[jira] [Updated] (ARROW-11076) [Rust][DataFusion] Refactor usage of right indices in hash join

2020-12-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11076: --- Labels: pull-request-available (was: ) > [Rust][DataFusion] Refactor usage of right

[jira] [Comment Edited] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256576#comment-17256576 ] John Sheffield edited comment on ARROW-11067 at 12/30/20, 4:07 PM: ---

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread John Sheffield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256576#comment-17256576 ] John Sheffield commented on ARROW-11067: Hm, the plot thickens. I just replicated Weston's

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] HashJoinExec slow with many batches

2020-12-30 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256575#comment-17256575 ] Andy Grove commented on ARROW-11030: I just did an experiment where I concatenated all the batches

[jira] [Commented] (ARROW-11067) [R] read_csv_arrow silently fails to read some strings and returns nulls

2020-12-30 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11067?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256566#comment-17256566 ] Weston Pace commented on ARROW-11067: - Hmm...there might be something more at play here.  I'm not

[jira] [Created] (ARROW-11076) [Rust][DataFusion] Refactor usage of right indices in hash join

2020-12-30 Thread Daniël Heres (Jira)
Daniël Heres created ARROW-11076: Summary: [Rust][DataFusion] Refactor usage of right indices in hash join Key: ARROW-11076 URL: https://issues.apache.org/jira/browse/ARROW-11076 Project: Apache

[jira] [Created] (ARROW-11075) Getting reference not found with OCR enabled pyarrow

2020-12-30 Thread Kandarpa (Jira)
Kandarpa created ARROW-11075: Summary: Getting reference not found with OCR enabled pyarrow Key: ARROW-11075 URL: https://issues.apache.org/jira/browse/ARROW-11075 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-11044) [C++] Add "replace" kernel

2020-12-30 Thread Bruno LE HYARIC (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruno LE HYARIC updated ARROW-11044: Description: Purpose a "replace" compute kernel which could fulfil ARROW-10641 - [C++] A

[jira] [Updated] (ARROW-11044) [C++] Add "replace" kernel

2020-12-30 Thread Bruno LE HYARIC (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruno LE HYARIC updated ARROW-11044: Description: Purpose a "replace" compute kernel which could fulfil ARROW-10641 - [C++] A

[jira] [Updated] (ARROW-11044) [C++] Add "replace" kernel

2020-12-30 Thread Bruno LE HYARIC (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruno LE HYARIC updated ARROW-11044: Description: Purpose a "replace" compute kernel which could fulfil ARROW-10641 - [C++] A

[jira] [Created] (ARROW-11074) [Rust][DataFusion] Implement predicate push-down for parquet tables

2020-12-30 Thread Yordan Pavlov (Jira)
Yordan Pavlov created ARROW-11074: - Summary: [Rust][DataFusion] Implement predicate push-down for parquet tables Key: ARROW-11074 URL: https://issues.apache.org/jira/browse/ARROW-11074 Project:

[jira] [Resolved] (ARROW-11073) [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-11073. - Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 9046

[jira] [Updated] (ARROW-11073) [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11073: Description: Rustfmt error was introduced in this PR:

[jira] [Updated] (ARROW-11073) [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11073: Component/s: Rust > [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs >

[jira] [Updated] (ARROW-11073) [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs

2020-12-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11073: --- Labels: pull-request-available (was: ) > [Rust] Lint Error on CI Tests in

[jira] [Updated] (ARROW-11073) [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11073: Summary: [Rust] Lint Error on CI Tests in /arrow/rust/arrow/src/ipc/reader.rs (was: [Rust] Lint

[jira] [Updated] (ARROW-11073) [Rust] Lint Error on CI Tests

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11073: Description: Rustfmt error was introduced in this PR:

[jira] [Updated] (ARROW-11073) [Rust] Lint Error on CI Tests

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11073: Description: Rustfmt error was introduced in this PR:

[jira] [Created] (ARROW-11073) [Rust] Lint Error on CI Tests

2020-12-30 Thread Andrew Lamb (Jira)
Andrew Lamb created ARROW-11073: --- Summary: [Rust] Lint Error on CI Tests Key: ARROW-11073 URL: https://issues.apache.org/jira/browse/ARROW-11073 Project: Apache Arrow Issue Type: Bug

[jira] [Commented] (ARROW-11068) [Rust] [DataFusion] Wrap more operators in CoalesceBatchExec

2020-12-30 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256477#comment-17256477 ] Andrew Lamb commented on ARROW-11068: - I have some suggestions here:

[jira] [Created] (ARROW-11072) Support int32 and int64 physical types

2020-12-30 Thread Florian Müller (Jira)
Florian Müller created ARROW-11072: -- Summary: Support int32 and int64 physical types Key: ARROW-11072 URL: https://issues.apache.org/jira/browse/ARROW-11072 Project: Apache Arrow Issue

[jira] [Comment Edited] (ARROW-11030) [Rust] [DataFusion] HashJoinExec slow with many batches

2020-12-30 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256456#comment-17256456 ] Daniël Heres edited comment on ARROW-11030 at 12/30/20, 11:10 AM: -- So

[jira] [Commented] (ARROW-11030) [Rust] [DataFusion] HashJoinExec slow with many batches

2020-12-30 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256456#comment-17256456 ] Daniël Heres commented on ARROW-11030: -- So in summary, what looks like is the problem here: * for

[jira] [Comment Edited] (ARROW-11030) [Rust] [DataFusion] HashJoinExec slow with many batches

2020-12-30 Thread Daniël Heres (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256456#comment-17256456 ] Daniël Heres edited comment on ARROW-11030 at 12/30/20, 11:09 AM: -- So

[jira] [Commented] (ARROW-9480) [Rust] [DataFusion] All DataFusion execution plan traits should require Send + Sync

2020-12-30 Thread Ben Sully (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256417#comment-17256417 ] Ben Sully commented on ARROW-9480: -- (note that adding `Send + Sync` to the `DataFrame` and

[jira] [Commented] (ARROW-9480) [Rust] [DataFusion] All DataFusion execution plan traits should require Send + Sync

2020-12-30 Thread Ben Sully (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17256415#comment-17256415 ] Ben Sully commented on ARROW-9480: -- I'm running into this issue when experimenting with running a