[jira] [Created] (ARROW-9455) Request: add option for taking all columns from all files in pa.dataset

2020-07-14 Thread David Cortes (Jira)
David Cortes created ARROW-9455: --- Summary: Request: add option for taking all columns from all files in pa.dataset Key: ARROW-9455 URL: https://issues.apache.org/jira/browse/ARROW-9455 Project: Apache A

[jira] [Updated] (ARROW-9388) [C++] Division kernels

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9388: -- Labels: compute pull-request-available (was: compute) > [C++] Division kernels > -

[jira] [Assigned] (ARROW-9388) [C++] Division kernels

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9388: Assignee: Liya Fan (was: Apache Arrow JIRA Bot) > [C++] Division kernels

[jira] [Assigned] (ARROW-9388) [C++] Division kernels

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9388: Assignee: Apache Arrow JIRA Bot (was: Liya Fan) > [C++] Division kernels

[jira] [Updated] (ARROW-9455) [Python] add option for taking all columns from all files in pa.dataset

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9455: - Summary: [Python] add option for taking all columns from all files in pa.dataset

[jira] [Commented] (ARROW-9420) [Rust][DataFusion] Add repartition/shuffle plan

2020-07-14 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157211#comment-17157211 ] Jorge commented on ARROW-9420: -- Thanks a lot for the clarification. Makes a lot of sense. I

[jira] [Created] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9456: --- Summary: [Python] Dataset segfault when not importing pyarrow.parquet Key: ARROW-9456 URL: https://issues.apache.org/jira/browse/ARROW-9456 Project: Apache Arr

[jira] [Commented] (ARROW-9455) [Python] add option for taking all columns from all files in pa.dataset

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157213#comment-17157213 ] Joris Van den Bossche commented on ARROW-9455: -- [~david-cortes] thanks for t

[jira] [Closed] (ARROW-9455) [Python] add option for taking all columns from all files in pa.dataset

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-9455. Resolution: Duplicate > [Python] add option for taking all columns from all files i

[jira] [Commented] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157216#comment-17157216 ] Antoine Pitrou commented on ARROW-9456: --- cc [~jorisvandenbossche] > [Python] Data

[jira] [Updated] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9456: -- Fix Version/s: 1.0.0 > [Python] Dataset segfault when not importing pyarrow.parquet >

[jira] [Updated] (ARROW-9390) [C++] Review compute function names

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9390: -- Description: We should probably make compute function naming more consistent while it's not to

[jira] [Commented] (ARROW-9426) [CI] Maybe redundant 'entry' key in .pre-commit-config.yaml

2020-07-14 Thread FredGan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157221#comment-17157221 ] FredGan commented on ARROW-9426: Sorry that it's not redundant. But two same "entry" key

[jira] [Commented] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157224#comment-17157224 ] Joris Van den Bossche commented on ARROW-9456: -- [~maartenbreddels] do you se

[jira] [Created] (ARROW-9457) [C++] TableReader support protobuf

2020-07-14 Thread Shuai Zhang (Jira)
Shuai Zhang created ARROW-9457: -- Summary: [C++] TableReader support protobuf Key: ARROW-9457 URL: https://issues.apache.org/jira/browse/ARROW-9457 Project: Apache Arrow Issue Type: New Feature

[jira] [Created] (ARROW-9459) [C++][Dataset] Make collecting/parsing statistics optional for ParquetFragment

2020-07-14 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9459: Summary: [C++][Dataset] Make collecting/parsing statistics optional for ParquetFragment Key: ARROW-9459 URL: https://issues.apache.org/jira/browse/ARROW-9459

[jira] [Created] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9458: --- Summary: [Python] Dataset singlethreaded only Key: ARROW-9458 URL: https://issues.apache.org/jira/browse/ARROW-9458 Project: Apache Arrow Issue Type: B

[jira] [Updated] (ARROW-9459) [C++][Dataset] Make collecting/parsing statistics optional for ParquetFragment

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9459: - Description: See some timing checks here: https://github.com/dask/dask/pull/6346

[jira] [Commented] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157231#comment-17157231 ] Maarten Breddels commented on ARROW-9456: - This file gives me the same problem {c

[jira] [Commented] (ARROW-9444) [C++][Doc] Undocumented compute functions (string_isalpha, etc.)

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157235#comment-17157235 ] Maarten Breddels commented on ARROW-9444: - Feel free to assign to me, I didn't kn

[jira] [Commented] (ARROW-9459) [C++][Dataset] Make collecting/parsing statistics optional for ParquetFragment

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157240#comment-17157240 ] Joris Van den Bossche commented on ARROW-9459: -- One question is, if we do th

[jira] [Updated] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9450: -- Fix Version/s: (was: 2.0.0) 1.0.0 > [Python] "pytest pyarrow" takes over

[jira] [Commented] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157247#comment-17157247 ] Antoine Pitrou commented on ARROW-9450: --- This is a major annoyance for development.

[jira] [Commented] (ARROW-9171) [C++] Comments in FindArrow.cmake misleading

2020-07-14 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157248#comment-17157248 ] Uwe Korn commented on ARROW-9171: - I would be able to squeeze this in but I have the feel

[jira] [Assigned] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9450: - Assignee: Antoine Pitrou > [Python] "pytest pyarrow" takes over 10 seconds to collect te

[jira] [Updated] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9450: -- Labels: pull-request-available (was: ) > [Python] "pytest pyarrow" takes over 10 seconds to co

[jira] [Assigned] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9450: Assignee: Antoine Pitrou (was: Apache Arrow JIRA Bot) > [Python] "pytest

[jira] [Assigned] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9450: Assignee: Apache Arrow JIRA Bot (was: Antoine Pitrou) > [Python] "pytest

[jira] [Assigned] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9456: - Assignee: Antoine Pitrou > [Python] Dataset segfault when not importing pyarrow.parquet

[jira] [Assigned] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9456: - Assignee: (was: Antoine Pitrou) > [Python] Dataset segfault when not importing pyarr

[jira] [Resolved] (ARROW-9437) [Python][Packaging] Homebrew fails to install build dependencies in the macOS wheel builds

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9437. Resolution: Fixed Issue resolved by pull request 7728 [https://github.com/apache/arrow/pull

[jira] [Created] (ARROW-9460) [C++] BinaryContainsExact doesn't cope with double characters in the pattern

2020-07-14 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-9460: --- Summary: [C++] BinaryContainsExact doesn't cope with double characters in the pattern Key: ARROW-9460 URL: https://issues.apache.org/jira/browse/ARROW-9460 Project: Apache Arro

[jira] [Commented] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157268#comment-17157268 ] Joris Van den Bossche commented on ARROW-9456: -- Are you able to reproduce it

[jira] [Commented] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157269#comment-17157269 ] Joris Van den Bossche commented on ARROW-9456: -- Also can't reproduce it when

[jira] [Resolved] (ARROW-9379) [Rust] Support unsigned dictionary indices

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9379. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7745 [https://

[jira] [Updated] (ARROW-9379) [Rust] Support unsigned dictionary indices

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9379: -- Labels: pull-request-available (was: ) > [Rust] Support unsigned dictionary indices >

[jira] [Updated] (ARROW-9460) [C++] BinaryContainsExact doesn't cope with double characters in the pattern

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9460: -- Labels: pull-request-available (was: ) > [C++] BinaryContainsExact doesn't cope with double ch

[jira] [Assigned] (ARROW-9379) [Rust] Support unsigned dictionary indices

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-9379: -- Assignee: Bobby Wagner > [Rust] Support unsigned dictionary indices >

[jira] [Commented] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157293#comment-17157293 ] Maarten Breddels commented on ARROW-9456: - Note that you should not run the vaex

[jira] [Closed] (ARROW-9456) [Python] Dataset segfault when not importing pyarrow.parquet

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maarten Breddels closed ARROW-9456. --- Resolution: Not A Bug > [Python] Dataset segfault when not importing pyarrow.parquet > -

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157306#comment-17157306 ] Joris Van den Bossche commented on ARROW-9458: -- That it doesn't do this in p

[jira] [Comment Edited] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157306#comment-17157306 ] Joris Van den Bossche edited comment on ARROW-9458 at 7/14/20, 11:50 AM: --

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157308#comment-17157308 ] Joris Van den Bossche commented on ARROW-9458: -- Ah, and forgot to note: we a

[jira] [Created] (ARROW-9461) [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch

2020-07-14 Thread Jorge (Jira)
Jorge created ARROW-9461: Summary: [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch Key: ARROW-9461 URL: https://issues.apache.org/jira/browse/ARROW-9461 Project: Apache

[jira] [Updated] (ARROW-9461) [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9461: -- Labels: pull-request-available (was: ) > [Rust] Reading Date32 and Date64 errors - they are in

[jira] [Assigned] (ARROW-9461) [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9461: Assignee: Apache Arrow JIRA Bot (was: Jorge) > [Rust] Reading Date32 and

[jira] [Assigned] (ARROW-9461) [Rust] Reading Date32 and Date64 errors - they are incorrectly converted to RecordBatch

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9461: Assignee: Jorge (was: Apache Arrow JIRA Bot) > [Rust] Reading Date32 and

[jira] [Updated] (ARROW-8506) [c++] Miss tests to verify expected_buffer with bit_width > 8 in RLE

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8506: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [c++] Miss tests to verify expect

[jira] [Updated] (ARROW-8499) [C++][Dataset] In ScannerBuilder, batch_size will not work if projecter is not empty

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8499: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [C++][Dataset] In ScannerBuilder,

[jira] [Updated] (ARROW-8496) [C++] Refine ByteStreamSplitDecodeScalar

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8496: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [C++] Refine ByteStreamSplitDecod

[jira] [Updated] (ARROW-8360) [C++][Gandiva] Fixes date32 support for date/time functions

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8360: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [C++][Gandiva] Fixes date32 suppo

[jira] [Updated] (ARROW-8467) [C++] Test cases using ArrayFromJSON assume only a little-endian platform

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8467: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [C++] Test cases using ArrayFromJ

[jira] [Updated] (ARROW-8511) [Developer][Release] Windows release verification script does not halt if C++ compilation fails

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8511: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [Developer][Release] Windows rele

[jira] [Updated] (ARROW-8515) [C++] Bitmap ToString should have an option of grouping by bytes

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8515: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [C++] Bitmap ToString should have

[jira] [Updated] (ARROW-8517) [Developer][Release] Update Crossbow RC verification setup for changes since 0.16.0

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8517: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [Developer][Release] Update Cross

[jira] [Updated] (ARROW-8696) [Java] Convert tests to integration tests

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8696: --- Fix Version/s: 1.0.0 > [Java] Convert tests to integration tests > --

[jira] [Updated] (ARROW-9000) [Java] build crashes with JDK14

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9000: --- Fix Version/s: 1.0.0 > [Java] build crashes with JDK14 > --- > >

[jira] [Updated] (ARROW-8972) [Java] Support range value comparison for large varchar/varbinary vectors

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8972: --- Fix Version/s: 1.0.0 > [Java] Support range value comparison for large varchar/varbinary vect

[jira] [Updated] (ARROW-8443) [Gandiva][C++] Fix round/truncate to no-op for special cases

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8443: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [Gandiva][C++] Fix round/truncate

[jira] [Updated] (ARROW-8230) [Java] Move Netty memory manager into a separate module

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8230: --- Fix Version/s: 1.0.0 > [Java] Move Netty memory manager into a separate module >

[jira] [Updated] (ARROW-7955) [Java] Support large buffer for file/stream IPC

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-7955: --- Fix Version/s: 1.0.0 > [Java] Support large buffer for file/stream IPC >

[jira] [Resolved] (ARROW-9448) [Java] Circular initialization between ArrowBuf and BaseAllocator leads to null HistoricalLog for empty buffer

2020-07-14 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-9448. - Resolution: Fixed Issue resolved by pull request 7742 [https://github.com/apache/arrow/pull/7742] > [Jav

[jira] [Updated] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maarten Breddels updated ARROW-9458: Attachment: image-2020-07-14-14-31-29-943.png > [Python] Dataset singlethreaded only >

[jira] [Updated] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maarten Breddels updated ARROW-9458: Attachment: image-2020-07-14-14-38-16-767.png > [Python] Dataset singlethreaded only >

[jira] [Created] (ARROW-9462) [Go] The Indentation after the first Record arrjson writer is missing

2020-07-14 Thread FredGan (Jira)
FredGan created ARROW-9462: -- Summary: [Go] The Indentation after the first Record arrjson writer is missing Key: ARROW-9462 URL: https://issues.apache.org/jira/browse/ARROW-9462 Project: Apache Arrow

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157338#comment-17157338 ] Maarten Breddels commented on ARROW-9458: -   Running this (now with all columns)

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157337#comment-17157337 ] Joris Van den Bossche commented on ARROW-9458: -- [~maartenbreddels] how big a

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157340#comment-17157340 ] Maarten Breddels commented on ARROW-9458: - Did you set ? batch_size=1_000_000 >

[jira] [Updated] (ARROW-9462) [Go] The Indentation after the first Record arrjson writer is missing

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9462: -- Labels: pull-request-available (was: ) > [Go] The Indentation after the first Record arrjson w

[jira] [Created] (ARROW-9463) [Go] The writer is double closed in TestReadWrite

2020-07-14 Thread FredGan (Jira)
FredGan created ARROW-9463: -- Summary: [Go] The writer is double closed in TestReadWrite Key: ARROW-9463 URL: https://issues.apache.org/jira/browse/ARROW-9463 Project: Apache Arrow Issue Type: Test

[jira] [Updated] (ARROW-9463) [Go] The writer is double closed in TestReadWrite

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9463: -- Labels: pull-request-available (was: ) > [Go] The writer is double closed in TestReadWrite > -

[jira] [Updated] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-9464: -- Description: I would like to propose a refactor of the physical/execution planning based on the experi

[jira] [Created] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
Andy Grove created ARROW-9464: - Summary: [Rust] [DataFusion] Physical plan refactor to support async and optimization rules Key: ARROW-9464 URL: https://issues.apache.org/jira/browse/ARROW-9464 Project: A

[jira] [Updated] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-9464: -- Description: I would like to propose a refactor of the physical/execution planning based on the experi

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157349#comment-17157349 ] Joris Van den Bossche commented on ARROW-9458: -- > Did you set ? batch_size=1

[jira] [Commented] (ARROW-9420) [Rust][DataFusion] Add repartition/shuffle plan

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157350#comment-17157350 ] Andy Grove commented on ARROW-9420: --- I have created a new Jira to replace this one, sin

[jira] [Assigned] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-9464: - Assignee: Andy Grove > [Rust] [DataFusion] Physical plan refactor to support async and optimizat

[jira] [Closed] (ARROW-9420) [Rust][DataFusion] Add repartition/shuffle plan

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-9420. - Resolution: Duplicate Duplicate of https://issues.apache.org/jira/browse/ARROW-9464 > [Rust][DataFusion]

[jira] [Resolved] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9450. --- Resolution: Fixed Issue resolved by pull request 7749 [https://github.com/apache/arrow/pull/7

[jira] [Created] (ARROW-9465) [Python] Improve ergonomics of compute functions

2020-07-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-9465: - Summary: [Python] Improve ergonomics of compute functions Key: ARROW-9465 URL: https://issues.apache.org/jira/browse/ARROW-9465 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-9465) [Python] Improve ergonomics of compute functions

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9465: -- Description: Introspection of exported compute functions currently yield suboptimal output: {co

[jira] [Comment Edited] (ARROW-9359) [Rust][Dev] Cache packages and/or compilation in docker images

2020-07-14 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157358#comment-17157358 ] Jorge edited comment on ARROW-9359 at 7/14/20, 1:22 PM: One idea

[jira] [Commented] (ARROW-9359) [Rust][Dev] Cache packages and/or compilation in docker images

2020-07-14 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157358#comment-17157358 ] Jorge commented on ARROW-9359: -- One idea that I often use: {code:docker} # use specific ve

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157365#comment-17157365 ] Joris Van den Bossche commented on ARROW-9458: -- It might be we are not relea

[jira] [Closed] (ARROW-9444) [C++][Doc] Undocumented compute functions (string_isalpha, etc.)

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-9444. - Resolution: Duplicate > [C++][Doc] Undocumented compute functions (string_isalpha, etc.) > --

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157372#comment-17157372 ] Maarten Breddels commented on ARROW-9458: - Yes, in my case, the row groups are 1_

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157374#comment-17157374 ] Maarten Breddels commented on ARROW-9458: - Indeed, seeing a massive speedup. Too

[jira] [Commented] (ARROW-8205) [Rust] [DataFusion] DataFusion should enforce unique field names in a schema

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157378#comment-17157378 ] Andy Grove commented on ARROW-8205: --- I think I am fine with removing Expr::Column(usize

[jira] [Created] (ARROW-9466) [Rust] [DataFusion] Upgrade to latest version of sqlparser crate

2020-07-14 Thread Andy Grove (Jira)
Andy Grove created ARROW-9466: - Summary: [Rust] [DataFusion] Upgrade to latest version of sqlparser crate Key: ARROW-9466 URL: https://issues.apache.org/jira/browse/ARROW-9466 Project: Apache Arrow

[jira] [Closed] (ARROW-8614) [Rust] [Website] Create Rust-specific 0.17.0 blog post

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8614. - Resolution: Won't Fix Too late to do this now. > [Rust] [Website] Create Rust-specific 0.17.0 blog post

[jira] [Closed] (ARROW-8824) [Rust] [DataFusion] Implement new SQL parser

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8824. - Resolution: Won't Fix Closing this. We should do https://issues.apache.org/jira/browse/ARROW-9466 instea

[jira] [Updated] (ARROW-9467) [Rust] [Website] Create Rust-specific 1.0.0 blog post

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-9467: -- Component/s: Website Rust - DataFusion Rust > [Rust] [Website] Create

[jira] [Created] (ARROW-9467) [Rust] [Website] Create Rust-specific 1.0.0 blog post

2020-07-14 Thread Andy Grove (Jira)
Andy Grove created ARROW-9467: - Summary: [Rust] [Website] Create Rust-specific 1.0.0 blog post Key: ARROW-9467 URL: https://issues.apache.org/jira/browse/ARROW-9467 Project: Apache Arrow Issue Ty

[jira] [Closed] (ARROW-8829) [Rust] Implement SQL parser

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8829. - Resolution: Won't Fix > [Rust] Implement SQL parser > --- > > Key

[jira] [Closed] (ARROW-8828) [Rust] Implement SQL tokenizer

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8828. - Resolution: Won't Fix > [Rust] Implement SQL tokenizer > -- > >

[jira] [Closed] (ARROW-8774) [Rust] [DataFusion] Improve threading model

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8774. - Resolution: Duplicate Replacing with https://issues.apache.org/jira/browse/ARROW-9464 > [Rust] [DataFusi

[jira] [Closed] (ARROW-9466) [Rust] [DataFusion] Upgrade to latest version of sqlparser crate

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-9466. - Resolution: Duplicate Duplicate of https://issues.apache.org/jira/browse/ARROW-7903 > [Rust] [DataFusion

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157387#comment-17157387 ] Maarten Breddels commented on ARROW-9458: - let me know if you want to do the hono

[jira] [Commented] (ARROW-7903) [Rust] Upgrade SQLParser dependency for DataFusion?

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17157386#comment-17157386 ] Andy Grove commented on ARROW-7903: --- I think we should go ahead and do this, even thoug

[jira] [Updated] (ARROW-7903) [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-7903: -- Summary: [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion (was: [Rust] Upgrade SQLParse

  1   2   >