[jira] [Assigned] (ARROW-9453) [Rust] Compiling Rust libary against WASM32 library

2020-07-14 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan reassigned ARROW-9453: -- Assignee: RJ Atwal > [Rust] Compiling Rust libary against WASM32 library >

[jira] [Assigned] (ARROW-9424) [C++][Parquet] Disable writing files with LZ4 codec

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9424: --- Assignee: Wes McKinney (was: Ben Kietzman) > [C++][Parquet] Disable writing files with LZ4

[jira] [Resolved] (ARROW-9424) [C++][Parquet] Disable writing files with LZ4 codec

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9424. - Resolution: Fixed Issue resolved by pull request 7757

[jira] [Created] (ARROW-9474) Column type inference in read_csv vs. open_csv. CSV conversion error to null.

2020-07-14 Thread Sep Dehpour (Jira)
Sep Dehpour created ARROW-9474: -- Summary: Column type inference in read_csv vs. open_csv. CSV conversion error to null. Key: ARROW-9474 URL: https://issues.apache.org/jira/browse/ARROW-9474 Project:

[jira] [Updated] (ARROW-9453) [Rust] Compiling Rust libary against WASM32 library

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9453: -- Labels: pull-request-available (was: ) > [Rust] Compiling Rust libary against WASM32 library

[jira] [Resolved] (ARROW-9399) [C++] Add forward compatibility checks for unrecognized future MetadataVersion

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9399. - Resolution: Fixed Issue resolved by pull request 7765

[jira] [Resolved] (ARROW-9452) [Rust] [DateFusion] Improve performance of parquet scan

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9452. - Resolution: Fixed Issue resolved by pull request 7743

[jira] [Resolved] (ARROW-9473) [Doc] Polishing for 1.0

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-9473. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7766

[jira] [Resolved] (ARROW-9409) [CI][Crossbow] Nightly conda-r fails

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-9409. Fix Version/s: (was: 2.0.0) 1.0.0 Resolution: Fixed Issue

[jira] [Resolved] (ARROW-9472) [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-9472. Resolution: Fixed Issue resolved by pull request 7763

[jira] [Assigned] (ARROW-9473) [Doc] Polishing for 1.0

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9473: Assignee: Apache Arrow JIRA Bot (was: Neal Richardson) > [Doc] Polishing

[jira] [Assigned] (ARROW-9473) [Doc] Polishing for 1.0

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9473: Assignee: Neal Richardson (was: Apache Arrow JIRA Bot) > [Doc] Polishing

[jira] [Created] (ARROW-9473) [Doc] Polishing for 1.0

2020-07-14 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9473: -- Summary: [Doc] Polishing for 1.0 Key: ARROW-9473 URL: https://issues.apache.org/jira/browse/ARROW-9473 Project: Apache Arrow Issue Type: New Feature

[jira] [Updated] (ARROW-9473) [Doc] Polishing for 1.0

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9473: -- Labels: pull-request-available (was: ) > [Doc] Polishing for 1.0 > --- >

[jira] [Updated] (ARROW-9399) [C++] Add forward compatibility checks for unrecognized future MetadataVersion

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9399: -- Labels: pull-request-available (was: ) > [C++] Add forward compatibility checks for

[jira] [Resolved] (ARROW-9438) [CI] Spark integration tests are failing

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9438. - Resolution: Fixed Issue resolved by pull request 7746

[jira] [Resolved] (ARROW-8314) [Python] Provide a method to select a subset of columns of a Table

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8314. - Resolution: Fixed Issue resolved by pull request 7272

[jira] [Assigned] (ARROW-8480) [Rust] There is no check for allocation failure

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8480: --- Assignee: Mahmut Bulut > [Rust] There is no check for allocation failure >

[jira] [Resolved] (ARROW-8480) [Rust] There is no check for allocation failure

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8480. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7734

[jira] [Resolved] (ARROW-9449) [R] Strip arrow.so

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-9449. Resolution: Fixed Issue resolved by pull request 7741

[jira] [Resolved] (ARROW-8650) [Rust] [Website] Add documentation to Arrow website

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-8650. Resolution: Fixed Issue resolved by pull request 7762

[jira] [Resolved] (ARROW-9447) [Rust][DataFusion] Allow closures as ScalarUDFs

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9447. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7740

[jira] [Resolved] (ARROW-9458) [Python] Dataset Scanner is single-threaded only

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9458. - Resolution: Fixed Issue resolved by pull request 7756

[jira] [Assigned] (ARROW-9298) [C++] Fix crashes on invalid input (OSS-Fuzz)

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9298: -- Assignee: Antoine Pitrou (was: Apache Arrow JIRA Bot) > [C++] Fix crashes on invalid

[jira] [Assigned] (ARROW-9472) [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9472: Assignee: Neal Richardson (was: Apache Arrow JIRA Bot) > [R] Provide

[jira] [Assigned] (ARROW-9472) [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9472: Assignee: Apache Arrow JIRA Bot (was: Neal Richardson) > [R] Provide

[jira] [Updated] (ARROW-9472) [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9472: -- Labels: pull-request-available (was: ) > [R] Provide configurable MetadataVersion in IPC API

[jira] [Commented] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Ryan Murray (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157620#comment-17157620 ] Ryan Murray commented on ARROW-9470: much better result than I got! I honestly think the RAT plugin

[jira] [Commented] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157614#comment-17157614 ] Antoine Pitrou commented on ARROW-9470: --- {{archery docker run --no-build debian-java}} takes 4

[jira] [Commented] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157598#comment-17157598 ] Antoine Pitrou commented on ARROW-9470: --- It turns out the RAT plugin is also stupid, it fails if

[jira] [Commented] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Ryan Murray (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157577#comment-17157577 ] Ryan Murray commented on ARROW-9470: Testing locally the builds pass but spit out warnings but run

[jira] [Updated] (ARROW-8650) [Rust] [Website] Add documentation to Arrow website

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8650: -- Labels: pull-request-available (was: ) > [Rust] [Website] Add documentation to Arrow website

[jira] [Commented] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Ryan Murray (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157564#comment-17157564 ] Ryan Murray commented on ARROW-9470: Im not a huge fan of turning it off completely. If we wrap it in

[jira] [Created] (ARROW-9472) [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed

2020-07-14 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9472: -- Summary: [R] Provide configurable MetadataVersion in IPC API and environment variable to set default to V4 when needed Key: ARROW-9472 URL:

[jira] [Resolved] (ARROW-9385) [Python] [CI] jpype integration failure

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9385. --- Resolution: Fixed Issue resolved by pull request 7753

[jira] [Updated] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9470: -- Labels: pull-request-available (was: ) > [CI][Java] Run Maven in parallel >

[jira] [Closed] (ARROW-9468) [Python][Java] Ensure jvm module doesn't leak java buffers

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-9468. - Resolution: Duplicate > [Python][Java] Ensure jvm module doesn't leak java buffers >

[jira] [Commented] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157535#comment-17157535 ] Antoine Pitrou commented on ARROW-9470: --- It looks like the Apache RAT Maven plugin isn't compatible

[jira] [Created] (ARROW-9471) [C++] Scan Dataset in reverse

2020-07-14 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9471: --- Summary: [C++] Scan Dataset in reverse Key: ARROW-9471 URL: https://issues.apache.org/jira/browse/ARROW-9471 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-7831) [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7831. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 6402

[jira] [Assigned] (ARROW-7831) [Java] unnecessary buffer allocation when calling splitAndTransferTo on variable width vectors

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7831: --- Assignee: stephane campinas > [Java] unnecessary buffer allocation when calling

[jira] [Created] (ARROW-9470) [CI][Java] Run Maven in parallel

2020-07-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-9470: - Summary: [CI][Java] Run Maven in parallel Key: ARROW-9470 URL: https://issues.apache.org/jira/browse/ARROW-9470 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-8729) [C++][Dataset] Only selecting a partition column results in empty table

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8729. - Resolution: Fixed Issue resolved by pull request 7534

[jira] [Assigned] (ARROW-9424) [C++][Parquet] Disable writing files with LZ4 codec

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9424: Assignee: Ben Kietzman (was: Apache Arrow JIRA Bot) > [C++][Parquet]

[jira] [Assigned] (ARROW-9424) [C++][Parquet] Disable writing files with LZ4 codec

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9424: Assignee: Apache Arrow JIRA Bot (was: Ben Kietzman) > [C++][Parquet]

[jira] [Resolved] (ARROW-9390) [C++] Review compute function names

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9390. --- Resolution: Fixed Issue resolved by pull request 7755

[jira] [Updated] (ARROW-9424) [C++][Parquet] Disable writing files with LZ4 codec

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9424: -- Labels: pull-request-available (was: ) > [C++][Parquet] Disable writing files with LZ4 codec

[jira] [Updated] (ARROW-9469) [Python] Make more objects weakrefable

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9469: -- Labels: pull-request-available (was: ) > [Python] Make more objects weakrefable >

[jira] [Assigned] (ARROW-9469) [Python] Make more objects weakrefable

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9469: - Assignee: Antoine Pitrou > [Python] Make more objects weakrefable >

[jira] [Updated] (ARROW-7800) [Python] Expose GetRecordBatchReader API in PyArrow

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7800: Fix Version/s: (was: 1.0.0) 2.0.0 > [Python] Expose GetRecordBatchReader

[jira] [Assigned] (ARROW-9438) [CI] Spark integration tests are failing

2020-07-14 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-9438: --- Assignee: Bryan Cutler > [CI] Spark integration tests are failing >

[jira] [Created] (ARROW-9469) [Python] Make more objects weakrefable

2020-07-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-9469: - Summary: [Python] Make more objects weakrefable Key: ARROW-9469 URL: https://issues.apache.org/jira/browse/ARROW-9469 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-9468) [Python][Java] Ensure jvm module doesn't leak java buffers

2020-07-14 Thread Ryan Murray (Jira)
Ryan Murray created ARROW-9468: -- Summary: [Python][Java] Ensure jvm module doesn't leak java buffers Key: ARROW-9468 URL: https://issues.apache.org/jira/browse/ARROW-9468 Project: Apache Arrow

[jira] [Commented] (ARROW-9458) [Python] Dataset Scanner is single-threaded only

2020-07-14 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157445#comment-17157445 ] Ben Kietzman commented on ARROW-9458: - This looks like it might be a systemic problem with using a

[jira] [Commented] (ARROW-9409) [CI][Crossbow] Nightly conda-r fails

2020-07-14 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157441#comment-17157441 ] Neal Richardson commented on ARROW-9409: It would be better IMO to actually test the conda

[jira] [Commented] (ARROW-9453) [Rust] Compiling Rust libary against WASM32 library

2020-07-14 Thread RJ Atwal (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157435#comment-17157435 ] RJ Atwal commented on ARROW-9453: - [~andygrove]  To answer your questions: 1. Wasm code would be running

[jira] [Commented] (ARROW-9453) [Rust] Compiling Rust libary against WASM32 library

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157419#comment-17157419 ] Andy Grove commented on ARROW-9453: --- This sounds really interesting. I have been thinking about

[jira] [Updated] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9458: - Fix Version/s: 1.0.0 > [Python] Dataset singlethreaded only >

[jira] [Updated] (ARROW-9458) [Python] Dataset Scanner is single-threaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9458: - Summary: [Python] Dataset Scanner is single-threaded only (was: [Python]

[jira] [Resolved] (ARROW-8344) [C#] StringArray.Builder.Clear() corrupts subsequently-built array contents

2020-07-14 Thread Eric Erhardt (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Erhardt resolved ARROW-8344. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7671

[jira] [Resolved] (ARROW-9460) [C++] BinaryContainsExact doesn't cope with double characters in the pattern

2020-07-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9460. - Resolution: Fixed Issue resolved by pull request 7750

[jira] [Updated] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9458: -- Labels: pull-request-available (was: ) > [Python] Dataset singlethreaded only >

[jira] [Assigned] (ARROW-9390) [C++] Review compute function names

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9390: Assignee: Apache Arrow JIRA Bot (was: Antoine Pitrou) > [C++] Review

[jira] [Assigned] (ARROW-9390) [C++] Review compute function names

2020-07-14 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9390: Assignee: Antoine Pitrou (was: Apache Arrow JIRA Bot) > [C++] Review

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157390#comment-17157390 ] Joris Van den Bossche commented on ARROW-9458: -- How do you release the GIL with a yielding

[jira] [Updated] (ARROW-9390) [C++] Review compute function names

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9390: -- Labels: pull-request-available (was: ) > [C++] Review compute function names >

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157387#comment-17157387 ] Maarten Breddels commented on ARROW-9458: - let me know if you want to do the honors yourself,

[jira] [Commented] (ARROW-7903) [Rust] Upgrade SQLParser dependency for DataFusion?

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157386#comment-17157386 ] Andy Grove commented on ARROW-7903: --- I think we should go ahead and do this, even though it is a fair

[jira] [Updated] (ARROW-7903) [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-7903: -- Summary: [Rust] [DataFusion] Upgrade SQLParser dependency for DataFusion (was: [Rust] Upgrade

[jira] [Closed] (ARROW-9466) [Rust] [DataFusion] Upgrade to latest version of sqlparser crate

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-9466. - Resolution: Duplicate Duplicate of https://issues.apache.org/jira/browse/ARROW-7903 > [Rust]

[jira] [Closed] (ARROW-8774) [Rust] [DataFusion] Improve threading model

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8774. - Resolution: Duplicate Replacing with https://issues.apache.org/jira/browse/ARROW-9464 > [Rust]

[jira] [Closed] (ARROW-8829) [Rust] Implement SQL parser

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8829?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8829. - Resolution: Won't Fix > [Rust] Implement SQL parser > --- > >

[jira] [Closed] (ARROW-8824) [Rust] [DataFusion] Implement new SQL parser

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8824. - Resolution: Won't Fix Closing this. We should do https://issues.apache.org/jira/browse/ARROW-9466

[jira] [Closed] (ARROW-8614) [Rust] [Website] Create Rust-specific 0.17.0 blog post

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-8614. - Resolution: Won't Fix Too late to do this now. > [Rust] [Website] Create Rust-specific 0.17.0 blog post

[jira] [Created] (ARROW-9466) [Rust] [DataFusion] Upgrade to latest version of sqlparser crate

2020-07-14 Thread Andy Grove (Jira)
Andy Grove created ARROW-9466: - Summary: [Rust] [DataFusion] Upgrade to latest version of sqlparser crate Key: ARROW-9466 URL: https://issues.apache.org/jira/browse/ARROW-9466 Project: Apache Arrow

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157374#comment-17157374 ] Maarten Breddels commented on ARROW-9458: - Indeed, seeing a massive speedup. Too bad py-spy

[jira] [Closed] (ARROW-9444) [C++][Doc] Undocumented compute functions (string_isalpha, etc.)

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-9444. - Resolution: Duplicate > [C++][Doc] Undocumented compute functions (string_isalpha, etc.) >

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157365#comment-17157365 ] Joris Van den Bossche commented on ARROW-9458: -- It might be we are not releasing the GIL in

[jira] [Commented] (ARROW-9359) [Rust][Dev] Cache packages and/or compilation in docker images

2020-07-14 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157358#comment-17157358 ] Jorge commented on ARROW-9359: -- One idea that I often use: {code:docker} # use specific version here to

[jira] [Comment Edited] (ARROW-9359) [Rust][Dev] Cache packages and/or compilation in docker images

2020-07-14 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157358#comment-17157358 ] Jorge edited comment on ARROW-9359 at 7/14/20, 1:22 PM: One idea that I often

[jira] [Updated] (ARROW-9465) [Python] Improve ergonomics of compute functions

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9465: -- Description: Introspection of exported compute functions currently yield suboptimal output:

[jira] [Created] (ARROW-9465) [Python] Improve ergonomics of compute functions

2020-07-14 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-9465: - Summary: [Python] Improve ergonomics of compute functions Key: ARROW-9465 URL: https://issues.apache.org/jira/browse/ARROW-9465 Project: Apache Arrow

[jira] [Closed] (ARROW-9420) [Rust][DataFusion] Add repartition/shuffle plan

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-9420. - Resolution: Duplicate Duplicate of https://issues.apache.org/jira/browse/ARROW-9464 >

[jira] [Resolved] (ARROW-9450) [Python] "pytest pyarrow" takes over 10 seconds to collect tests and start executing

2020-07-14 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9450. --- Resolution: Fixed Issue resolved by pull request 7749

[jira] [Assigned] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-9464: - Assignee: Andy Grove > [Rust] [DataFusion] Physical plan refactor to support async and

[jira] [Commented] (ARROW-9420) [Rust][DataFusion] Add repartition/shuffle plan

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157350#comment-17157350 ] Andy Grove commented on ARROW-9420: --- I have created a new Jira to replace this one, since the changes I

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157349#comment-17157349 ] Joris Van den Bossche commented on ARROW-9458: -- > Did you set ? batch_size=1_000_000 The

[jira] [Updated] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-9464: -- Description: I would like to propose a refactor of the physical/execution planning based on the

[jira] [Created] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
Andy Grove created ARROW-9464: - Summary: [Rust] [DataFusion] Physical plan refactor to support async and optimization rules Key: ARROW-9464 URL: https://issues.apache.org/jira/browse/ARROW-9464 Project:

[jira] [Updated] (ARROW-9464) [Rust] [DataFusion] Physical plan refactor to support async and optimization rules

2020-07-14 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-9464: -- Description: I would like to propose a refactor of the physical/execution planning based on the

[jira] [Updated] (ARROW-9463) [Go] The writer is double closed in TestReadWrite

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9463: -- Labels: pull-request-available (was: ) > [Go] The writer is double closed in TestReadWrite >

[jira] [Created] (ARROW-9463) [Go] The writer is double closed in TestReadWrite

2020-07-14 Thread FredGan (Jira)
FredGan created ARROW-9463: -- Summary: [Go] The writer is double closed in TestReadWrite Key: ARROW-9463 URL: https://issues.apache.org/jira/browse/ARROW-9463 Project: Apache Arrow Issue Type: Test

[jira] [Updated] (ARROW-9462) [Go] The Indentation after the first Record arrjson writer is missing

2020-07-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9462: -- Labels: pull-request-available (was: ) > [Go] The Indentation after the first Record arrjson

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157340#comment-17157340 ] Maarten Breddels commented on ARROW-9458: - Did you set ? batch_size=1_000_000 > [Python] Dataset

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157337#comment-17157337 ] Joris Van den Bossche commented on ARROW-9458: -- [~maartenbreddels] how big are the row

[jira] [Commented] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17157338#comment-17157338 ] Maarten Breddels commented on ARROW-9458: -   Running this (now with all columns) {code:java}

[jira] [Updated] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maarten Breddels updated ARROW-9458: Attachment: image-2020-07-14-14-38-16-767.png > [Python] Dataset singlethreaded only >

[jira] [Updated] (ARROW-9458) [Python] Dataset singlethreaded only

2020-07-14 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maarten Breddels updated ARROW-9458: Attachment: image-2020-07-14-14-31-29-943.png > [Python] Dataset singlethreaded only >

[jira] [Updated] (ARROW-7955) [Java] Support large buffer for file/stream IPC

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-7955: --- Fix Version/s: 1.0.0 > [Java] Support large buffer for file/stream IPC >

[jira] [Updated] (ARROW-8443) [Gandiva][C++] Fix round/truncate to no-op for special cases

2020-07-14 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8443: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [Gandiva][C++] Fix

  1   2   >