[jira] [Commented] (ARROW-8135) [Python] Problem importing PyArrow on a cluster

2020-03-18 Thread Matej Murin (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061490#comment-17061490 ] Matej Murin commented on ARROW-8135: [~wesm] yes, thank you for the link ! it was the

[jira] [Closed] (ARROW-8135) [Python] Problem importing PyArrow on a cluster

2020-03-18 Thread Matej Murin (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matej Murin closed ARROW-8135. -- Fix Version/s: 0.16.0 Resolution: Fixed Had been missing dependencies of the package > [Python]

[jira] [Resolved] (ARROW-7812) [Packaging][Python] Upgrade LLVM in manylinux1 docker image

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-7812. - Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6651 [https://githu

[jira] [Created] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-8142: - Summary: [Python/C++] Casting empty table from after parquet roundtrip causes critical failure Key: ARROW-8142 URL: https://issues.apache.org/jira/browse/ARROW-8142

[jira] [Updated] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter updated ARROW-8142: -- Description: When casting a schema of an empty table from dict encoded to non-dict encoded typ

[jira] [Created] (ARROW-8143) [C++] Provide a default implementation for ExtensionType::ExtensionEquals

2020-03-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8143: -- Summary: [C++] Provide a default implementation for ExtensionType::ExtensionEquals Key: ARROW-8143 URL: https://issues.apache.org/jira/browse/ARROW-8143 Project:

[jira] [Updated] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8142: - Description: When casting a schema of an empty table from dict encoded to non-dic

[jira] [Updated] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8142: - Fix Version/s: 0.17.0 > [Python/C++] Casting empty table from after parquet round

[jira] [Updated] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8142: - Component/s: C++ > [Python/C++] Casting empty table from after parquet roundtrip

[jira] [Resolved] (ARROW-8126) [C++][Compute] Add Top-K kernel benchmark

2020-03-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8126. --- Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6639 [https://g

[jira] [Assigned] (ARROW-8122) [Python] Empty numpy arrays with shape cannot be deserialized

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8122: Assignee: Wenjun Si (was: Joris Van den Bossche) > [Python] Empty numpy a

[jira] [Reopened] (ARROW-7907) [Python] Conversion to pandas of empty table with timestamp type aborts

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reopened ARROW-7907: -- Assignee: (was: Wes McKinney) Actually, it seems that the linked commit o

[jira] [Commented] (ARROW-8142) [Python/C++] Casting empty table from after parquet roundtrip causes critical failure

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061622#comment-17061622 ] Joris Van den Bossche commented on ARROW-8142: -- [~fjetter] thanks for the re

[jira] [Commented] (ARROW-7907) [Python] Conversion to pandas of empty table with timestamp type aborts

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061623#comment-17061623 ] Joris Van den Bossche commented on ARROW-7907: -- So a small reproducer that c

[jira] [Resolved] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-3329. --- Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6427 [https://g

[jira] [Assigned] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-3329: - Assignee: Jacek Pliszka > [Python] Error casting decimal(38, 4) to int64 > -

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-03-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061636#comment-17061636 ] Antoine Pitrou commented on ARROW-1231: --- [~clarkzinzow] You're welcome to take a lo

[jira] [Commented] (ARROW-7755) [Python] Windows wheel cannot be installed on Python 3.8

2020-03-18 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061666#comment-17061666 ] Krisztian Szucs commented on ARROW-7755: Bintray have both rc and "official" rele

[jira] [Created] (ARROW-8144) [CI] Cmake 3.2 nightly builds fails

2020-03-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8144: -- Summary: [CI] Cmake 3.2 nightly builds fails Key: ARROW-8144 URL: https://issues.apache.org/jira/browse/ARROW-8144 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-8144) [CI] Cmake 3.2 nightly build fails

2020-03-18 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8144: --- Summary: [CI] Cmake 3.2 nightly build fails (was: [CI] Cmake 3.2 nightly builds fails) > [C

[jira] [Updated] (ARROW-8144) [CI] Cmake 3.2 nightly builds fails

2020-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8144: -- Labels: pull-request-available (was: ) > [CI] Cmake 3.2 nightly builds fails > ---

[jira] [Created] (ARROW-8145) [C++] Rename GetTargetInfos

2020-03-18 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8145: - Summary: [C++] Rename GetTargetInfos Key: ARROW-8145 URL: https://issues.apache.org/jira/browse/ARROW-8145 Project: Apache Arrow Issue Type: Wish

[jira] [Reopened] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-18 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jacek Pliszka reopened ARROW-3329: -- It is resolved in C++. Now I need to work on Python part. Thank you to your work! What is left of

[jira] [Created] (ARROW-8146) [C++] Add per-filesystem facility to sanitize a path

2020-03-18 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8146: - Summary: [C++] Add per-filesystem facility to sanitize a path Key: ARROW-8146 URL: https://issues.apache.org/jira/browse/ARROW-8146 Project: Apache Arrow I

[jira] [Updated] (ARROW-8146) [C++] Add per-filesystem facility to sanitize a path

2020-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8146: -- Labels: pull-request-available (was: ) > [C++] Add per-filesystem facility to sanitize a path

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061849#comment-17061849 ] Wes McKinney commented on ARROW-1231: - [~clarkzinzow] note that google-cloud-cpp does

[jira] [Created] (ARROW-8147) [Packaging] Add google-cloud-cpp to ThirdpartyToolchain

2020-03-18 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8147: --- Summary: [Packaging] Add google-cloud-cpp to ThirdpartyToolchain Key: ARROW-8147 URL: https://issues.apache.org/jira/browse/ARROW-8147 Project: Apache Arrow Is

[jira] [Created] (ARROW-8148) [Packaging][C++] Add google-cloud-cpp to conda-forge

2020-03-18 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8148: --- Summary: [Packaging][C++] Add google-cloud-cpp to conda-forge Key: ARROW-8148 URL: https://issues.apache.org/jira/browse/ARROW-8148 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-8147) [C++] Add google-cloud-cpp to ThirdpartyToolchain

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8147: Summary: [C++] Add google-cloud-cpp to ThirdpartyToolchain (was: [C++][Packaging] Add google-cloud

[jira] [Updated] (ARROW-8147) [C++][Packaging] Add google-cloud-cpp to ThirdpartyToolchain

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8147: Summary: [C++][Packaging] Add google-cloud-cpp to ThirdpartyToolchain (was: [Packaging] Add google

[jira] [Commented] (ARROW-4484) [Java] improve Flight DoPut busy wait

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061873#comment-17061873 ] David Li commented on ARROW-4484: - Not a blocker for any version. > [Java] improve Fligh

[jira] [Updated] (ARROW-4484) [Java] improve Flight DoPut busy wait

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-4484: Fix Version/s: (was: 0.17.0) > [Java] improve Flight DoPut busy wait >

[jira] [Updated] (ARROW-8058) [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-8058: -- Fix Version/s: (was: 1.0.0) 0.17.0 > [C++][Python][Datas

[jira] [Assigned] (ARROW-8058) [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-8058: - Assignee: Francois Saint-Jacques > [C++][Python][Dataset] Provide an opt

[jira] [Updated] (ARROW-5745) [C++] properties of Map(Array|Type) are confusingly named

2020-03-18 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-5745: Fix Version/s: (was: 0.17.0) 1.0.0 > [C++] properties of Map(Array|Type) are

[jira] [Assigned] (ARROW-7673) [C++][Dataset] Revisit File discovery failure mode

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-7673: - Assignee: Francois Saint-Jacques > [C++][Dataset] Revisit File discovery

[jira] [Commented] (ARROW-7579) [FlightRPC] Make Handshake optional

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061874#comment-17061874 ] David Li commented on ARROW-7579: - Not a blocker for 0.17, removing from fix versions. >

[jira] [Updated] (ARROW-7579) [FlightRPC] Make Handshake optional

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-7579: Fix Version/s: (was: 0.17.0) > [FlightRPC] Make Handshake optional > --

[jira] [Commented] (ARROW-6062) [FlightRPC] Allow timeouts on all stream reads

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061876#comment-17061876 ] David Li commented on ARROW-6062: - Removing from 0.17. > [FlightRPC] Allow timeouts on

[jira] [Updated] (ARROW-6062) [FlightRPC] Allow timeouts on all stream reads

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-6062: Fix Version/s: (was: 0.17.0) > [FlightRPC] Allow timeouts on all stream reads > ---

[jira] [Created] (ARROW-8149) [C++/Python] Enable CUDA Support in conda recipes

2020-03-18 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8149: --- Summary: [C++/Python] Enable CUDA Support in conda recipes Key: ARROW-8149 URL: https://issues.apache.org/jira/browse/ARROW-8149 Project: Apache Arrow Issue Type: New

[jira] [Updated] (ARROW-7673) [C++][Dataset] Revisit File discovery failure mode

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-7673: -- Fix Version/s: (was: 1.0.0) 0.17.0 > [C++][Dataset] Revi

[jira] [Assigned] (ARROW-7854) [C++][Dataset] Option to memory map when reading IPC format

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-7854: - Assignee: Francois Saint-Jacques > [C++][Dataset] Option to memory map w

[jira] [Updated] (ARROW-5744) [C++] Do not error in Table::CombineChunks for BinaryArray types that overflow 2GB limit

2020-03-18 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-5744: Fix Version/s: (was: 0.17.0) 1.0.0 > [C++] Do not error in Table::CombineChu

[jira] [Updated] (ARROW-7820) [C++][Gandiva] Add CMake support for compiling LLVM's IR into a library

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-7820: -- Fix Version/s: (was: 0.17.0) > [C++][Gandiva] Add CMake support for compili

[jira] [Updated] (ARROW-7818) [C++][Gandiva] Generate Filter kernels from gandiva code at compile time

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-7818: -- Fix Version/s: (was: 0.17.0) > [C++][Gandiva] Generate Filter kernels from

[jira] [Commented] (ARROW-7854) [C++][Dataset] Option to memory map when reading IPC format

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061882#comment-17061882 ] Joris Van den Bossche commented on ARROW-7854: -- [~fsaintjacques] this actual

[jira] [Updated] (ARROW-6122) [C++] ArgSort kernel must support FixedSizeBinary

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6122: -- Fix Version/s: (was: 0.17.0) > [C++] ArgSort kernel must support FixedSizeB

[jira] [Commented] (ARROW-5572) [Python] raise error message when passing invalid filter in parquet reading

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061915#comment-17061915 ] Joris Van den Bossche commented on ARROW-5572: -- This works now correctly wit

[jira] [Assigned] (ARROW-8088) [C++][Dataset] Partition columns with specified dictionary type result in all nulls

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8088: Assignee: Ben Kietzman (was: Joris Van den Bossche) > [C++][Dataset] Part

[jira] [Assigned] (ARROW-8088) [C++][Dataset] Partition columns with specified dictionary type result in all nulls

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8088: Assignee: Joris Van den Bossche (was: Ben Kietzman) > [C++][Dataset] Part

[jira] [Updated] (ARROW-8088) [C++][Dataset] Partition columns with specified dictionary type result in all nulls

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8088: - Fix Version/s: 0.17.0 > [C++][Dataset] Partition columns with specified dictionar

[jira] [Assigned] (ARROW-8088) [C++][Dataset] Partition columns with specified dictionary type result in all nulls

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8088: Assignee: Ben Kietzman (was: Joris Van den Bossche) > [C++][Dataset] Part

[jira] [Resolved] (ARROW-8127) [C++] [Parquet] Incorrect column chunk metadata for multipage batch writes

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8127. - Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6637 [https://githu

[jira] [Assigned] (ARROW-8088) [C++][Dataset] Partition columns with specified dictionary type result in all nulls

2020-03-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8088: Assignee: Joris Van den Bossche > [C++][Dataset] Partition columns with sp

[jira] [Updated] (ARROW-7673) [C++][Dataset] Revisit File discovery failure mode

2020-03-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-7673: --- Fix Version/s: (was: 0.17.0) 1.0.0 > [C++][Dataset] Revisit File disco

[GitHub] [arrow-testing] pitrou opened a new pull request #21: PARQUET-1819: [C++] Add parquet fuzz files

2020-03-18 Thread GitBox
pitrou opened a new pull request #21: PARQUET-1819: [C++] Add parquet fuzz files URL: https://github.com/apache/arrow-testing/pull/21 This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow-testing] pitrou merged pull request #21: PARQUET-1819: [C++] Add parquet fuzz files

2020-03-18 Thread GitBox
pitrou merged pull request #21: PARQUET-1819: [C++] Add parquet fuzz files URL: https://github.com/apache/arrow-testing/pull/21 This is an automated message from the Apache Git Service. To respond to the message, please log o

[jira] [Created] (ARROW-8150) [Rust] Allow writing custom FileMetaData k/v pairs

2020-03-18 Thread David Kegley (Jira)
David Kegley created ARROW-8150: --- Summary: [Rust] Allow writing custom FileMetaData k/v pairs Key: ARROW-8150 URL: https://issues.apache.org/jira/browse/ARROW-8150 Project: Apache Arrow Issue T

[jira] [Created] (ARROW-8151) [Benchmarking][Dataset] Benchmark Parquet read performance with S3File

2020-03-18 Thread David Li (Jira)
David Li created ARROW-8151: --- Summary: [Benchmarking][Dataset] Benchmark Parquet read performance with S3File Key: ARROW-8151 URL: https://issues.apache.org/jira/browse/ARROW-8151 Project: Apache Arrow

[jira] [Created] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-03-18 Thread David Li (Jira)
David Li created ARROW-8152: --- Summary: [C++] IO: split large coalesced reads into smaller ones Key: ARROW-8152 URL: https://issues.apache.org/jira/browse/ARROW-8152 Project: Apache Arrow Issue Type

[jira] [Updated] (ARROW-8151) [Benchmarking][Dataset] Benchmark Parquet read performance with S3File

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-8151: Issue Type: Improvement (was: Bug) > [Benchmarking][Dataset] Benchmark Parquet read performance with S3Fil

[jira] [Updated] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-8152: Issue Type: Improvement (was: Bug) > [C++] IO: split large coalesced reads into smaller ones > ---

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-03-18 Thread Clark Zinzow (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17061994#comment-17061994 ] Clark Zinzow commented on ARROW-1231: - [~apitrou] [~wesm] Great, thanks! Is it safe t

[jira] [Updated] (ARROW-8150) [Rust] Allow writing custom FileMetaData k/v pairs

2020-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8150: -- Labels: pull-request-available (was: ) > [Rust] Allow writing custom FileMetaData k/v pairs >

[jira] [Updated] (ARROW-7390) [C++][Dataset] Concurrency race in Projector::Project

2020-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7390: -- Labels: pull-request-available (was: ) > [C++][Dataset] Concurrency race in Projector::Project

[jira] [Closed] (ARROW-7854) [C++][Dataset] Option to memory map when reading IPC format

2020-03-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques closed ARROW-7854. - Resolution: Not A Problem Already supported. > [C++][Dataset] Option to memory m

[jira] [Created] (ARROW-8153) [Packaging] Update the conda feedstock files and upload artifacts to Anaconda

2020-03-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8153: -- Summary: [Packaging] Update the conda feedstock files and upload artifacts to Anaconda Key: ARROW-8153 URL: https://issues.apache.org/jira/browse/ARROW-8153 Proje

[jira] [Updated] (ARROW-8144) [CI] Cmake 3.2 nightly build fails

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-8144: Component/s: Continuous Integration > [CI] Cmake 3.2 nightly build fails >

[jira] [Updated] (ARROW-8153) [Packaging] Update the conda feedstock files and upload artifacts to Anaconda

2020-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8153: -- Labels: pull-request-available (was: ) > [Packaging] Update the conda feedstock files and uplo

[jira] [Commented] (ARROW-8145) [C++] Rename GetTargetInfos

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062035#comment-17062035 ] Kouhei Sutou commented on ARROW-8145: - Oh, sorry. I'm OK either. Should I create a pu

[jira] [Commented] (ARROW-8145) [C++] Rename GetTargetInfos

2020-03-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062036#comment-17062036 ] Antoine Pitrou commented on ARROW-8145: --- I can do it. There's no hurry in any case.

[jira] [Updated] (ARROW-8145) [C++] Rename GetTargetInfos

2020-03-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-8145: -- Fix Version/s: 0.17.0 > [C++] Rename GetTargetInfos > --- > >

[jira] [Resolved] (ARROW-8144) [CI] Cmake 3.2 nightly build fails

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-8144. - Resolution: Fixed Issue resolved by pull request 6654 [https://github.com/apache/arrow/pull/6654]

[jira] [Created] (ARROW-8154) HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release

2020-03-18 Thread Eric Henry (Jira)
Eric Henry created ARROW-8154: - Summary: HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release Key: ARROW-8154 URL: https://issues.apache.org/jira/browse/ARROW-8154 Project: Apach

[jira] [Assigned] (ARROW-7966) [Integration][Flight][C++] Client should verify each batch independently

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-7966: --- Assignee: David Li > [Integration][Flight][C++] Client should verify each batch independently >

[jira] [Commented] (ARROW-7966) [Integration][Flight][C++] Client should verify each batch independently

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062043#comment-17062043 ] David Li commented on ARROW-7966: - Fixing this causes the test I added in ARROW-7899 to f

[jira] [Comment Edited] (ARROW-7966) [Integration][Flight][C++] Client should verify each batch independently

2020-03-18 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062043#comment-17062043 ] David Li edited comment on ARROW-7966 at 3/18/20, 8:57 PM: --- Fix

[jira] [Updated] (ARROW-7966) [Integration][Flight][C++] Client should verify each batch independently

2020-03-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7966: -- Labels: pull-request-available (was: ) > [Integration][Flight][C++] Client should verify each

[jira] [Commented] (ARROW-8154) [Python] HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8154?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062082#comment-17062082 ] Wes McKinney commented on ARROW-8154: - I think this is a dup of ARROW-7841, a regress

[jira] [Updated] (ARROW-8154) [Python] HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8154: Summary: [Python] HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release (w

[jira] [Updated] (ARROW-8141) [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8141: Fix Version/s: 0.17.0 > [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics

[jira] [Assigned] (ARROW-8141) [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8141: --- Assignee: Frank Du > [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrins

[jira] [Resolved] (ARROW-8141) [C++] Optimize BM_PlainDecodingBoolean performance using AVX512 Intrinsics API

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8141. - Resolution: Fixed Issue resolved by pull request 6650 [https://github.com/apache/arrow/pull/6650]

[jira] [Assigned] (ARROW-8080) [C++] Add AVX512 build option

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8080: --- Assignee: Frank Du > [C++] Add AVX512 build option > - > >

[jira] [Resolved] (ARROW-8080) [C++] Add AVX512 build option

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8080. - Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6585 [https://githu

[jira] [Created] (ARROW-8155) [C++] Add "ON only if system dependencies available" build mode for certain optional Arrow components

2020-03-18 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8155: --- Summary: [C++] Add "ON only if system dependencies available" build mode for certain optional Arrow components Key: ARROW-8155 URL: https://issues.apache.org/jira/browse/ARROW-8155

[jira] [Commented] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062119#comment-17062119 ] Wes McKinney commented on ARROW-8152: - What do you think about introducing the read p

[jira] [Updated] (ARROW-8154) [Python] HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-8154: Fix Version/s: 0.17.0 > [Python] HDFS Filesystem does not set environment variables in pyarrow >

[jira] [Resolved] (ARROW-8154) [Python] HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-8154. - Resolution: Duplicate I think so too. Sorry for the regression. Please wait for 0.17.0. > [Pyth

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062123#comment-17062123 ] Wes McKinney commented on ARROW-1231: - [~clarkzinzow] well, adding the thirdparty dep

[jira] [Commented] (ARROW-8145) [C++] Rename GetTargetInfos

2020-03-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062124#comment-17062124 ] Kouhei Sutou commented on ARROW-8145: - Thanks! > [C++] Rename GetTargetInfos > -

[jira] [Created] (ARROW-8156) [C++] Add variant of Filesystem::OpenInputFile that has memory-map like behavior if it is possible

2020-03-18 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8156: --- Summary: [C++] Add variant of Filesystem::OpenInputFile that has memory-map like behavior if it is possible Key: ARROW-8156 URL: https://issues.apache.org/jira/browse/ARROW-8156

[jira] [Commented] (ARROW-7854) [C++][Dataset] Option to memory map when reading IPC format

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062135#comment-17062135 ] Wes McKinney commented on ARROW-7854: - I'm not totally satisfied by the global memory

[jira] [Closed] (ARROW-8154) [Python] HDFS Filesystem does not set environment variables in pyarrow 0.16.0 release

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8154. --- > [Python] HDFS Filesystem does not set environment variables in pyarrow > 0.16.0 release > ---

[jira] [Created] (ARROW-8157) [C++] Upgrade to LLVM 9

2020-03-18 Thread Jun NAITOH (Jira)
Jun NAITOH created ARROW-8157: - Summary: [C++] Upgrade to LLVM 9 Key: ARROW-8157 URL: https://issues.apache.org/jira/browse/ARROW-8157 Project: Apache Arrow Issue Type: Improvement Comp

[jira] [Resolved] (ARROW-7858) [C++][Python] Support casting an Extension type to its storage type

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7858. - Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6633 [https://githu

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-03-18 Thread Clark Zinzow (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062166#comment-17062166 ] Clark Zinzow commented on ARROW-1231: - [~wesm] Ah I don't think I was very clear, sor

[jira] [Commented] (ARROW-1231) [C++] Add filesystem / IO implementation for Google Cloud Storage

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17062175#comment-17062175 ] Wes McKinney commented on ARROW-1231: - Perhaps I'm not understanding ARROW-8031. Are

[jira] [Resolved] (ARROW-7996) [Python] Error serializing empty pandas DataFrame with pyarrow

2020-03-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7996. - Resolution: Fixed Resolved by https://github.com/apache/arrow/commit/7916fb49a0e4c125a02f8c13afb

  1   2   >