[jira] [Updated] (ARROW-10228) Donate Julia Implementation

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10228: --- Labels: pull-request-available (was: ) > Donate Julia Implementation >

[jira] [Updated] (ARROW-10229) [C++][Parquet] Remove left over ARROW_LOG statement.

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10229: --- Labels: pull-request-available (was: ) > [C++][Parquet] Remove left over ARROW_LOG

[jira] [Created] (ARROW-10229) [C++][Parquet] Remove left over ARROW_LOG statement.

2020-10-07 Thread Micah Kornfield (Jira)
Micah Kornfield created ARROW-10229: --- Summary: [C++][Parquet] Remove left over ARROW_LOG statement. Key: ARROW-10229 URL: https://issues.apache.org/jira/browse/ARROW-10229 Project: Apache Arrow

[jira] [Created] (ARROW-10228) Donate Julia Implementation

2020-10-07 Thread Jacob Quinn (Jira)
Jacob Quinn created ARROW-10228: --- Summary: Donate Julia Implementation Key: ARROW-10228 URL: https://issues.apache.org/jira/browse/ARROW-10228 Project: Apache Arrow Issue Type: New Feature

[jira] [Resolved] (ARROW-10227) [Ruby] Use a table size as the default for parquet chunk_size

2020-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-10227. -- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8391

[jira] [Assigned] (ARROW-10227) [Ruby] Use a table size as the default for parquet chunk_size

2020-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou reassigned ARROW-10227: Assignee: Shoichi Kagawa > [Ruby] Use a table size as the default for parquet chunk_size

[jira] [Updated] (ARROW-10227) [Ruby] Use a table size as the default for parquet chunk_size

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10227: --- Labels: pull-request-available (was: ) > [Ruby] Use a table size as the default for

[jira] [Updated] (ARROW-8518) [Python] Create tools to enable optional components (like Gandiva, Flight) to be built and deployed as separate Python packages

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8518: -- Labels: pull-request-available (was: ) > [Python] Create tools to enable optional components

[jira] [Created] (ARROW-10227) [Ruby] Use a table size as the default for parquet chunk_size

2020-10-07 Thread Shoichi Kagawa (Jira)
Shoichi Kagawa created ARROW-10227: -- Summary: [Ruby] Use a table size as the default for parquet chunk_size Key: ARROW-10227 URL: https://issues.apache.org/jira/browse/ARROW-10227 Project: Apache

[jira] [Updated] (ARROW-10207) [C++] Unary kernels that results in a list have no preallocated offset buffer

2020-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-10207: - Summary: [C++] Unary kernels that results in a list have no preallocated offset buffer (was:

[jira] [Assigned] (ARROW-10224) Build Python 3.9 wheels

2020-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou reassigned ARROW-10224: Assignee: Terence Honles > Build Python 3.9 wheels > --- > >

[jira] [Commented] (ARROW-10226) [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset

2020-10-07 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209945#comment-17209945 ] Andy Grove commented on ARROW-10226: Query works fine against tbl files but not against parquet

[jira] [Resolved] (ARROW-10134) [C++][Dataset] Add ParquetFileFragment::num_row_groups property

2020-10-07 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-10134. -- Resolution: Fixed Issue resolved by pull request 8317

[jira] [Updated] (ARROW-8296) [C++][Dataset] IpcFileFormat should support writing files with compressed buffers

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8296: -- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] IpcFileFormat should

[jira] [Resolved] (ARROW-9782) [C++][Dataset] Ability to write ".feather" files with IpcFileFormat

2020-10-07 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-9782. - Resolution: Fixed Issue resolved by pull request 8305

[jira] [Commented] (ARROW-10226) [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset

2020-10-07 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209937#comment-17209937 ] Andy Grove commented on ARROW-10226: The query also returns the wrong results ... grouping by

[jira] [Updated] (ARROW-6720) [JAVA][C++]Support Parquet Read and Write in Java

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-6720: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [JAVA][C++]Support Parquet Read

[jira] [Commented] (ARROW-6720) [JAVA][C++]Support Parquet Read and Write in Java

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209927#comment-17209927 ] Krisztian Szucs commented on ARROW-6720: It would be nice to have a status update here. Until

[jira] [Updated] (ARROW-10056) [Python] PyArrow writes invalid Feather v2 file: OSError: Verification of flatbuffer-encoded Footer failed.

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-10056: Fix Version/s: (was: 2.0.0) 3.0.0 > [Python] PyArrow writes

[jira] [Updated] (ARROW-10226) [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset

2020-10-07 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-10226: --- Description: I re-installed my desktop a few days ago (now using Ubuntu 20.04 LTS) and when I try

[jira] [Created] (ARROW-10226) [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset

2020-10-07 Thread Andy Grove (Jira)
Andy Grove created ARROW-10226: -- Summary: [Rust] [DataFusion] TPC-H query 1 no longer completes for 100GB dataset Key: ARROW-10226 URL: https://issues.apache.org/jira/browse/ARROW-10226 Project: Apache

[jira] [Commented] (ARROW-7494) [Java] Remove reader index and writer index from ArrowBuf

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209913#comment-17209913 ] Krisztian Szucs commented on ARROW-7494: It's not likely to land in 2.0 so postponing. > [Java]

[jira] [Updated] (ARROW-7494) [Java] Remove reader index and writer index from ArrowBuf

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-7494: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [Java] Remove reader index and

[jira] [Updated] (ARROW-9843) [C++] Implement Between trinary kernel

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9843: -- Labels: pull-request-available (was: ) > [C++] Implement Between trinary kernel >

[jira] [Updated] (ARROW-9572) [CI][Homebrew] Properly enable Gandiva and improve testing

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9572: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [CI][Homebrew] Properly enable

[jira] [Commented] (ARROW-9572) [CI][Homebrew] Properly enable Gandiva and improve testing

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209908#comment-17209908 ] Krisztian Szucs commented on ARROW-9572: Since it has been added upstream I'm postponing to 3.0.

[jira] [Assigned] (ARROW-10225) [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests

2020-10-07 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-10225: -- Assignee: Neville Dipale > [Rust] [Parquet] Fix null bitmap comparisons in roundtrip

[jira] [Updated] (ARROW-10225) [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10225: --- Labels: pull-request-available (was: ) > [Rust] [Parquet] Fix null bitmap comparisons in

[jira] [Created] (ARROW-10225) [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests

2020-10-07 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-10225: -- Summary: [Rust] [Parquet] Fix null bitmap comparisons in roundtrip tests Key: ARROW-10225 URL: https://issues.apache.org/jira/browse/ARROW-10225 Project: Apache

[jira] [Commented] (ARROW-10215) [Rust] [DataFusion] Rename "Source" typedef

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209906#comment-17209906 ] Krisztian Szucs commented on ARROW-10215: - Postponing to 3.0. > [Rust] [DataFusion] Rename

[jira] [Updated] (ARROW-10215) [Rust] [DataFusion] Rename "Source" typedef

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-10215: Fix Version/s: (was: 2.0.0) 3.0.0 > [Rust] [DataFusion] Rename

[jira] [Commented] (ARROW-9847) [Rust] Inconsistent use of import arrow:: vs crate::arrow::

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209903#comment-17209903 ] Krisztian Szucs commented on ARROW-9847: [~andygrove] Is it going to make into 2.0? If not please

[jira] [Commented] (ARROW-9001) [R] Box outputs as correct type in call_function

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209901#comment-17209901 ] Krisztian Szucs commented on ARROW-9001: [~romainfrancois] [~npr] Based on the github

[jira] [Updated] (ARROW-9001) [R] Box outputs as correct type in call_function

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9001: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [R] Box outputs as correct type

[jira] [Resolved] (ARROW-10168) [Rust] [Parquet] Extend arrow schema conversion to projected fields

2020-10-07 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-10168. Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8354

[jira] [Resolved] (ARROW-10144) [Flight] Add support for using the TLS_SNI extension

2020-10-07 Thread James Duong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Duong resolved ARROW-10144. - Resolution: Invalid Verified as already working by [~tifflhl] > [Flight] Add support for using

[jira] [Assigned] (ARROW-10144) [Flight] Add support for using the TLS_SNI extension

2020-10-07 Thread James Duong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Duong reassigned ARROW-10144: --- Assignee: James Duong > [Flight] Add support for using the TLS_SNI extension >

[jira] [Updated] (ARROW-9853) [RUST] Implement "take" kernel for dictionary arrays

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9853: --- Fix Version/s: (was: 3.0.0) 2.0.0 > [RUST] Implement "take" kernel

[jira] [Updated] (ARROW-9585) [Rust] Remove duplicated to-do line in DataFusion readme

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9585: --- Fix Version/s: 2.0.0 > [Rust] Remove duplicated to-do line in DataFusion readme >

[jira] [Updated] (ARROW-9536) Missing parameters in PlasmaOutOfMemoryException.java

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9536: --- Fix Version/s: 2.0.0 > Missing parameters in PlasmaOutOfMemoryException.java >

[jira] [Created] (ARROW-10224) Build Python 3.9 wheels

2020-10-07 Thread Terence Honles (Jira)
Terence Honles created ARROW-10224: -- Summary: Build Python 3.9 wheels Key: ARROW-10224 URL: https://issues.apache.org/jira/browse/ARROW-10224 Project: Apache Arrow Issue Type: Task

[jira] [Commented] (ARROW-10144) [Flight] Add support for using the TLS_SNI extension

2020-10-07 Thread Tiffany Lam (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209883#comment-17209883 ] Tiffany Lam commented on ARROW-10144: - [~jduong] I have verified that there are existing TLS SNI

[jira] [Updated] (ARROW-9508) [Release][APT][Yum] Enable verification for arm64 binaries

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9508?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9508: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [Release][APT][Yum] Enable

[jira] [Updated] (ARROW-9328) [C++][Gandiva] Add LTRIM, RTRIM, BTRIM functions for string

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9328: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Gandiva] Add LTRIM, RTRIM,

[jira] [Updated] (ARROW-10223) [C++] Use timestamp parsers for date32() CSV parsing

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-10223: --- Priority: Minor (was: Major) > [C++] Use timestamp parsers for date32() CSV parsing >

[jira] [Updated] (ARROW-9534) [Rust] [DataFusion] Implement functions for creating literal expressions for all types

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9534: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [Rust] [DataFusion] Implement

[jira] [Created] (ARROW-10223) [C++] Use timestamp parsers for date32() CSV parsing

2020-10-07 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-10223: --- Summary: [C++] Use timestamp parsers for date32() CSV parsing Key: ARROW-10223 URL: https://issues.apache.org/jira/browse/ARROW-10223 Project: Apache Arrow

[jira] [Updated] (ARROW-9205) [Documentation] Fix typos in Columnar.rst

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9205: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [Documentation] Fix typos in

[jira] [Updated] (ARROW-9621) [Python] test_move_file() is failed with fsspec 0.8.0

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9621: --- Fix Version/s: 2.0.0 > [Python] test_move_file() is failed with fsspec 0.8.0 >

[jira] [Updated] (ARROW-9973) [Java] JDBC DateConsumer does not allow dates before epoch

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9973: --- Fix Version/s: 2.0.0 > [Java] JDBC DateConsumer does not allow dates before epoch >

[jira] [Updated] (ARROW-9853) [RUST] Implement "take" kernel for dictionary arrays

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9853: --- Fix Version/s: 3.0.0 > [RUST] Implement "take" kernel for dictionary arrays >

[jira] [Updated] (ARROW-9997) [Python] StructScalar.as_py() fails if the type has duplicate field names

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9997: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [Python] StructScalar.as_py() fails

[jira] [Updated] (ARROW-10183) [C++] Create a ForEach library function that runs on an iterator of futures

2020-10-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10183: - Summary: [C++] Create a ForEach library function that runs on an iterator of futures (was:

[jira] [Commented] (ARROW-1614) [C++] Add a Tensor logical value type with constant dimensions, implemented using ExtensionType

2020-10-07 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209872#comment-17209872 ] Rok Mihevc commented on ARROW-1614: --- I'd like to contribute to this work and will have time available

[jira] [Resolved] (ARROW-10139) [C++] Add support for building arrow_testing without building tests

2020-10-07 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-10139. -- Resolution: Fixed Issue resolved by pull request 8356

[jira] [Commented] (ARROW-9997) [Python] StructScalar.as_py() fails if the type has duplicate field names

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209862#comment-17209862 ] Krisztian Szucs commented on ARROW-9997: I find this issue a bit pressing before the release, but

[jira] [Updated] (ARROW-6607) [Python] Support for set/list columns when converting from Pandas

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-6607: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [Python] Support for set/list

[jira] [Commented] (ARROW-6607) [Python] Support for set/list columns when converting from Pandas

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6607?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209859#comment-17209859 ] Krisztian Szucs commented on ARROW-6607: [~jorisvandenbossche] I'm postponing it to 3.0 so we can

[jira] [Resolved] (ARROW-10099) [C++][Dataset] Also allow integer partition fields to be dictionary encoded

2020-10-07 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-10099. -- Resolution: Fixed Issue resolved by pull request 8367

[jira] [Resolved] (ARROW-9266) [Python][Packaging] Enable S3 support in macOS wheels

2020-10-07 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9266. Resolution: Fixed Issue resolved by pull request 8315

[jira] [Created] (ARROW-10222) [C++] Add FileSystem::MakeUri() to serialize file locations to URIs

2020-10-07 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-10222: Summary: [C++] Add FileSystem::MakeUri() to serialize file locations to URIs Key: ARROW-10222 URL: https://issues.apache.org/jira/browse/ARROW-10222 Project: Apache

[jira] [Resolved] (ARROW-10204) [RUST] [Datafusion] Test failure in aggregate_grouped_empty with simd feature enabled

2020-10-07 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-10204. Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8378

[jira] [Created] (ARROW-10221) Javascript toArray() method ignores nulls on some types.

2020-10-07 Thread Ben Schmidt (Jira)
Ben Schmidt created ARROW-10221: --- Summary: Javascript toArray() method ignores nulls on some types. Key: ARROW-10221 URL: https://issues.apache.org/jira/browse/ARROW-10221 Project: Apache Arrow

[jira] [Created] (ARROW-10220) Cache javascript utf-8 dictionary keys?

2020-10-07 Thread Ben Schmidt (Jira)
Ben Schmidt created ARROW-10220: --- Summary: Cache javascript utf-8 dictionary keys? Key: ARROW-10220 URL: https://issues.apache.org/jira/browse/ARROW-10220 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-10219) [C++] csv::TableReader column names, Read() arguments

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209762#comment-17209762 ] Neal Richardson commented on ARROW-10219: - I didn't know about include_columns, thanks. Here's

[jira] [Updated] (ARROW-3822) [C++] parquet::arrow::FileReader::GetRecordBatchReader may not iterate through chunked columns completely

2020-10-07 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-3822: Fix Version/s: (was: 2.0.0) 3.0.0 > [C++]

[jira] [Resolved] (ARROW-6972) [C#] Should support StructField arrays

2020-10-07 Thread Eric Erhardt (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Erhardt resolved ARROW-6972. - Resolution: Fixed Issue resolved by pull request 8348

[jira] [Resolved] (ARROW-9964) [C++] CSV date support

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9964. --- Resolution: Fixed Issue resolved by pull request 8381

[jira] [Commented] (ARROW-10219) [C++] csv::TableReader column names, Read() arguments

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209724#comment-17209724 ] Antoine Pitrou commented on ARROW-10219: I'm not sure I understand #1, can you explain a bit

[jira] [Created] (ARROW-10219) [C++] csv::TableReader column names, Read() arguments

2020-10-07 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-10219: --- Summary: [C++] csv::TableReader column names, Read() arguments Key: ARROW-10219 URL: https://issues.apache.org/jira/browse/ARROW-10219 Project: Apache Arrow

[jira] [Resolved] (ARROW-9645) [Python] Deprecate the legacy pyarrow.filesystem interface

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9645. --- Resolution: Fixed Issue resolved by pull request 8149

[jira] [Resolved] (ARROW-10196) [C++] Add Future::DeferNotOk()

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-10196. Resolution: Fixed Issue resolved by pull request 8362

[jira] [Commented] (ARROW-9974) [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large number of files using ParquetDataset

2020-10-07 Thread Ashish Gupta (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209705#comment-17209705 ] Ashish Gupta commented on ARROW-9974: - Tried... export MALLOC_MMAP_THRESHOLD_=65536 same error

[jira] [Resolved] (ARROW-10181) [Rust] Arrow tests fail to compile on Raspberry Pi (32 bit)

2020-10-07 Thread Jorge Leitão (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão resolved ARROW-10181. -- Resolution: Fixed Issue resolved by pull request 8353

[jira] [Commented] (ARROW-9974) [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large number of files using ParquetDataset

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209676#comment-17209676 ] Antoine Pitrou commented on ARROW-9974: --- [~kgashish] Can you try what I suggested above? >

[jira] [Resolved] (ARROW-10030) [Rust] Support fromIter and toIter

2020-10-07 Thread Jorge Leitão (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge Leitão resolved ARROW-10030. -- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8211

[jira] [Commented] (ARROW-9974) [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large number of files using ParquetDataset

2020-10-07 Thread Ashish Gupta (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209671#comment-17209671 ] Ashish Gupta commented on ARROW-9974: - Anyone tried to reproduce on centos-8? > [Python][C++]

[jira] [Updated] (ARROW-10172) [Python] pyarrow.concat_arrays segfaults if a resulting StringArray's capacity overflows

2020-10-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10172: - Summary: [Python] pyarrow.concat_arrays segfaults if a resulting StringArray's capacity

[jira] [Updated] (ARROW-10172) [Python] cancat_arrays requires upcast for large array

2020-10-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10172: - Summary: [Python] cancat_arrays requires upcast for large array (was: cancat_arrays requires

[jira] [Commented] (ARROW-10140) [Python][C++] No data for map column of a parquet file created from pyarrow and pandas

2020-10-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209667#comment-17209667 ] Wes McKinney commented on ARROW-10140: -- I'm reopening this until someone confirms that this case is

[jira] [Comment Edited] (ARROW-10140) [Python][C++] No data for map column of a parquet file created from pyarrow and pandas

2020-10-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209667#comment-17209667 ] Wes McKinney edited comment on ARROW-10140 at 10/7/20, 4:39 PM: I'm

[jira] [Reopened] (ARROW-10140) [Python][C++] No data for map column of a parquet file created from pyarrow and pandas

2020-10-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reopened ARROW-10140: -- > [Python][C++] No data for map column of a parquet file created from pyarrow > and pandas >

[jira] [Updated] (ARROW-10088) [R] Don't store "data.table" pointer in metadata

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Fix Version/s: (was: 2.0.0) > [R] Don't store "data.table" pointer in metadata >

[jira] [Updated] (ARROW-10088) [R] Don't store "data.table" pointer in metadata

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Summary: [R] Don't store "data.table" pointer in metadata (was: [R] Issues in restoring

[jira] [Updated] (ARROW-10088) [R] Issues in restoring R metadata for "integer64", "data.table" classes

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10088: Description: Issues with metadata$r: * The ".internal.selfref" attribute from data.table

[jira] [Commented] (ARROW-10088) [R] Issues in restoring R metadata for "integer64", "data.table" classes

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209665#comment-17209665 ] Neal Richardson commented on ARROW-10088: - The integer64 subclass issue doesn't reproduce on

[jira] [Resolved] (ARROW-10217) [CI] Run fewer GitHub Actions jobs

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-10217. - Resolution: Fixed Issue resolved by pull request 8380

[jira] [Resolved] (ARROW-10218) [Python] [C++] Errors when building pyarrow from source

2020-10-07 Thread Andrew Wieteska (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Wieteska resolved ARROW-10218. - Resolution: Fixed > [Python] [C++] Errors when building pyarrow from source >

[jira] [Commented] (ARROW-10218) [Python] [C++] Errors when building pyarrow from source

2020-10-07 Thread Andrew Wieteska (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209648#comment-17209648 ] Andrew Wieteska commented on ARROW-10218: - That was it. Thanks so much!!! > [Python] [C++]

[jira] [Commented] (ARROW-10218) [Python] [C++] Errors when building pyarrow from source

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209646#comment-17209646 ] Antoine Pitrou commented on ARROW-10218: It's {{-DARROW_PYTHON=ON}} and not

[jira] [Updated] (ARROW-10217) [CI] Run fewer GitHub Actions jobs

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10217: --- Labels: pull-request-available (was: ) > [CI] Run fewer GitHub Actions jobs >

[jira] [Resolved] (ARROW-10214) [Python] UnicodeDecodeError when printing schema with binary metadata

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-10214. Resolution: Fixed Issue resolved by pull request 8379

[jira] [Updated] (ARROW-9964) [C++] CSV date support

2020-10-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9964: -- Labels: pull-request-available (was: ) > [C++] CSV date support > -- > >

[jira] [Created] (ARROW-10218) [Python] [C++] Errors when building pyarrow from source

2020-10-07 Thread Andrew Wieteska (Jira)
Andrew Wieteska created ARROW-10218: --- Summary: [Python] [C++] Errors when building pyarrow from source Key: ARROW-10218 URL: https://issues.apache.org/jira/browse/ARROW-10218 Project: Apache Arrow

[jira] [Resolved] (ARROW-10093) [R] Add ability to opt-out of int64 -> int demotion

2020-10-07 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-10093. - Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8341

[jira] [Commented] (ARROW-10100) [C++][Dataset] Ability to read/subset a ParquetFileFragment with given set of row group ids

2020-10-07 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209624#comment-17209624 ] Joris Van den Bossche commented on ARROW-10100: --- [~bkietz] thoughts on the return value

[jira] [Created] (ARROW-10217) [CI] Run fewer GitHub Actions jobs

2020-10-07 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-10217: --- Summary: [CI] Run fewer GitHub Actions jobs Key: ARROW-10217 URL: https://issues.apache.org/jira/browse/ARROW-10217 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-10214) [Python] UnicodeDecodeError when printing schema with binary metadata

2020-10-07 Thread Paul Balanca (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17209614#comment-17209614 ] Paul Balanca commented on ARROW-10214: -- That was fast! Thanks, amazing support :) > [Python]

[jira] [Resolved] (ARROW-7960) [C++][Parquet] Add support for schema translation from parquet nodes back to arrow for missing types

2020-10-07 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-7960. --- Resolution: Fixed Issue resolved by pull request 8376

[jira] [Created] (ARROW-10216) [Rust] Simd implementation of min/max aggregation kernels for primitive types

2020-10-07 Thread Jörn Horstmann (Jira)
Jörn Horstmann created ARROW-10216: -- Summary: [Rust] Simd implementation of min/max aggregation kernels for primitive types Key: ARROW-10216 URL: https://issues.apache.org/jira/browse/ARROW-10216
