[jira] [Updated] (ARROW-6053) [Python] RecordBatchStreamReader::Open2 cdef type signature doesn't match C++

2019-07-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6053: -- Labels: pull-request-available (was: ) > [Python] RecordBatchStreamReader::Open2 cdef type sig

[jira] [Created] (ARROW-6053) [Python] RecordBatchStreamReader::Open2 cdef type signature doesn't match C++

2019-07-26 Thread Paul Taylor (JIRA)
Paul Taylor created ARROW-6053: -- Summary: [Python] RecordBatchStreamReader::Open2 cdef type signature doesn't match C++ Key: ARROW-6053 URL: https://issues.apache.org/jira/browse/ARROW-6053 Project: Apac

[jira] [Updated] (ARROW-6042) [C++] Implement alternative DictionaryBuilder that always yields int32 indices

2019-07-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6042: -- Labels: pull-request-available (was: ) > [C++] Implement alternative DictionaryBuilder that al

[jira] [Resolved] (ARROW-3772) [C++] Read Parquet dictionary encoded ColumnChunks directly into an Arrow DictionaryArray

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-3772. - Resolution: Fixed Issue resolved by pull request 4949 [https://github.com/apache/arrow/pull/4949]

[jira] [Created] (ARROW-6052) [C++] Divide up arrow/array.h,cc into files in arrow/array/ similar to builder files

2019-07-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6052: --- Summary: [C++] Divide up arrow/array.h,cc into files in arrow/array/ similar to builder files Key: ARROW-6052 URL: https://issues.apache.org/jira/browse/ARROW-6052 Proj

[jira] [Assigned] (ARROW-6042) [C++] Implement alternative DictionaryBuilder that always yields int32 indices

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6042: --- Assignee: Wes McKinney > [C++] Implement alternative DictionaryBuilder that always yields in

[jira] [Commented] (ARROW-6051) [C++][Python] Parquet float column of NaN writing performance regression from 0.13.0 to 0.14.1

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894167#comment-16894167 ] Wes McKinney commented on ARROW-6051: - I made before and after flamegraphs using {co

[jira] [Updated] (ARROW-6051) [C++][Python] Parquet float column of NaN writing performance regression from 0.13.0 to 0.14.1

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6051: Attachment: perf_before.svg > [C++][Python] Parquet float column of NaN writing performance regress

[jira] [Updated] (ARROW-6051) [C++][Python] Parquet float column of NaN writing performance regression from 0.13.0 to 0.14.1

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6051: Attachment: perf.svg > [C++][Python] Parquet float column of NaN writing performance regression fro

[jira] [Updated] (ARROW-6051) [C++][Python] Parquet float column of NaN writing performance regression from 0.13.0 to 0.14.1

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6051: Summary: [C++][Python] Parquet float column of NaN writing performance regression from 0.13.0 to 0.

[jira] [Created] (ARROW-6051) [C++][Python] Parquet float column writing performance regression from 0.13.0 to 0.14.1

2019-07-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6051: --- Summary: [C++][Python] Parquet float column writing performance regression from 0.13.0 to 0.14.1 Key: ARROW-6051 URL: https://issues.apache.org/jira/browse/ARROW-6051 P

[jira] [Resolved] (ARROW-6045) [C++] Benchmark for Parquet float and NaN encoding/decoding

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6045. - Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request 4915 [https://github

[jira] [Assigned] (ARROW-6045) [C++] Benchmark for Parquet float and NaN encoding/decoding

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6045: --- Assignee: Itamar Turner-Trauring > [C++] Benchmark for Parquet float and NaN encoding/decodi

[jira] [Created] (ARROW-6050) [Java] Update out-of-date java/flight/README.md

2019-07-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6050: --- Summary: [Java] Update out-of-date java/flight/README.md Key: ARROW-6050 URL: https://issues.apache.org/jira/browse/ARROW-6050 Project: Apache Arrow Issue Type

[jira] [Resolved] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Sutou Kouhei (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sutou Kouhei resolved ARROW-6047. - Resolution: Fixed Issue resolved by pull request 4954 [https://github.com/apache/arrow/pull/4954]

[jira] [Assigned] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Sutou Kouhei (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sutou Kouhei reassigned ARROW-6047: --- Assignee: Chao Sun > [Rust] Rust nightly 1.38.0 builds failing > ---

[jira] [Commented] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Chao Sun (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894120#comment-16894120 ] Chao Sun commented on ARROW-6047: - {quote} Uh, that's not good. I'm concerned about havin

[jira] [Updated] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6047: -- Labels: pull-request-available (was: ) > [Rust] Rust nightly 1.38.0 builds failing > -

[jira] [Created] (ARROW-6048) [C++] Add ChunkedArray::View which calls to Array::View

2019-07-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6048: --- Summary: [C++] Add ChunkedArray::View which calls to Array::View Key: ARROW-6048 URL: https://issues.apache.org/jira/browse/ARROW-6048 Project: Apache Arrow Is

[jira] [Commented] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Chao Sun (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894082#comment-16894082 ] Chao Sun commented on ARROW-6047: - Hmm I didn't know that it will uses 2.6.0 even though

[jira] [Created] (ARROW-6049) [C++] Support using Array::View from compatible dictionary type to another

2019-07-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6049: --- Summary: [C++] Support using Array::View from compatible dictionary type to another Key: ARROW-6049 URL: https://issues.apache.org/jira/browse/ARROW-6049 Project: Apach

[jira] [Commented] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Andy Grove (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894076#comment-16894076 ] Andy Grove commented on ARROW-6047: --- The Arrow parquet module compiles against parquet-

[jira] [Commented] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894075#comment-16894075 ] Wes McKinney commented on ARROW-6047: - Uh, that's not good. I'm concerned about havin

[jira] [Commented] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Andy Grove (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894074#comment-16894074 ] Andy Grove commented on ARROW-6047: --- The issue seems to be due to a new release of the

[jira] [Created] (ARROW-6047) [Rust] Rust nightly 1.38.0 builds failing

2019-07-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-6047: --- Summary: [Rust] Rust nightly 1.38.0 builds failing Key: ARROW-6047 URL: https://issues.apache.org/jira/browse/ARROW-6047 Project: Apache Arrow Issue Type: Bug

[jira] [Closed] (ARROW-6028) [C++][Python] Failed to compile on windows platform using arrow

2019-07-26 Thread Haowei Yu (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haowei Yu closed ARROW-6028. Resolution: Not A Problem > [C++][Python] Failed to compile on windows platform using arrow > -

[jira] [Commented] (ARROW-6028) [C++][Python] Failed to compile on windows platform using arrow

2019-07-26 Thread Haowei Yu (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16894001#comment-16894001 ] Haowei Yu commented on ARROW-6028: -- [~wesmckinn] Thanks for your advice. After adding in

[jira] [Closed] (ARROW-6044) [Python] Pyarrow HDFS client gets hung after a while

2019-07-26 Thread Fred Tzeng (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Fred Tzeng closed ARROW-6044. - Resolution: Feedback Received > [Python] Pyarrow HDFS client gets hung after a while > --

[jira] [Commented] (ARROW-6044) [Python] Pyarrow HDFS client gets hung after a while

2019-07-26 Thread Fred Tzeng (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16893966#comment-16893966 ] Fred Tzeng commented on ARROW-6044: --- Thanks for the feedback, I will check with the Had

[jira] [Commented] (ARROW-6046) Slice RecordBatch of String array with offset 0 returns whole batch

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16893964#comment-16893964 ] Wes McKinney commented on ARROW-6046: - The offsets buffer is not truncated in the fir

[jira] [Updated] (ARROW-6046) [C++] Slice RecordBatch of String array with offset 0 returns whole batch

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6046: Summary: [C++] Slice RecordBatch of String array with offset 0 returns whole batch (was: Slice Rec

[jira] [Updated] (ARROW-6046) Slice RecordBatch of String array with offset 0 returns whole batch

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6046: Fix Version/s: 1.0.0 > Slice RecordBatch of String array with offset 0 returns whole batch > --

[jira] [Updated] (ARROW-6045) [C++] Benchmark for Parquet float and NaN encoding/decoding

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6045: Summary: [C++] Benchmark for Parquet float and NaN encoding/decoding (was: Benchmark for Parquet f

[jira] [Updated] (ARROW-6046) Slice RecordBatch of String array with offset 0 returns whole batch

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6046: Priority: Major (was: Blocker) > Slice RecordBatch of String array with offset 0 returns whole bat

[jira] [Updated] (ARROW-6044) [Python] Pyarrow HDFS client gets hung after a while

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6044: Labels: hdfs (was: ) > [Python] Pyarrow HDFS client gets hung after a while >

[jira] [Updated] (ARROW-6044) [Python] Pyarrow HDFS client gets hung after a while

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6044: Summary: [Python] Pyarrow HDFS client gets hung after a while (was: Pyarrow HDFS client gets hung

[jira] [Commented] (ARROW-6044) Pyarrow HDFS client gets hung after a while

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16893872#comment-16893872 ] Wes McKinney commented on ARROW-6044: - We're passing through calls to libhdfs. It's p

[jira] [Updated] (ARROW-6044) Pyarrow HDFS client gets hung after a while

2019-07-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6044: Priority: Major (was: Blocker) > Pyarrow HDFS client gets hung after a while > ---

[jira] [Updated] (ARROW-6046) Slice RecordBatch of String array with offset 0 returns whole batch

2019-07-26 Thread Sascha Hofmann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sascha Hofmann updated ARROW-6046: -- Summary: Slice RecordBatch of String array with offset 0 returns whole batch (was: Slice Recor

[jira] [Created] (ARROW-6046) Slice RecordBatch of String array with offset 0

2019-07-26 Thread Sascha Hofmann (JIRA)
Sascha Hofmann created ARROW-6046: - Summary: Slice RecordBatch of String array with offset 0 Key: ARROW-6046 URL: https://issues.apache.org/jira/browse/ARROW-6046 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-6038) [Python] pyarrow.Table.from_batches produces corrupted table if any of the batches were empty

2019-07-26 Thread Piotr Bajger (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piotr Bajger updated ARROW-6038: Affects Version/s: 0.14.1 > [Python] pyarrow.Table.from_batches produces corrupted table if any of

[jira] [Updated] (ARROW-6045) Benchmark for Parquet float and NaN encoding/decoding

2019-07-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6045: -- Labels: pull-request-available (was: ) > Benchmark for Parquet float and NaN encoding/decoding

[jira] [Commented] (ARROW-6045) Benchmark for Parquet float and NaN encoding/decoding

2019-07-26 Thread Itamar Turner-Trauring (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16893676#comment-16893676 ] Itamar Turner-Trauring commented on ARROW-6045: --- PR: https://github.com/apa

[jira] [Created] (ARROW-6045) Benchmark for Parquet float and NaN encoding/decoding

2019-07-26 Thread Itamar Turner-Trauring (JIRA)
Itamar Turner-Trauring created ARROW-6045: - Summary: Benchmark for Parquet float and NaN encoding/decoding Key: ARROW-6045 URL: https://issues.apache.org/jira/browse/ARROW-6045 Project: Apache

[jira] [Updated] (ARROW-5959) [C++][CI] Fuzzit does not know about branch + commit hash

2019-07-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5959: -- Labels: CI fuzzer pull-request-available (was: CI fuzzer) > [C++][CI] Fuzzit does not know abo

[jira] [Assigned] (ARROW-5959) [C++][CI] Fuzzit does not know about branch + commit hash

2019-07-26 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-5959: Assignee: Marco Neumann (was: Yevgeny Pats) > [C++][CI] Fuzzit does not know about branch

[jira] [Resolved] (ARROW-5967) [Java] DateUtility#timeZoneList is not correct

2019-07-26 Thread Pindikura Ravindra (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pindikura Ravindra resolved ARROW-5967. --- Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request 4904 [ht

[jira] [Updated] (ARROW-6038) [Python] pyarrow.Table.from_batches produces corrupted table if any of the batches were empty

2019-07-26 Thread Piotr Bajger (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piotr Bajger updated ARROW-6038: Description: When creating a Table from a list/iterator of batches which contains an "empty" Recor

[jira] [Updated] (ARROW-6038) [Python] pyarrow.Table.from_batches produces corrupted table if any of the batches were empty

2019-07-26 Thread Piotr Bajger (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piotr Bajger updated ARROW-6038: Affects Version/s: 0.14.0 > [Python] pyarrow.Table.from_batches produces corrupted table if any of

[jira] [Commented] (ARROW-6038) [Python] pyarrow.Table.from_batches produces corrupted table if any of the batches were empty

2019-07-26 Thread Piotr Bajger (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16893383#comment-16893383 ] Piotr Bajger commented on ARROW-6038: - Yes, it does, I updated the version labels. >

[jira] [Updated] (ARROW-6038) [Python] pyarrow.Table.from_batches produces corrupted table if any of the batches were empty

2019-07-26 Thread Piotr Bajger (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piotr Bajger updated ARROW-6038: Description: When creating a Table from an list/iterator of batches which contains an "empty" Reco

[jira] [Comment Edited] (ARROW-6038) [Python] pyarrow.Table.from_batches produces corrupted table if any of the batches were empty

2019-07-26 Thread Piotr Bajger (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16893383#comment-16893383 ] Piotr Bajger edited comment on ARROW-6038 at 7/26/19 6:59 AM: -