[jira] [Resolved] (ARROW-8280) [C++] MinGW builds failing due to CARES-related toolchain issue

2020-03-31 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-8280. - Resolution: Fixed Issue resolved by pull request 6778 [https://github.com/apache/arrow/pull/6778]

[jira] [Created] (ARROW-8281) [R] Name collision of arrow.dll on Windows

2020-03-31 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8281: --- Summary: [R] Name collision of arrow.dll on Windows Key: ARROW-8281 URL: https://issues.apache.org/jira/browse/ARROW-8281 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-8063) [Python] Add user guide documentation for Datasets API

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8063?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8063: -- Labels: documentation parquet pull-request-available (was: documentation parquet) > [Python]

[jira] [Resolved] (ARROW-8198) [C++] Diffing should handle null arrays

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8198. --- Fix Version/s: (was: 1.0.0) 0.17.0 Resolution: Fixed Issue reso

[jira] [Updated] (ARROW-8238) [C++][Compute] Failed to build compute tests on windows with msvc2015

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-8238: -- Priority: Blocker (was: Critical) > [C++][Compute] Failed to build compute tests on windows wi

[jira] [Resolved] (ARROW-8271) [Packaging] Allow wheel upload failures to gemfury

2020-03-31 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-8271. Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6761 [https:/

[jira] [Updated] (ARROW-8185) [Packaging] Document the available nightly wheels and conda packages

2020-03-31 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8185: --- Summary: [Packaging] Document the available nightly wheels and conda packages (was: [Packagi

[jira] [Updated] (ARROW-8185) [Packaging] Document the available nightly wheels and conda packages

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8185: -- Labels: pull-request-available (was: ) > [Packaging] Document the available nightly wheels and

[jira] [Created] (ARROW-8282) [C++/Python][Dataset] Support schema evolution for integer columns

2020-03-31 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8282: --- Summary: [C++/Python][Dataset] Support schema evolution for integer columns Key: ARROW-8282 URL: https://issues.apache.org/jira/browse/ARROW-8282 Project: Apache Arrow

[jira] [Created] (ARROW-8283) [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset

2020-03-31 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8283: --- Summary: [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset Key: ARROW-8283 URL: https://issues.apache.org/jira/browse/ARROW-8283 Pro

[jira] [Created] (ARROW-8284) [C++][Dataset] Schema evolution for timestamp columns

2020-03-31 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8284: --- Summary: [C++][Dataset] Schema evolution for timestamp columns Key: ARROW-8284 URL: https://issues.apache.org/jira/browse/ARROW-8284 Project: Apache Arrow Issue Type:

[GitHub] [arrow-testing] pitrou merged pull request #24: PARQUET-1831: [C++] Add Parquet fuzz files

2020-03-31 Thread GitBox
pitrou merged pull request #24: PARQUET-1831: [C++] Add Parquet fuzz files URL: https://github.com/apache/arrow-testing/pull/24 This is an automated message from the Apache Git Service. To respond to the message, please log o

[GitHub] [arrow-testing] pitrou opened a new pull request #24: PARQUET-1831: [C++] Add Parquet fuzz files

2020-03-31 Thread GitBox
pitrou opened a new pull request #24: PARQUET-1831: [C++] Add Parquet fuzz files URL: https://github.com/apache/arrow-testing/pull/24 This is an automated message from the Apache Git Service. To respond to the message, please

[jira] [Updated] (ARROW-8284) [C++][Dataset] Schema evolution for timestamp columns

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8284: - Docs Text: (was: In a dataset, one can timestamp columns with different resolut

[jira] [Updated] (ARROW-8283) [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8283: - Description: When passing a list of files to the constructor of {{pyarrow.datase

[jira] [Updated] (ARROW-8284) [C++][Dataset] Schema evolution for timestamp columns

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8284: - Description: In a dataset, one can have timestamp columns with different resoluti

[jira] [Updated] (ARROW-8284) [C++][Dataset] Schema evolution for timestamp columns

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8284: - Description: In a dataset, one can timestamp columns with different resolutions.

[jira] [Created] (ARROW-8285) [Python][Dataset] ScalarExpression doesn't accept numpy scalars

2020-03-31 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8285: --- Summary: [Python][Dataset] ScalarExpression doesn't accept numpy scalars Key: ARROW-8285 URL: https://issues.apache.org/jira/browse/ARROW-8285 Project: Apache Arrow I

[GitHub] [arrow-testing] pitrou merged pull request #25: PARQUET-1831: [C++] Add Parquet fuzz files

2020-03-31 Thread GitBox
pitrou merged pull request #25: PARQUET-1831: [C++] Add Parquet fuzz files URL: https://github.com/apache/arrow-testing/pull/25 This is an automated message from the Apache Git Service. To respond to the message, please log o

[GitHub] [arrow-testing] pitrou opened a new pull request #25: PARQUET-1831: [C++] Add Parquet fuzz files

2020-03-31 Thread GitBox
pitrou opened a new pull request #25: PARQUET-1831: [C++] Add Parquet fuzz files URL: https://github.com/apache/arrow-testing/pull/25 This is an automated message from the Apache Git Service. To respond to the message, please

[jira] [Created] (ARROW-8286) [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset

2020-03-31 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8286: Summary: [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset Key: ARROW-8286 URL: https://issues.apache.org/jira/browse/ARR

[jira] [Updated] (ARROW-8286) [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8286: - Labels: dataset (was: ) > [Python] Creating dataset from pathlib results in Unio

[jira] [Updated] (ARROW-8286) [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8286: - Component/s: Python > [Python] Creating dataset from pathlib results in UnionData

[jira] [Updated] (ARROW-8230) [Java] Move Netty memory manager into a separate module

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8230: -- Labels: pull-request-available (was: ) > [Java] Move Netty memory manager into a separate modu

[jira] [Updated] (ARROW-8286) [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8286: -- Labels: dataset pull-request-available (was: dataset) > [Python] Creating dataset from pathlib

[jira] [Created] (ARROW-8287) [Rust] Arrow examples should use utility to print results

2020-03-31 Thread Andy Grove (Jira)
Andy Grove created ARROW-8287: - Summary: [Rust] Arrow examples should use utility to print results Key: ARROW-8287 URL: https://issues.apache.org/jira/browse/ARROW-8287 Project: Apache Arrow Issu

[jira] [Created] (ARROW-8288) [Python] Expose with_ modifiers on DataType

2020-03-31 Thread Uwe Korn (Jira)
Uwe Korn created ARROW-8288: --- Summary: [Python] Expose with_ modifiers on DataType Key: ARROW-8288 URL: https://issues.apache.org/jira/browse/ARROW-8288 Project: Apache Arrow Issue Type: Improvemen

[jira] [Created] (ARROW-8289) Implement Arrow Parquet writer

2020-03-31 Thread Andy Grove (Jira)
Andy Grove created ARROW-8289: - Summary: Implement Arrow Parquet writer Key: ARROW-8289 URL: https://issues.apache.org/jira/browse/ARROW-8289 Project: Apache Arrow Issue Type: New Feature

[jira] [Updated] (ARROW-8288) [Python] Expose with_ modifiers on DataType

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8288: -- Labels: pull-request-available (was: ) > [Python] Expose with_ modifiers on DataType > ---

[jira] [Commented] (ARROW-8244) [Python][Parquet] Add `write_to_dataset` option to populate the "file_path" metadata fields

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071845#comment-17071845 ] Joris Van den Bossche commented on ARROW-8244: -- So to summarize the issue: t

[jira] [Updated] (ARROW-8289) Implement Arrow Parquet writer

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8289: -- Labels: pull-request-available (was: ) > Implement Arrow Parquet writer >

[jira] [Created] (ARROW-8290) [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor

2020-03-31 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8290: Summary: [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor Key: ARROW-8290 URL: https://issues.apache.org/jira/browse/ARROW-8290

[jira] [Updated] (ARROW-8290) [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8290: - Labels: dataset (was: ) > [Python][Dataset] Improve ergonomy of the FileSystemDa

[jira] [Commented] (ARROW-8281) [R] Name collision of arrow.dll on Windows

2020-03-31 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071856#comment-17071856 ] Neal Richardson commented on ARROW-8281: Not sure if this is possible, but if it

[jira] [Updated] (ARROW-8281) [R] Name collision of arrow.dll on Windows conda

2020-03-31 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8281: --- Summary: [R] Name collision of arrow.dll on Windows conda (was: [R] Name collision of arrow.

[jira] [Commented] (ARROW-8290) [Python][Dataset] Improve ergonomy of the FileSystemDataset constructor

2020-03-31 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071858#comment-17071858 ] Ben Kietzman commented on ARROW-8290: - Small amenity: if an empty vector is passed fo

[jira] [Updated] (ARROW-8289) [Rust] Implement Arrow Parquet writer

2020-03-31 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-8289: -- Summary: [Rust] Implement Arrow Parquet writer (was: Implement Arrow Parquet writer) > [Rust] Impleme

[jira] [Closed] (ARROW-8283) [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset

2020-03-31 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson closed ARROW-8283. -- Resolution: Duplicate > [C++/Python][Dataset] Non-existent files are silently dropped in > pa.

[jira] [Commented] (ARROW-8283) [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071871#comment-17071871 ] Joris Van den Bossche commented on ARROW-8283: -- [~npr] I am not fully sure i

[jira] [Updated] (ARROW-8291) [Packaging] Conda nightly builds can't locate Numpy

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8291: -- Labels: pull-request-available (was: ) > [Packaging] Conda nightly builds can't locate Numpy >

[jira] [Created] (ARROW-8291) [Packaging] Conda nightly builds can't locate Numpy

2020-03-31 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8291: -- Summary: [Packaging] Conda nightly builds can't locate Numpy Key: ARROW-8291 URL: https://issues.apache.org/jira/browse/ARROW-8291 Project: Apache Arrow

[jira] [Commented] (ARROW-8283) [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset

2020-03-31 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071880#comment-17071880 ] Neal Richardson commented on ARROW-8283: Oh, alright, I'll reopen, and we can clo

[jira] [Reopened] (ARROW-8283) [C++/Python][Dataset] Non-existent files are silently dropped in pa.dataset.FileSystemDataset

2020-03-31 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8283?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reopened ARROW-8283: > [C++/Python][Dataset] Non-existent files are silently dropped in > pa.dataset.FileSystemData

[jira] [Resolved] (ARROW-8238) [C++][Compute] Failed to build compute tests on windows with msvc2015

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8238. --- Resolution: Fixed Issue resolved by pull request 6775 [https://github.com/apache/arrow/pull/6

[jira] [Created] (ARROW-8292) [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function

2020-03-31 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8292: Summary: [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function Key: ARROW-8292 URL: https://issues.apache.org/jira/browse/ARROW-8292

[jira] [Updated] (ARROW-8292) [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8292: - Description: This is already a very simple fix to allow manually specifying the s

[jira] [Updated] (ARROW-8292) [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8292: - Parent: ARROW-8221 Issue Type: Sub-task (was: Task) > [Python][Dataset]

[jira] [Updated] (ARROW-8292) [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8292: - Fix Version/s: 0.17.0 > [Python][Dataset] Passthrough schema to Factory.finish()

[jira] [Assigned] (ARROW-8292) [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8292: Assignee: Joris Van den Bossche > [Python][Dataset] Passthrough schema to

[jira] [Updated] (ARROW-8292) [Python][Dataset] Passthrough schema to Factory.finish() in dataset() function

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8292: -- Labels: pull-request-available (was: ) > [Python][Dataset] Passthrough schema to Factory.finis

[jira] [Created] (ARROW-8293) [Python] Run flake8 on python/examples also

2020-03-31 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8293: --- Summary: [Python] Run flake8 on python/examples also Key: ARROW-8293 URL: https://issues.apache.org/jira/browse/ARROW-8293 Project: Apache Arrow Issue Type: Im

[jira] [Created] (ARROW-8294) [Format][Flight] Add DoExchange RPC to Flight protocol

2020-03-31 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8294: --- Summary: [Format][Flight] Add DoExchange RPC to Flight protocol Key: ARROW-8294 URL: https://issues.apache.org/jira/browse/ARROW-8294 Project: Apache Arrow Iss

[jira] [Updated] (ARROW-8295) [C++][Dataset] IpcFileFormat should expliclity push down column projection

2020-03-31 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-8295: Fix Version/s: (was: 0.17.0) 1.0.0 > [C++][Dataset] IpcFileFormat should exp

[jira] [Created] (ARROW-8295) [C++][Dataset] IpcFileFormat should expliclity push down column projection

2020-03-31 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-8295: --- Summary: [C++][Dataset] IpcFileFormat should expliclity push down column projection Key: ARROW-8295 URL: https://issues.apache.org/jira/browse/ARROW-8295 Project: Apach

[jira] [Updated] (ARROW-8294) [Format][Flight] Add DoExchange RPC to Flight protocol

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8294: -- Labels: pull-request-available (was: ) > [Format][Flight] Add DoExchange RPC to Flight protoco

[jira] [Resolved] (ARROW-8294) [Format][Flight] Add DoExchange RPC to Flight protocol

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8294. - Resolution: Fixed Issue resolved by pull request 6686 [https://github.com/apache/arrow/pull/6686]

[jira] [Updated] (ARROW-8295) [C++][Dataset] IpcFileFormat should expliclity push down column projection

2020-03-31 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-8295: Description: {{IpcReadOptions::included_fields}} allows explicit skipping read/decompression of fie

[jira] [Created] (ARROW-8296) [C++][Dataset] IpcFileFormat should support writing files with compressed buffers

2020-03-31 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-8296: --- Summary: [C++][Dataset] IpcFileFormat should support writing files with compressed buffers Key: ARROW-8296 URL: https://issues.apache.org/jira/browse/ARROW-8296 Project

[jira] [Resolved] (ARROW-8286) [Python] Creating dataset from pathlib results in UnionDataset instead of FileSystemDataset

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8286. - Resolution: Fixed Issue resolved by pull request 6783 [https://github.com/apache/arrow/pull/6783]

[jira] [Updated] (ARROW-8293) [Python] Run flake8 on python/examples also

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8293: Fix Version/s: (was: 0.17.0) > [Python] Run flake8 on python/examples also > --

[jira] [Resolved] (ARROW-8288) [Python] Expose with_ modifiers on DataType

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8288. - Resolution: Fixed Issue resolved by pull request 6784 [https://github.com/apache/arrow/pull/6784]

[jira] [Updated] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3329: Fix Version/s: (was: 0.17.0) 1.0.0 > [Python] Error casting decimal(38, 4) t

[jira] [Assigned] (ARROW-7847) [Website] Write a blog post about fuzzing

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-7847: - Assignee: Antoine Pitrou > [Website] Write a blog post about fuzzing > -

[jira] [Updated] (ARROW-8295) [C++][Dataset] IpcFileFormat should expliclity push down column projection

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8295: -- Labels: pull-request-available (was: ) > [C++][Dataset] IpcFileFormat should expliclity push d

[jira] [Updated] (ARROW-7847) [Website] Write a blog post about fuzzing

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7847: -- Labels: pull-request-available (was: ) > [Website] Write a blog post about fuzzing > -

[jira] [Resolved] (ARROW-8270) [Python][Flight] Example Flight server with TLS's certificate and key is not working

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8270. --- Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6787 [https://g

[jira] [Created] (ARROW-8297) [FlightRPC][C++] Implement Flight DoExchange for C++

2020-03-31 Thread David Li (Jira)
David Li created ARROW-8297: --- Summary: [FlightRPC][C++] Implement Flight DoExchange for C++ Key: ARROW-8297 URL: https://issues.apache.org/jira/browse/ARROW-8297 Project: Apache Arrow Issue Type: N

[jira] [Created] (ARROW-8298) [C++][CI] MinGW builds fail building grpc

2020-03-31 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8298: - Summary: [C++][CI] MinGW builds fail building grpc Key: ARROW-8298 URL: https://issues.apache.org/jira/browse/ARROW-8298 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-8298) [C++][CI] MinGW builds fail building grpc

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17071972#comment-17071972 ] Antoine Pitrou commented on ARROW-8298: --- cc [~kou]   > [C++][CI] MinGW builds fai

[jira] [Updated] (ARROW-8297) [FlightRPC][C++] Implement Flight DoExchange for C++

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8297: -- Labels: pull-request-available (was: ) > [FlightRPC][C++] Implement Flight DoExchange for C++

[jira] [Created] (ARROW-8299) [C++] Reusable "optional ParallelFor" function for optional use of multithreading

2020-03-31 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8299: --- Summary: [C++] Reusable "optional ParallelFor" function for optional use of multithreading Key: ARROW-8299 URL: https://issues.apache.org/jira/browse/ARROW-8299 Project

[jira] [Created] (ARROW-8300) [R] Documentation and changelog updates for 0.17

2020-03-31 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8300: -- Summary: [R] Documentation and changelog updates for 0.17 Key: ARROW-8300 URL: https://issues.apache.org/jira/browse/ARROW-8300 Project: Apache Arrow Iss

[jira] [Created] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-03-31 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8301: -- Summary: [C++][Python][R] Handle ChunkedArray and Table in C data interface Key: ARROW-8301 URL: https://issues.apache.org/jira/browse/ARROW-8301 Project: Apache

[jira] [Commented] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-31 Thread Jacek Pliszka (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072066#comment-17072066 ] Jacek Pliszka commented on ARROW-3329: -- I did start from fresh git clone. Thank you

[jira] [Commented] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-31 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072070#comment-17072070 ] Antoine Pitrou commented on ARROW-3329: --- We do have a docker-compose setup. For exa

[jira] [Assigned] (ARROW-8221) [Python][Dataset] Expose schema inference / validation options in the factory

2020-03-31 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-8221: -- Assignee: Joris Van den Bossche (was: Krisztian Szucs) > [Python][Dataset] Expose sch

[jira] [Assigned] (ARROW-8079) [Python] Implement a wrapper for KeyValueMetadata, duck-typing dict where relevant

2020-03-31 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-8079: -- Assignee: Krisztian Szucs > [Python] Implement a wrapper for KeyValueMetadata, duck-ty

[jira] [Updated] (ARROW-7740) [R] Crash/bad data in converting Arrow list struct type

2020-03-31 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-7740: -- Component/s: (was: R) > [R] Crash/bad data in converting Arrow list struct

[jira] [Updated] (ARROW-7740) [R] Crash/bad data in converting Arrow list struct type

2020-03-31 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-7740: -- Priority: Critical (was: Minor) > [R] Crash/bad data in converting Arrow list

[jira] [Updated] (ARROW-7740) [R] Crash/bad data in converting Arrow list struct type

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7740: -- Labels: pull-request-available (was: ) > [R] Crash/bad data in converting Arrow list struct ty

[jira] [Created] (ARROW-8302) Start plasma store with STDOUT, STDERR arguments

2020-03-31 Thread Tal Pritzker (Jira)
Tal Pritzker created ARROW-8302: --- Summary: Start plasma store with STDOUT, STDERR arguments Key: ARROW-8302 URL: https://issues.apache.org/jira/browse/ARROW-8302 Project: Apache Arrow Issue Typ

[jira] [Updated] (ARROW-8221) [Python][Dataset] Expose schema inference / validation options in the factory

2020-03-31 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8221: - Fix Version/s: (was: 0.17.0) 1.0.0 > [Python][Dataset] Exp

[jira] [Updated] (ARROW-8213) [Python][Dataset] Opening a dataset with a local incorrect path gives confusing error message

2020-03-31 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-8213: -- Component/s: C++ - Dataset > [Python][Dataset] Opening a dataset with a local i

[jira] [Commented] (ARROW-8213) [Python][Dataset] Opening a dataset with a local incorrect path gives confusing error message

2020-03-31 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072174#comment-17072174 ] Francois Saint-Jacques commented on ARROW-8213: --- Make the default construct

[jira] [Commented] (ARROW-8282) [C++/Python][Dataset] Support schema evolution for integer columns

2020-03-31 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072178#comment-17072178 ] Francois Saint-Jacques commented on ARROW-8282: --- Once we have instanciated

[jira] [Updated] (ARROW-8079) [Python] Implement a wrapper for KeyValueMetadata, duck-typing dict where relevant

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8079: -- Labels: pull-request-available (was: ) > [Python] Implement a wrapper for KeyValueMetadata, du

[jira] [Resolved] (ARROW-8218) [C++] Parallelize decompression at field level in experimental IPC compression code

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8218. - Resolution: Fixed Issue resolved by pull request 6777 [https://github.com/apache/arrow/pull/6777]

[jira] [Commented] (ARROW-3329) [Python] Error casting decimal(38, 4) to int64

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072210#comment-17072210 ] Wes McKinney commented on ARROW-3329: - Right, the docker-compose setup would be a goo

[jira] [Resolved] (ARROW-8279) [C++] Do not export symbols from Codec implementations, remove need for PIMPL pattern

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8279. - Resolution: Fixed Issue resolved by pull request 6774 [https://github.com/apache/arrow/pull/6774]

[jira] [Resolved] (ARROW-8277) [Python] RecordBatch interface improvements

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8277. - Resolution: Fixed Issue resolved by pull request 6768 [https://github.com/apache/arrow/pull/6768]

[jira] [Assigned] (ARROW-8298) [C++][CI] MinGW builds fail building grpc

2020-03-31 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou reassigned ARROW-8298: --- Assignee: Kouhei Sutou > [C++][CI] MinGW builds fail building grpc > ---

[jira] [Commented] (ARROW-8217) [R][C++] Fix crashing test in test-dataset.R on 32-bit Windows from ARROW-7979

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072270#comment-17072270 ] Wes McKinney commented on ARROW-8217: - Since the debug build issue should be resolved

[jira] [Assigned] (ARROW-8291) [Packaging] Conda nightly builds can't locate Numpy

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8291: --- Assignee: Krisztian Szucs > [Packaging] Conda nightly builds can't locate Numpy > --

[jira] [Resolved] (ARROW-7428) [Format][C++] Add serialization for CSF sparse tensors

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7428. - Resolution: Fixed Issue resolved by pull request 6340 [https://github.com/apache/arrow/pull/6340]

[jira] [Updated] (ARROW-8302) [Python] Start plasma store with STDOUT, STDERR arguments

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8302: Summary: [Python] Start plasma store with STDOUT, STDERR arguments (was: Start plasma store with S

[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17072279#comment-17072279 ] Wes McKinney commented on ARROW-7939: - I'll try to look at this tomorrow to see if I

[jira] [Updated] (ARROW-7740) [C++] Array internals corruption in StructArray::Flatten

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7740: Summary: [C++] Array internals corruption in StructArray::Flatten (was: [R] Crash/bad data in conv

[jira] [Resolved] (ARROW-7740) [C++] Array internals corruption in StructArray::Flatten

2020-03-31 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7740. - Resolution: Fixed Issue resolved by pull request 6792 [https://github.com/apache/arrow/pull/6792]

[jira] [Updated] (ARROW-8227) [C++] Refine SIMD feature definitions

2020-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8227: -- Labels: pull-request-available (was: ) > [C++] Refine SIMD feature definitions > -

[jira] [Assigned] (ARROW-7740) [C++] Array internals corruption in StructArray::Flatten

2020-03-31 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-7740: -- Assignee: Francois Saint-Jacques (was: Wes McKinney) > [C++] Array internals corrupti

  1   2   >