[jira] [Created] (ARROW-11922) Implement feather::feather_metadata(path) in R arrow

2021-03-10 Thread Matteo Sostero (Jira)
Matteo Sostero created ARROW-11922: -- Summary: Implement feather::feather_metadata(path) in R arrow Key: ARROW-11922 URL: https://issues.apache.org/jira/browse/ARROW-11922 Project: Apache Arrow

[jira] [Updated] (ARROW-11922) [R] Implement feather::feather_metadata(path) in R arrow

2021-03-10 Thread Matteo Sostero (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matteo Sostero updated ARROW-11922: --- Summary: [R] Implement feather::feather_metadata(path) in R arrow (was: Implement feather::

[jira] [Updated] (ARROW-11922) [R] Implement feather::feather_metadata(path) in R arrow

2021-03-10 Thread Matteo Sostero (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matteo Sostero updated ARROW-11922: --- Description: The old R `feather` package had a function called  {{feather::feather_metadata(

[jira] [Resolved] (ARROW-10514) [C++][Parquet] Data inconsistency in parquet-reader output modes

2021-03-10 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-10514. Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 9649 [https:/

[jira] [Assigned] (ARROW-10514) [C++][Parquet] Data inconsistency in parquet-reader output modes

2021-03-10 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-10514: -- Assignee: Zosimova Zhanna > [C++][Parquet] Data inconsistency in parquet-reader outpu

[jira] [Commented] (ARROW-10694) [Python] ds.write_dataset() generates empty files for each final partition

2021-03-10 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298828#comment-17298828 ] Lance Dacey commented on ARROW-10694: - This is being worked on in the adlfs library

[jira] [Closed] (ARROW-10694) [Python] ds.write_dataset() generates empty files for each final partition

2021-03-10 Thread Lance Dacey (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lance Dacey closed ARROW-10694. --- Fix Version/s: 3.0.0 Resolution: Fixed https://github.com/dask/adlfs/pull/193 > [Python] ds.

[jira] [Created] (ARROW-11923) [CI] Update branch name for dask dev integration tests

2021-03-10 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-11923: - Summary: [CI] Update branch name for dask dev integration tests Key: ARROW-11923 URL: https://issues.apache.org/jira/browse/ARROW-11923 Project: Apac

[jira] [Updated] (ARROW-11923) [CI] Update branch name for dask dev integration tests

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11923: --- Labels: pull-request-available (was: ) > [CI] Update branch name for dask dev integration t

[jira] [Commented] (ARROW-7224) [C++][Dataset] Partition level filters should be able to provide filtering to file systems

2021-03-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298920#comment-17298920 ] Joris Van den Bossche commented on ARROW-7224: -- bq. FWIW, Spark as has APIs

[jira] [Updated] (ARROW-8658) [C++][Dataset] Implement subtree pruning for FileSystemDataset::GetFragments

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8658: -- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] Implement subtree pruni

[jira] [Resolved] (ARROW-11877) [C++] Add initial microbenchmarks for Dataset internals

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-11877. -- Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 9638 [https://git

[jira] [Created] (ARROW-11924) [C++] Provide streaming output from GetFileInfo

2021-03-10 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-11924: Summary: [C++] Provide streaming output from GetFileInfo Key: ARROW-11924 URL: https://issues.apache.org/jira/browse/ARROW-11924 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-7905) [Go][Parquet] Port the C++ Parquet implementation to Go

2021-03-10 Thread Nick Poorman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298934#comment-17298934 ] Nick Poorman commented on ARROW-7905: - [~zeroshade] I'm happy you were able to pick t

[jira] [Updated] (ARROW-7905) [Go][Parquet] Port the C++ Parquet implementation to Go

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7905: -- Labels: Go Parquet golang pull-request-available (was: Go Parquet golang) > [Go][Parquet] Port

[jira] [Commented] (ARROW-7905) [Go][Parquet] Port the C++ Parquet implementation to Go

2021-03-10 Thread Matt Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298940#comment-17298940 ] Matt Topol commented on ARROW-7905: --- I ended up needing this at work, and is actually c

[jira] [Commented] (ARROW-11871) [C++] Add element-wise power() compute function

2021-03-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298942#comment-17298942 ] Joris Van den Bossche commented on ARROW-11871: --- Yes, I already closed it.

[jira] [Updated] (ARROW-11070) [C++] Implement power / exponentiation compute kernel

2021-03-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-11070: -- Summary: [C++] Implement power / exponentiation compute kernel (was: [C++] [R

[jira] [Commented] (ARROW-11070) [C++] Implement power / exponentiation compute kernel

2021-03-10 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298945#comment-17298945 ] Joris Van den Bossche commented on ARROW-11070: --- Copying my note about nul

[jira] [Assigned] (ARROW-11070) [C++] Implement power / exponentiation compute kernel

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11070: --- Assignee: Rok Mihevc (was: Michal Nowakiewicz) > [C++] Implement power / exponenti

[jira] [Assigned] (ARROW-11658) [R] Handle mutate/rename inside group_by

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11658: --- Assignee: Neal Richardson > [R] Handle mutate/rename inside group_by >

[jira] [Assigned] (ARROW-11785) [R] Fallback when filtering Table with if_any() expression fails

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11785?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11785: --- Assignee: Ian Cook > [R] Fallback when filtering Table with if_any() expression fai

[jira] [Assigned] (ARROW-11912) [R] Remove args from FeatherReader$create

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11912?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11912: --- Assignee: Mauricio 'Pachá' Vargas Sepúlveda > [R] Remove args from FeatherReader$cr

[jira] [Assigned] (ARROW-11921) [R] Set LC_COLLATE in r/data-raw/codegen.R

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11921: --- Assignee: Mauricio 'Pachá' Vargas Sepúlveda > [R] Set LC_COLLATE in r/data-raw/code

[jira] [Assigned] (ARROW-11589) [R] Add methods for modifying Schemas

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11589: --- Assignee: Jonathan Keane > [R] Add methods for modifying Schemas >

[jira] [Assigned] (ARROW-11659) [R] Preserve group_by .drop argument

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11659: --- Assignee: Ian Cook > [R] Preserve group_by .drop argument > ---

[jira] [Assigned] (ARROW-11660) [C++] Move RecordBatch::SelectColumns method from R to C++ library

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11660: --- Assignee: Mauricio 'Pachá' Vargas Sepúlveda > [C++] Move RecordBatch::SelectColumns

[jira] [Updated] (ARROW-11392) [R] Remove/revisit ARROW_R_WITH_ARROW flags

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11392: Summary: [R] Remove/revisit ARROW_R_WITH_ARROW flags (was: [R] Remove ARROW_R_WITH_ARROW

[jira] [Assigned] (ARROW-11392) [R] Remove ARROW_R_WITH_ARROW flags

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11392: --- Assignee: Neal Richardson > [R] Remove ARROW_R_WITH_ARROW flags > -

[jira] [Updated] (ARROW-4512) [R] Stream reader/writer API that takes socket stream

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-4512: --- Fix Version/s: (was: 4.0.0) 5.0.0 > [R] Stream reader/writer API that

[jira] [Updated] (ARROW-9235) [R] Support for `connection` class when reading and writing files

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9235: --- Fix Version/s: (was: 4.0.0) 5.0.0 > [R] Support for `connection` class

[jira] [Updated] (ARROW-8470) [Python][R] Expose incremental write API for Feather files

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8470: --- Fix Version/s: (was: 4.0.0) 5.0.0 > [Python][R] Expose incremental wri

[jira] [Updated] (ARROW-10734) [R] Build and test on Solaris

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10734: Fix Version/s: 4.0.0 > [R] Build and test on Solaris > - > >

[jira] [Assigned] (ARROW-11703) [R] Implement dplyr::arrange()

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11703: --- Assignee: Ian Cook > [R] Implement dplyr::arrange() > -

[jira] [Assigned] (ARROW-11755) [R] Add tests from dplyr/test-mutate.r

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11755: --- Assignee: Mauricio 'Pachá' Vargas Sepúlveda > [R] Add tests from dplyr/test-mutate.

[jira] [Updated] (ARROW-11441) [R] Read CSV from character vector

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11441: Description: `readr::read_csv()` lets you read in data from a character vector, useful for

[jira] [Assigned] (ARROW-9657) [R][Dataset] Expose more FileSystemDatasetFactory options

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9657: -- Assignee: Ian Cook > [R][Dataset] Expose more FileSystemDatasetFactory options > -

[jira] [Updated] (ARROW-9657) [R][Dataset] Expose more FileSystemDatasetFactory options

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9657: --- Description: Among the features: * ignore_prefixes option * Pass an explicit list of files +

[jira] [Assigned] (ARROW-11338) [R] Bindings for quantile and median

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11338: --- Assignee: Ian Cook > [R] Bindings for quantile and median > --

[jira] [Resolved] (ARROW-11516) [R] Allow all C++ compute functions to be called by name in dplyr

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11516. - Resolution: Fixed Issue resolved by pull request 9659 [https://github.com/apache/arrow/p

[jira] [Commented] (ARROW-11924) [C++] Provide streaming output from GetFileInfo

2021-03-10 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299051#comment-17299051 ] Antoine Pitrou commented on ARROW-11924: Then it would probably be a non-reentra

[jira] [Commented] (ARROW-11924) [C++] Provide streaming output from GetFileInfo

2021-03-10 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299062#comment-17299062 ] Weston Pace commented on ARROW-11924: - That should be ok.  If we get to a point that

[jira] [Updated] (ARROW-11861) [R][Packaging] Apply changes in r/tools/autobrew upstream

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11861: Priority: Critical (was: Blocker) > [R][Packaging] Apply changes in r/tools/autobrew upst

[jira] [Assigned] (ARROW-11475) [C++] Upgrade mimalloc

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-11475: --- Assignee: Antoine Pitrou > [C++] Upgrade mimalloc > -- > >

[jira] [Updated] (ARROW-10305) [R] Filter with regular expressions

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10305: Component/s: (was: C++) > [R] Filter with regular expressions > --

[jira] [Commented] (ARROW-11475) [C++] Upgrade mimalloc

2021-03-10 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299073#comment-17299073 ] Antoine Pitrou commented on ARROW-11475: [~npr] [~jonkeane] did you have a chanc

[jira] [Commented] (ARROW-11924) [C++] Provide streaming output from GetFileInfo

2021-03-10 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299081#comment-17299081 ] Antoine Pitrou commented on ARROW-11924: Note that the implementation could very

[jira] [Created] (ARROW-11925) Add `between` method for arrow_dplyr_query

2021-03-10 Thread Sam Albers (Jira)
Sam Albers created ARROW-11925: -- Summary: Add `between` method for arrow_dplyr_query Key: ARROW-11925 URL: https://issues.apache.org/jira/browse/ARROW-11925 Project: Apache Arrow Issue Type: New

[jira] [Updated] (ARROW-11925) [R] Add `between` method for arrow_dplyr_query

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11925: Fix Version/s: 4.0.0 > [R] Add `between` method for arrow_dplyr_query > --

[jira] [Updated] (ARROW-11925) [R] Add `between` method for arrow_dplyr_query

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-11925: Summary: [R] Add `between` method for arrow_dplyr_query (was: Add `between` method for ar

[jira] [Commented] (ARROW-11925) Add `between` method for arrow_dplyr_query

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299101#comment-17299101 ] Neal Richardson commented on ARROW-11925: - Sure! You'd register the function aro

[jira] [Resolved] (ARROW-11672) [R] Fix string function test failure on R 3.3

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11672. - Resolution: Fixed Issue resolved by pull request 9664 [https://github.com/apache/arrow/p

[jira] [Resolved] (ARROW-10953) [R] Validate when creating Table with schema

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-10953. - Resolution: Fixed Issue resolved by pull request 9665 [https://github.com/apache/arrow/p

[jira] [Created] (ARROW-11926) [R] Pass on the new UCRT windows builds

2021-03-10 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-11926: -- Summary: [R] Pass on the new UCRT windows builds Key: ARROW-11926 URL: https://issues.apache.org/jira/browse/ARROW-11926 Project: Apache Arrow Issue Type

[jira] [Updated] (ARROW-11926) [R] Pass on the new UCRT CRAN windows builds

2021-03-10 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-11926: --- Summary: [R] Pass on the new UCRT CRAN windows builds (was: [R] Pass on the new UCRT window

[jira] [Commented] (ARROW-11926) [R] Pass on the new UCRT CRAN windows builds

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299116#comment-17299116 ] Neal Richardson commented on ARROW-11926: - The C++ libraries will have to be bui

[jira] [Updated] (ARROW-10440) [C++][Dataset][Python] Add a callback to visit file writers just before Finish()

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-10440: - Fix Version/s: (was: 4.0.0) 5.0.0 > [C++][Dataset][Python] Add a callback

[jira] [Updated] (ARROW-11749) [C++][Dataset] Support projections between children of UnionDatasets

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11749: - Fix Version/s: (was: 4.0.0) 5.0.0 > [C++][Dataset] Support projections be

[jira] [Updated] (ARROW-5745) [C++] properties of Map(Array|Type) are confusingly named

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-5745: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] properties of Map(Array|Type) are

[jira] [Commented] (ARROW-11647) [C++][Compute] CastFromNull does not use preallocated buffers

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11647?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299123#comment-17299123 ] Ben Kietzman commented on ARROW-11647: -- [~edponce] > [C++][Compute] CastFromNull d

[jira] [Updated] (ARROW-11647) [C++][Compute] CastFromNull does not use preallocated buffers

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11647: - Fix Version/s: (was: 4.0.0) 5.0.0 > [C++][Compute] CastFromNull does not

[jira] [Updated] (ARROW-11402) [C++][Dataset] Allow more aggresive implicit casts for literals

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11402: - Fix Version/s: (was: 4.0.0) 5.0.0 > [C++][Dataset] Allow more aggresive i

[jira] [Updated] (ARROW-10524) [C++][Dataset] Add FlightFragment

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-10524: - Fix Version/s: (was: 4.0.0) 5.0.0 > [C++][Dataset] Add FlightFragment > -

[jira] [Updated] (ARROW-7179) [C++][Compute] Array support for fill_null

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7179: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++][Compute] Array support for fill_nu

[jira] [Updated] (ARROW-5423) [C++] implement partial schema class to extend JSON conversion

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-5423: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] implement partial schema class to

[jira] [Commented] (ARROW-8981) [C++][Dataset] Add support for compressed FileSources

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299129#comment-17299129 ] Ben Kietzman commented on ARROW-8981: - I'm not sure this is worthwhile, actually. I t

[jira] [Assigned] (ARROW-11611) [C++] Update third party dependency mirrors

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-11611: Assignee: Ian Cook (was: Ben Kietzman) > [C++] Update third party dependency mirrors > -

[jira] [Updated] (ARROW-6407) [C++] Consolidate thirdparty bundle URLs, version bumping logic, etc

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6407: --- Description: This will prevent issues like ARROW-6406 After ARROW-8266 ensures every _ep has

[jira] [Commented] (ARROW-11611) [C++] Update third party dependency mirrors

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299131#comment-17299131 ] Neal Richardson commented on ARROW-11611: - I updated the trimmed boost yesterday

[jira] [Resolved] (ARROW-11923) [CI] Update branch name for dask dev integration tests

2021-03-10 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-11923. -- Resolution: Fixed Issue resolved by pull request 9669 [https://github.com/apache/arrow/pull/96

[jira] [Updated] (ARROW-11923) [CI] Update branch name for dask dev integration tests

2021-03-10 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-11923: - Component/s: (was: CI) Continuous Integration > [CI] Update branch name for

[jira] [Updated] (ARROW-11921) [R] Set LC_COLLATE in r/data-raw/codegen.R

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11921: --- Labels: pull-request-available (was: ) > [R] Set LC_COLLATE in r/data-raw/codegen.R > -

[jira] [Assigned] (ARROW-7364) [Rust] Add cast options to cast kernel

2021-03-10 Thread Mike Seddon (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mike Seddon reassigned ARROW-7364: -- Assignee: Mike Seddon > [Rust] Add cast options to cast kernel > -

[jira] [Updated] (ARROW-3822) [C++] parquet::arrow::FileReader::GetRecordBatchReader may not iterate through chunked columns completely

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-3822: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] parquet::arrow::FileReader::GetRec

[jira] [Updated] (ARROW-5327) [C++] allow construction of ArrayBuilders from existing arrays

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-5327: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] allow construction of ArrayBuilder

[jira] [Updated] (ARROW-4706) [C++] shared conversion framework for JSON/CSV parsers

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-4706: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] shared conversion framework for JS

[jira] [Updated] (ARROW-7894) [C++] DefineOptions should invoke add_definitions

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7894: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] DefineOptions should invoke add_de

[jira] [Updated] (ARROW-4698) [C++] Let StringBuilder be constructible with a pre allocated buffer for character data

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-4698: Fix Version/s: (was: 4.0.0) > [C++] Let StringBuilder be constructible with a pre allocated buf

[jira] [Updated] (ARROW-9543) [C++] Simplify parsing/conversion utilities

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9543: Fix Version/s: (was: 4.0.0) 5.0.0 > [C++] Simplify parsing/conversion utilit

[jira] [Assigned] (ARROW-10439) [C++][Dataset] Add max file size as a dataset writing option

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-10439: Assignee: Weston Pace (was: Ben Kietzman) > [C++][Dataset] Add max file size as a datase

[jira] [Resolved] (ARROW-6604) [C++] Add support for nested types to MakeArrayFromScalar

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-6604. - Resolution: Fixed > [C++] Add support for nested types to MakeArrayFromScalar > -

[jira] [Comment Edited] (ARROW-8981) [C++][Dataset] Add support for compressed FileSources

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299129#comment-17299129 ] Ben Kietzman edited comment on ARROW-8981 at 3/10/21, 9:59 PM:

[jira] [Commented] (ARROW-7224) [C++][Dataset] Partition level filters should be able to provide filtering to file systems

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299155#comment-17299155 ] Ben Kietzman commented on ARROW-7224: - Streaming GetFileInfo: ARROW-11924 > [C++][Da

[jira] [Created] (ARROW-11927) [Rust][DataFusion] Limit push down

2021-03-10 Thread Jira
Daniël Heres created ARROW-11927: Summary: [Rust][DataFusion] Limit push down Key: ARROW-11927 URL: https://issues.apache.org/jira/browse/ARROW-11927 Project: Apache Arrow Issue Type: Improv

[jira] [Updated] (ARROW-11927) [Rust][DataFusion] Support limit push down

2021-03-10 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-11927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniël Heres updated ARROW-11927: - Summary: [Rust][DataFusion] Support limit push down (was: [Rust][DataFusion] Limit push down)

[jira] [Updated] (ARROW-11927) [Rust][DataFusion] Support limit push down

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11927: --- Labels: pull-request-available (was: ) > [Rust][DataFusion] Support limit push down >

[jira] [Created] (ARROW-11928) [C++][Compute] Add ExecNode hierarchy

2021-03-10 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-11928: Summary: [C++][Compute] Add ExecNode hierarchy Key: ARROW-11928 URL: https://issues.apache.org/jira/browse/ARROW-11928 Project: Apache Arrow Issue Type: Impr

[jira] [Resolved] (ARROW-11921) [R] Set LC_COLLATE in r/data-raw/codegen.R

2021-03-10 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-11921. - Resolution: Fixed Issue resolved by pull request 9673 [https://github.com/apache/arrow/p

[jira] [Created] (ARROW-11929) [C++][Compute] Promote Expression to the compute namespace

2021-03-10 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-11929: Summary: [C++][Compute] Promote Expression to the compute namespace Key: ARROW-11929 URL: https://issues.apache.org/jira/browse/ARROW-11929 Project: Apache Arrow

[jira] [Updated] (ARROW-11929) [C++][Compute] Promote Expression to the compute namespace

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11929: - Description: See discussion in https://docs.google.com/document/d/1AyTdLU-RxA-Gsb9EsYnrQrmqPMOY

[jira] [Created] (ARROW-11930) [C++][Dataset][Compute] Provide ScanNode implementation which wraps a Dataset

2021-03-10 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-11930: Summary: [C++][Dataset][Compute] Provide ScanNode implementation which wraps a Dataset Key: ARROW-11930 URL: https://issues.apache.org/jira/browse/ARROW-11930 Project

[jira] [Updated] (ARROW-11928) [C++][Compute] Add ExecNode hierarchy

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11928: - Description: Per discussion on https://docs.google.com/document/d/1AyTdLU-RxA-Gsb9EsYnrQrmqPMOY

[jira] [Updated] (ARROW-11930) [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11930: - Summary: [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph (was: [C++][Da

[jira] [Updated] (ARROW-11930) [C++][Dataset][Compute] Refactor Dataset scans to use an ExecNode graph

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-11930: - Description: Per discussion on https://docs.google.com/document/d/1AyTdLU-RxA-Gsb9EsYnrQrmqPMOY

[jira] [Created] (ARROW-11931) [Go][CI] Bump CI to use Go 1.15

2021-03-10 Thread Matt Topol (Jira)
Matt Topol created ARROW-11931: -- Summary: [Go][CI] Bump CI to use Go 1.15 Key: ARROW-11931 URL: https://issues.apache.org/jira/browse/ARROW-11931 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-11925) [R] Add `between` method for arrow_dplyr_query

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11925: --- Labels: pull-request-available (was: ) > [R] Add `between` method for arrow_dplyr_query > -

[jira] [Updated] (ARROW-11931) [Go][CI] Bump CI to use Go 1.15

2021-03-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11931: --- Labels: pull-request-available (was: ) > [Go][CI] Bump CI to use Go 1.15 >

[jira] [Updated] (ARROW-11656) Left over functions/fixes

2021-03-10 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-11656: Component/s: Rust - DataFusion > Left over functions/fixes > - > >

[jira] [Resolved] (ARROW-11656) Left over functions/fixes

2021-03-10 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-11656. - Fix Version/s: 4.0.0 Resolution: Fixed Issue resolved by pull request 9654 [https://githu

[jira] [Commented] (ARROW-11206) [C++][Dataset][Python] Consider hiding/renaming "project"

2021-03-10 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17299192#comment-17299192 ] Ben Kietzman commented on ARROW-11206: -- Note: after ARROW-19230 projection will be

  1   2   >