[jira] [Resolved] (ARROW-13173) [C++] TestAsyncUtil.ReadaheadFailed asserts occasionally

2021-07-05 Thread Yibo Cai (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yibo Cai resolved ARROW-13173. -- Resolution: Fixed Issue resolved by pull request 10602 [https://github.com/apache/arrow/pull/10602]

[jira] [Updated] (ARROW-12122) [Python] Cannot install via pip. M1 mac

2021-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-12122: --- Labels: pull-request-available (was: ) > [Python] Cannot install via pip. M1 mac >

[jira] [Commented] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17375073#comment-17375073 ] Kouhei Sutou commented on ARROW-13051: -- Wow. I didn't know it. I haven't tried them yet. >

[jira] [Updated] (ARROW-13216) [R] Schema retention with rtools35

2021-07-05 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-13216: - Priority: Blocker (was: Major) > [R] Schema retention with rtools35 >

[jira] [Assigned] (ARROW-13216) [R] Schema retention with rtools35

2021-07-05 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook reassigned ARROW-13216: Assignee: Ian Cook > [R] Schema retention with rtools35 > -- > >

[jira] [Commented] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374997#comment-17374997 ] Krisztian Szucs commented on ARROW-13051: - [~kou] If I understang Artifatory's offerings we

[jira] [Commented] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374992#comment-17374992 ] Kouhei Sutou commented on ARROW-13051: -- We don't need to use Artifactory for Java binaries. I think

[jira] [Assigned] (ARROW-12853) [R] Install fails with LTO flags

2021-07-05 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane reassigned ARROW-12853: -- Assignee: Jonathan Keane > [R] Install fails with LTO flags >

[jira] [Resolved] (ARROW-13199) [R] add ubuntu 21.04 to nightly builds

2021-07-05 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane resolved ARROW-13199. Fix Version/s: 5.0.0 Resolution: Fixed Issue resolved by pull request 10611

[jira] [Created] (ARROW-13261) [CI] Remove extra ubuntu-r-only-r service from docker-compose.yml

2021-07-05 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-13261: -- Summary: [CI] Remove extra ubuntu-r-only-r service from docker-compose.yml Key: ARROW-13261 URL: https://issues.apache.org/jira/browse/ARROW-13261 Project:

[jira] [Commented] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Anthony Louis Gotlib Ferreira (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374973#comment-17374973 ] Anthony Louis Gotlib Ferreira commented on ARROW-13051: --- [~kszucs] Thanks for your

[jira] [Assigned] (ARROW-13198) [C++][Dataset] Async scanner occasionally segfaulting in CI

2021-07-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-13198: --- Assignee: Weston Pace > [C++][Dataset] Async scanner occasionally segfaulting in CI >

[jira] [Commented] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374969#comment-17374969 ] Krisztian Szucs commented on ARROW-13051: - The workflow looks like this: 1. execute the crossbow

[jira] [Closed] (ARROW-13254) [Python] Processes killed and semaphore objects leaked when reading pandas data

2021-07-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-13254. --- Fix Version/s: 5.0.0 Resolution: Duplicate I'm going to go ahead and close this as a

[jira] [Resolved] (ARROW-13244) [C++] Add facility to get current thread id

2021-07-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-13244. Fix Version/s: 5.0.0 Resolution: Fixed Issue resolved by pull request 10644

[jira] [Comment Edited] (ARROW-12587) [R][C++][Packaging] Illegal opcode error on aggregate Array/ChunkedArray of integer

2021-07-05 Thread Carlo Cabrera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374956#comment-17374956 ] Carlo Cabrera edited comment on ARROW-12587 at 7/5/21, 5:59 PM: I've

[jira] [Comment Edited] (ARROW-12587) [R][C++][Packaging] Illegal opcode error on aggregate Array/ChunkedArray of integer

2021-07-05 Thread Carlo Cabrera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374956#comment-17374956 ] Carlo Cabrera edited comment on ARROW-12587 at 7/5/21, 5:58 PM: I've

[jira] [Commented] (ARROW-12587) [R][C++][Packaging] Illegal opcode error on aggregate Array/ChunkedArray of integer

2021-07-05 Thread Carlo Cabrera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374956#comment-17374956 ] Carlo Cabrera commented on ARROW-12587: --- I've merged a fix that disables the filtering of `-march`

[jira] [Commented] (ARROW-13259) [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths

2021-07-05 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374939#comment-17374939 ] Mauricio 'Pachá' Vargas Sepúlveda commented on ARROW-13259: --- thanks a lot,

[jira] [Commented] (ARROW-13151) [Python] Unable to read single child field of struct column from Parquet

2021-07-05 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374935#comment-17374935 ] Micah Kornfield commented on ARROW-13151: - {quote}As a side-note, if you do change the ugly

[jira] [Updated] (ARROW-13054) [C++] Add option to specify the first day of the week for the "day_of_week" temporal kernel

2021-07-05 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-13054: --- Fix Version/s: 5.0.0 > [C++] Add option to specify the first day of the week for the "day_of_week"

[jira] [Commented] (ARROW-12203) [C++][Python] Switch default Parquet version to 2.0

2021-07-05 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374930#comment-17374930 ] Micah Kornfield commented on ARROW-12203: - I'm OK moving the default logical types to 2.0 and

[jira] [Resolved] (ARROW-13158) [Python] Fix repr and contains of StructScalar with duplicate field names

2021-07-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-13158. Fix Version/s: 5.0.0 Resolution: Fixed Issue resolved by pull request 10591

[jira] [Commented] (ARROW-13259) [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths

2021-07-05 Thread Nic Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374924#comment-17374924 ] Nic Crane commented on ARROW-13259: --- Thanks very much [~maartenbreddels] and [~jorisvandenbossche] ! 

[jira] [Resolved] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-13258. -- Resolution: Fixed Issue resolved by pull request 10654 [https://github.com/apache/arrow/pull/10654]

[jira] [Created] (ARROW-13260) [Doc] Host different released versions of the documentation + version switcher

2021-07-05 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13260: - Summary: [Doc] Host different released versions of the documentation + version switcher Key: ARROW-13260 URL: https://issues.apache.org/jira/browse/ARROW-13260

[jira] [Updated] (ARROW-13134) [C++] SSL-related arrow-s3fs-test failures with aws-sdk-cpp 1.9.51

2021-07-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13134: --- Summary: [C++] SSL-related arrow-s3fs-test failures with aws-sdk-cpp 1.9.51 (was: [C++]

[jira] [Commented] (ARROW-13134) [C++] arrow-s3fs-test fails or hangs with aws-sdk-cpp 1.9.45

2021-07-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374914#comment-17374914 ] Antoine Pitrou commented on ARROW-13134: Ok, 1.9.51 does fix some of the issues. However, the

[jira] [Commented] (ARROW-1299) [Doc] Publish nightly documentation against master somewhere

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374913#comment-17374913 ] Joris Van den Bossche commented on ARROW-1299: -- > A crossbow nightly task could push the

[jira] [Updated] (ARROW-13230) Add CSV Writer documentation

2021-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13230: --- Labels: pull-request-available (was: ) > Add CSV Writer documentation >

[jira] [Commented] (ARROW-13259) [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374908#comment-17374908 ] David Li commented on ARROW-13259: -- Maybe we could add a SliceOptions::kEnd constant just to make it

[jira] [Commented] (ARROW-13259) [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374907#comment-17374907 ] Joris Van den Bossche commented on ARROW-13259: --- To copy over the practical example:

[jira] [Comment Edited] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Anthony Louis Gotlib Ferreira (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17373848#comment-17373848 ] Anthony Louis Gotlib Ferreira edited comment on ARROW-13051 at 7/5/21, 3:34 PM:

[jira] [Commented] (ARROW-1299) [Doc] Publish nightly documentation against master somewhere

2021-07-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374898#comment-17374898 ] Antoine Pitrou commented on ARROW-1299: --- Indeed, CUDA is not strictly needed, but then API docs

[jira] [Commented] (ARROW-1299) [Doc] Publish nightly documentation against master somewhere

2021-07-05 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374896#comment-17374896 ] Krisztian Szucs commented on ARROW-1299: We don't need CUDA to build the docs. A crossbow nightly

[jira] [Commented] (ARROW-12203) [C++][Python] Switch default Parquet version to 2.0

2021-07-05 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-12203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374891#comment-17374891 ] Jorge Leitão commented on ARROW-12203: -- I am of the opinion that it is time to move on; version 2.0

[jira] [Commented] (ARROW-13259) [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths

2021-07-05 Thread Maarten Breddels (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374890#comment-17374890 ] Maarten Breddels commented on ARROW-13259: -- Does my comment

[jira] [Created] (ARROW-13259) [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths

2021-07-05 Thread Nic Crane (Jira)
Nic Crane created ARROW-13259: - Summary: [C++] Enable slicing to end of string using "utf8_slice_codeunits" when string length unknown or different lengths Key: ARROW-13259 URL:

[jira] [Commented] (ARROW-1299) [Doc] Publish nightly documentation against master somewhere

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374881#comment-17374881 ] Joris Van den Bossche commented on ARROW-1299: -- cc [~amol-] > [Doc] Publish nightly

[jira] [Assigned] (ARROW-13051) [Release][Packaging] Update the java post release task to use the crossbow artifacts

2021-07-05 Thread Anthony Louis Gotlib Ferreira (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anthony Louis Gotlib Ferreira reassigned ARROW-13051: - Assignee: Anthony Louis Gotlib Ferreira >

[jira] [Resolved] (ARROW-12512) [C++][Dataset] Implement CSV writing support

2021-07-05 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-12512. Fix Version/s: 5.0.0 Resolution: Fixed Issue resolved by pull request 10230

[jira] [Resolved] (ARROW-12988) [CI] The kartothek nightly integration build is failing (test_update_dataset_from_ddf_empty)

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-12988. --- Resolution: Fixed Issue resolved by pull request 10655

[jira] [Closed] (ARROW-12365) [Python] [Dataset] Add partition_filename_cb to ds.write_dataset()

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-12365. - Resolution: Not A Problem > [Python] [Dataset] Add partition_filename_cb to

[jira] [Reopened] (ARROW-12365) [Python] [Dataset] Add partition_filename_cb to ds.write_dataset()

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reopened ARROW-12365: --- > [Python] [Dataset] Add partition_filename_cb to ds.write_dataset() >

[jira] [Updated] (ARROW-12365) [Python] [Dataset] Add partition_filename_cb to ds.write_dataset()

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-12365: -- Fix Version/s: (was: 5.0.0) > [Python] [Dataset] Add

[jira] [Updated] (ARROW-11781) [Python] Reading small amount of files from a partitioned dataset is unexpectedly slow

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-11781: -- Fix Version/s: (was: 5.0.0) 4.0.0 > [Python] Reading

[jira] [Commented] (ARROW-13198) [C++][Dataset] Async scanner occasionally segfaulting in CI

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13198?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374799#comment-17374799 ] David Li commented on ARROW-13198: -- Another one in AMD64 Conda Python 3.8 Without Pandas: {noformat}

[jira] [Updated] (ARROW-13198) [C++][Dataset] Async scanner occasionally segfaulting in CI

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-13198: - Issue Type: Bug (was: Improvement) > [C++][Dataset] Async scanner occasionally segfaulting in CI >

[jira] [Resolved] (ARROW-11781) [Python] Reading small amount of files from a partitioned dataset is unexpectedly slow

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-11781. -- Resolution: Fixed > [Python] Reading small amount of files from a partitioned dataset is >

[jira] [Comment Edited] (ARROW-11781) [Python] Reading small amount of files from a partitioned dataset is unexpectedly slow

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374798#comment-17374798 ] David Li edited comment on ARROW-11781 at 7/5/21, 12:40 PM: I think we can

[jira] [Commented] (ARROW-11781) [Python] Reading small amount of files from a partitioned dataset is unexpectedly slow

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374798#comment-17374798 ] David Li commented on ARROW-11781: -- I think we can close it for now, as it's tracked in benchmarks and

[jira] [Resolved] (ARROW-11980) [Python] Remove "experimental" status from Table.replace_schema_metadata

2021-07-05 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-11980. -- Resolution: Fixed Issue resolved by pull request 10653 [https://github.com/apache/arrow/pull/10653]

[jira] [Updated] (ARROW-10726) [Python] Reading multiple parquet files with different index column dtype (originating pandas) reads wrong data

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10726: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [Python] Reading

[jira] [Updated] (ARROW-13248) [Python] Segfault in test_dataset::test_scan_iterator on python 3.9

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13248: -- Fix Version/s: 5.0.0 > [Python] Segfault in test_dataset::test_scan_iterator

[jira] [Updated] (ARROW-10469) [CI][Python] Run dask integration tests on Windows

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10469: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [CI][Python] Run

[jira] [Commented] (ARROW-11781) [Python] Reading small amount of files from a partitioned dataset is unexpectedly slow

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374792#comment-17374792 ] Joris Van den Bossche commented on ARROW-11781: --- Can this be closed, or are there

[jira] [Updated] (ARROW-13153) [C++] `parquet_dataset` loses ordering of files in `_metadata`

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13153: -- Fix Version/s: 5.0.0 > [C++] `parquet_dataset` loses ordering of files in

[jira] [Updated] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13258: --- Labels: pull-request-available (was: ) > [Python] Improve the repr of ParquetFileFragment

[jira] [Updated] (ARROW-13137) [C++][Documentation] Make in-table references consistent

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13137: -- Fix Version/s: (was: 5.0.0) > [C++][Documentation] Make in-table

[jira] [Updated] (ARROW-13137) [C++][Documentation] Make in-table references consistent

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13137: -- Fix Version/s: 5.0.0 > [C++][Documentation] Make in-table references

[jira] [Resolved] (ARROW-13137) [C++][Documentation] Make in-table references consistent

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-13137. --- Fix Version/s: (was: 6.0.0) 5.0.0 Resolution:

[jira] [Updated] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13258: -- Fix Version/s: 5.0.0 > [Python] Improve the repr of ParquetFileFragment >

[jira] [Assigned] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-13258: - Assignee: Joris Van den Bossche > [Python] Improve the repr of

[jira] [Created] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13258: - Summary: [Python] Improve the repr of ParquetFileFragment Key: ARROW-13258 URL: https://issues.apache.org/jira/browse/ARROW-13258 Project: Apache

[jira] [Updated] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13258: -- Description: Compare with the legacy version: {code} In [5]: d1 =

[jira] [Updated] (ARROW-13074) [Python] Start with deprecating ParquetDataset custom attributes

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13074: -- Fix Version/s: (was: 6.0.0) 5.0.0 > [Python] Start

[jira] [Updated] (ARROW-11980) [Python] Remove "experimental" status from Table.replace_schema_metadata

2021-07-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-11980: --- Labels: pull-request-available (was: ) > [Python] Remove "experimental" status from

[jira] [Commented] (ARROW-13086) [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374714#comment-17374714 ] Joris Van den Bossche commented on ARROW-13086: --- There is a PR in progress, so moving back

[jira] [Updated] (ARROW-13086) [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13086: -- Fix Version/s: (was: 6.0.0) 5.0.0 > [Python] Expose

[jira] [Assigned] (ARROW-13086) [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-13086: - Assignee: Karik Isichei (was: Joris Van den Bossche) > [Python]

[jira] [Assigned] (ARROW-13086) [Python] Expose Parquet ArrowReaderProperties::coerce_int96_timestamp_unit_

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-13086: - Assignee: Joris Van den Bossche > [Python] Expose Parquet

[jira] [Assigned] (ARROW-11980) [Python] Remove "experimental" status from Table.replace_schema_metadata

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-11980: - Assignee: Joris Van den Bossche > [Python] Remove "experimental"

[jira] [Updated] (ARROW-13141) [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable?

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13141: -- Fix Version/s: (was: 6.0.0) 5.0.0 > [C++][Python]

[jira] [Updated] (ARROW-11980) [Python] Remove "experimental" status from Table.replace_schema_metadata

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-11980: -- Fix Version/s: (was: 6.0.0) 5.0.0 > [Python] Remove

[jira] [Assigned] (ARROW-12016) [C++] Implement array_sort_indices and sort_indices for BOOL type

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-12016: - Assignee: Niranda Perera (was: Joris Van den Bossche) > [C++]

[jira] [Assigned] (ARROW-12016) [C++] Implement array_sort_indices and sort_indices for BOOL type

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-12016: - Assignee: Joris Van den Bossche (was: Niranda Perera) > [C++]

[jira] [Updated] (ARROW-12016) [C++] Implement array_sort_indices and sort_indices for BOOL type

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-12016: -- Fix Version/s: (was: 6.0.0) 5.0.0 > [C++] Implement

[jira] [Commented] (ARROW-12016) [C++] Implement array_sort_indices and sort_indices for BOOL type

2021-07-05 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374711#comment-17374711 ] Joris Van den Bossche commented on ARROW-12016: --- This has a PR in progress, so moving back

[jira] [Updated] (ARROW-13151) [Python] Unable to read single child field of struct column from Parquet

2021-07-05 Thread Angus Hollands (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Angus Hollands updated ARROW-13151: --- Description: Given the following table {code:java} data = {"root": [[{"addr": {"this": 3,

[jira] [Resolved] (ARROW-13032) [Java] Update gauva version

2021-07-05 Thread Kazuaki Ishizaki (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki resolved ARROW-13032. -- Fix Version/s: 5.0.0 Resolution: Fixed Issue resolved by pull request 10501

[jira] [Updated] (ARROW-9246) [JS] Add forward compatibility checks for Decimal::bitWidth

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-9246: - Fix Version/s: (was: 5.0.0) 6.0.0 > [JS] Add forward

[jira] [Commented] (ARROW-12701) [Website][Release] Include Rust and DataFusion commits, contributors, changes in release notes

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17374683#comment-17374683 ] Alessandro Molina commented on ARROW-12701: --- [~icook] can you confirm this has been addressed?

[jira] [Updated] (ARROW-12701) [Website][Release] Include Rust and DataFusion commits, contributors, changes in release notes

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12701: -- Fix Version/s: (was: 6.0.0) 5.0.0 > [Website][Release] Include

[jira] [Updated] (ARROW-12701) [Website][Release] Include Rust and DataFusion commits, contributors, changes in release notes

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12701: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [Website][Release] Include

[jira] [Updated] (ARROW-12359) [C++] Deprecate or remove FileSystem::OpenAppendStream

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12359: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++] Deprecate or remove

[jira] [Updated] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12264: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++][Dataset] Handle NaNs

[jira] [Updated] (ARROW-12060) [Python] Enable calling compute functions on Expressions

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12060: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [Python] Enable calling

[jira] [Updated] (ARROW-12105) [R] Replace vars_select, vars_rename with eval_select, eval_rename

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12105: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [R] Replace vars_select,

[jira] [Updated] (ARROW-8470) [Python][R] Expose incremental write API for Feather files

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-8470: - Fix Version/s: (was: 5.0.0) 6.0.0 > [Python][R] Expose

[jira] [Updated] (ARROW-11243) [C++] Parse time32 from string and infer in CSV reader

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-11243: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++] Parse time32 from

[jira] [Updated] (ARROW-13238) [C++][Dataset][Compute] Substitute ExecPlan impl for dataset scans

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-13238: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++][Dataset][Compute]

[jira] [Updated] (ARROW-9612) [Python] Automatically back on larger IO block size when JSON parsing fails

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-9612: - Fix Version/s: (was: 5.0.0) 6.0.0 > [Python] Automatically back

[jira] [Updated] (ARROW-9433) [C++/Python] Add option to Take kernel to interpret negative indices as NULL

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-9433: - Fix Version/s: (was: 5.0.0) 6.0.0 > [C++/Python] Add option to

[jira] [Updated] (ARROW-12728) [C++][Compute] Aggregates: implement count distinct

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-12728: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++][Compute] Aggregates:

[jira] [Updated] (ARROW-9111) [Python] csv.read_csv progress bar

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-9111: - Fix Version/s: (was: 5.0.0) 6.0.0 > [Python] csv.read_csv

[jira] [Updated] (ARROW-11206) [C++][Dataset][Python] Consider hiding/renaming "project"

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-11206: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++][Dataset][Python]

[jira] [Updated] (ARROW-9434) [C++] Store type_code information in UnionScalar::value

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-9434: - Fix Version/s: (was: 5.0.0) 6.0.0 > [C++] Store type_code

[jira] [Updated] (ARROW-8991) [C++][Compute] Add scalar_hash function

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-8991: - Fix Version/s: (was: 5.0.0) 6.0.0 > [C++][Compute] Add

[jira] [Updated] (ARROW-10142) [C++] RecordBatchStreamReader should use StreamDecoder

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-10142: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [C++]

[jira] [Updated] (ARROW-13074) [Python] Start with deprecating ParquetDataset custom attributes

2021-07-05 Thread Alessandro Molina (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alessandro Molina updated ARROW-13074: -- Fix Version/s: (was: 5.0.0) 6.0.0 > [Python] Start with

  1   2   >