[jira] [Assigned] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2023-01-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-18400: -- Assignee: Will Jones > [Python] Quadratic memory usage of Table.to_pandas with nested data >

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2023-01-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655100#comment-17655100 ] Will Jones commented on ARROW-18400: Took a look at the issue in Joris' last repro. Is seems to stem

[jira] [Resolved] (ARROW-18411) [Python] MapType comparison ignores nullable flag of item_field

2023-01-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones resolved ARROW-18411. Resolution: Fixed > [Python] MapType comparison ignores nullable flag of item_field >

[jira] [Resolved] (ARROW-17302) [R] Configure curl timeout policy for S3

2023-01-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones resolved ARROW-17302. Resolution: Fixed > [R] Configure curl timeout policy for S3 >

[jira] [Commented] (ARROW-18202) [R][C++] Different behaviour of R's base::gsub() binding aka libarrow's replace_string_regex kernel since 10.0.0

2022-12-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17653186#comment-17653186 ] Will Jones commented on ARROW-18202: The following lines were added to early return if the input

[jira] [Assigned] (ARROW-18202) [R][C++] Different behaviour of R's base::gsub() binding aka libarrow's replace_string_regex kernel since 10.0.0

2022-12-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-18202: -- Assignee: Will Jones > [R][C++] Different behaviour of R's base::gsub() binding aka

[jira] [Commented] (ARROW-18195) [R][C++] Final value returned by case_when is NA when input has 64 or more values and 1 or more NAs

2022-12-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17653169#comment-17653169 ] Will Jones commented on ARROW-18195: Thank you for all the reproductions. I zeroed in on one simple

[jira] [Assigned] (ARROW-18195) [R][C++] Final value returned by case_when is NA when input has 64 or more values and 1 or more NAs

2022-12-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18195?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-18195: -- Assignee: Will Jones > [R][C++] Final value returned by case_when is NA when input has 64 or

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-11-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641484#comment-17641484 ] Will Jones commented on ARROW-18400: Under the hood, {{pyarrow.parquet.read_table}} is using

[jira] [Commented] (ARROW-18411) [Python] MapType comparison ignores nullable flag of item_field

2022-11-28 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17640137#comment-17640137 ] Will Jones commented on ARROW-18411: Thanks for reporting this. This will be fixed by

[jira] [Assigned] (ARROW-18411) [Python] MapType comparison ignores nullable flag of item_field

2022-11-28 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-18411: -- Assignee: Will Jones > [Python] MapType comparison ignores nullable flag of item_field >

[jira] [Assigned] (ARROW-15812) [R] Allow user to supply col_names argument when reading in a CSV dataset

2022-11-21 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-15812: -- Assignee: Will Jones > [R] Allow user to supply col_names argument when reading in a CSV

[jira] [Commented] (ARROW-15812) [R] Allow user to supply col_names argument when reading in a CSV dataset

2022-11-21 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17636918#comment-17636918 ] Will Jones commented on ARROW-15812: Auto-generation of column names was added to Datasets in

[jira] [Assigned] (ARROW-15470) [R] Allows user to specify string to be used for missing data when writing CSV dataset

2022-11-18 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-15470: -- Assignee: Will Jones > [R] Allows user to specify string to be used for missing data when

[jira] [Commented] (ARROW-18355) [R] support the quoted_na argument in open_dataset for CSVs by mapping it to CSVConvertOptions$strings_can_be_null

2022-11-18 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17636012#comment-17636012 ] Will Jones commented on ARROW-18355: This feature is "soft-deprecated" in readr. Do we still want to

[jira] [Assigned] (ARROW-18355) [R] support the quoted_na argument in open_dataset for CSVs by mapping it to CSVConvertOptions$strings_can_be_null

2022-11-18 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-18355: -- Assignee: Will Jones > [R] support the quoted_na argument in open_dataset for CSVs by

[jira] [Created] (ARROW-18359) PrettyPrint Improvements

2022-11-17 Thread Will Jones (Jira)
Will Jones created ARROW-18359: -- Summary: PrettyPrint Improvements Key: ARROW-18359 URL: https://issues.apache.org/jira/browse/ARROW-18359 Project: Apache Arrow Issue Type: Improvement

[jira] [Resolved] (ARROW-15026) [Python] datetime.timedelta to pyarrow.duration('us') silently overflows

2022-11-15 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones resolved ARROW-15026. Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 13718

[jira] [Assigned] (ARROW-14196) [C++][Parquet] Default to compliant nested types in Parquet writer

2022-11-15 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-14196: -- Assignee: Will Jones > [C++][Parquet] Default to compliant nested types in Parquet writer >

[jira] [Resolved] (ARROW-17812) [C++][Documentation] Add Gandiva User Guide

2022-11-08 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones resolved ARROW-17812. Resolution: Fixed Issue resolved by pull request 14200

[jira] [Updated] (ARROW-18246) [Python][Docs] PyArrow table join docstring typos for left and right suffix arguments

2022-11-04 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-18246: --- Component/s: Documentation > [Python][Docs] PyArrow table join docstring typos for left and right

[jira] [Updated] (ARROW-18246) [Python][Docs] PyArrow table join docstring typos for left and right suffix arguments

2022-11-04 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-18246: --- Fix Version/s: 11.0.0 > [Python][Docs] PyArrow table join docstring typos for left and right suffix

[jira] [Commented] (ARROW-18246) [Python][Docs] PyArrow table join docstring typos for left and right suffix arguments

2022-11-04 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629039#comment-17629039 ] Will Jones commented on ARROW-18246: Thanks for reporting. I have created an update fixing those and

[jira] [Assigned] (ARROW-18246) [Python][Docs] PyArrow table join docstring typos for left and right suffix arguments

2022-11-04 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-18246: -- Assignee: Will Jones > [Python][Docs] PyArrow table join docstring typos for left and right

[jira] [Closed] (ARROW-18245) wheels for PyArrow + Python 3.11

2022-11-04 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones closed ARROW-18245. -- Resolution: Duplicate Hello! This is being actively worked on in ARROW-17487. I've closed this ticket

[jira] [Commented] (ARROW-18228) AWS Error SLOW_DOWN during PutObject operation

2022-11-04 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629016#comment-17629016 ] Will Jones commented on ARROW-18228: If you are still getting errors, it might be worth reviewing

[jira] [Commented] (ARROW-18228) AWS Error SLOW_DOWN during PutObject operation

2022-11-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17628391#comment-17628391 ] Will Jones commented on ARROW-18228: I think this may have been caused by

[jira] [Commented] (ARROW-18210) [C++][Parquet] Skip check in StreamWriter

2022-11-03 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17628379#comment-17628379 ] Will Jones commented on ARROW-18210: Created https://issues.apache.org/jira/browse/ARROW-18239 >

[jira] [Created] (ARROW-18239) [C++][Docs] Add examples of Parquet TypedColumnWriter to user guide

2022-11-03 Thread Will Jones (Jira)
Will Jones created ARROW-18239: -- Summary: [C++][Docs] Add examples of Parquet TypedColumnWriter to user guide Key: ARROW-18239 URL: https://issues.apache.org/jira/browse/ARROW-18239 Project: Apache

[jira] [Created] (ARROW-18230) [Python] Pass Cmake args to Python CPP

2022-11-02 Thread Will Jones (Jira)
Will Jones created ARROW-18230: -- Summary: [Python] Pass Cmake args to Python CPP Key: ARROW-18230 URL: https://issues.apache.org/jira/browse/ARROW-18230 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-18204) [R] Allow setting field metadata

2022-10-31 Thread Will Jones (Jira)
Will Jones created ARROW-18204: -- Summary: [R] Allow setting field metadata Key: ARROW-18204 URL: https://issues.apache.org/jira/browse/ARROW-18204 Project: Apache Arrow Issue Type: Improvement

[jira] [Commented] (ARROW-14999) [C++] List types with different field names are not equal

2022-10-24 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623449#comment-17623449 ] Will Jones commented on ARROW-14999: So here are the conclusions I've gathered so far: 1. Equality

[jira] [Assigned] (ARROW-16817) [C++][Python] Segfaults for unsupported datatypes in the ORC writer

2022-10-18 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-16817: -- Assignee: Will Jones (was: Ian Alexander Joiner) > [C++][Python] Segfaults for unsupported

[jira] [Updated] (ARROW-17994) [C++] Add overflow argument is required when it shouldn't be

2022-10-13 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17994: --- Attachment: generate_ibis_queries.py > [C++] Add overflow argument is required when it shouldn't be

[jira] [Updated] (ARROW-17994) [C++] Add overflow argument is required when it shouldn't be

2022-10-13 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17994: --- Attachment: try_queries_acero.py > [C++] Add overflow argument is required when it shouldn't be >

[jira] [Assigned] (ARROW-17069) [Python][R] GCSFIleSystem reports cannot resolve host on public buckets

2022-10-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-17069: -- Assignee: Will Jones > [Python][R] GCSFIleSystem reports cannot resolve host on public

[jira] [Commented] (ARROW-17069) [Python][R] GCSFIleSystem reports cannot resolve host on public buckets

2022-10-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17616691#comment-17616691 ] Will Jones commented on ARROW-17069: Sure. I had done this earlier for R:

[jira] [Commented] (ARROW-17994) [C++] Add overflow argument is required when it shouldn't be

2022-10-11 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17616096#comment-17616096 ] Will Jones commented on ARROW-17994: Example plan that is broken: {code:json} { "extensionUris":

[jira] [Created] (ARROW-17994) [C++] Add overflow argument is required when it shouldn't be

2022-10-11 Thread Will Jones (Jira)
Will Jones created ARROW-17994: -- Summary: [C++] Add overflow argument is required when it shouldn't be Key: ARROW-17994 URL: https://issues.apache.org/jira/browse/ARROW-17994 Project: Apache Arrow

[jira] [Created] (ARROW-17963) [C++] Implement cast_dictionary for string

2022-10-07 Thread Will Jones (Jira)
Will Jones created ARROW-17963: -- Summary: [C++] Implement cast_dictionary for string Key: ARROW-17963 URL: https://issues.apache.org/jira/browse/ARROW-17963 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-17438) [R] glimpse() errors if there is a UDF

2022-10-06 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones resolved ARROW-17438. Resolution: Duplicate > [R] glimpse() errors if there is a UDF >

[jira] [Commented] (ARROW-17438) [R] glimpse() errors if there is a UDF

2022-10-06 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613721#comment-17613721 ] Will Jones commented on ARROW-17438: I just tested, and this is now fixed. (I believe in

[jira] [Closed] (ARROW-16897) [R][C++] Full join on Arrow objects is incorrect

2022-10-06 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones closed ARROW-16897. -- Resolution: Duplicate > [R][C++] Full join on Arrow objects is incorrect >

[jira] [Updated] (ARROW-17149) [R] Enable GCS tests for Windows

2022-10-06 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17149: --- Fix Version/s: 11.0.0 (was: 10.0.0) > [R] Enable GCS tests for Windows >

[jira] [Created] (ARROW-17954) [R] Update News for 10.0.0

2022-10-06 Thread Will Jones (Jira)
Will Jones created ARROW-17954: -- Summary: [R] Update News for 10.0.0 Key: ARROW-17954 URL: https://issues.apache.org/jira/browse/ARROW-17954 Project: Apache Arrow Issue Type: Improvement

[jira] [Commented] (ARROW-14342) [Python] Add support for the SSO credential provider

2022-10-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613238#comment-17613238 ] Will Jones commented on ARROW-14342: In the meantime, you can work around this by using boto3 to

[jira] [Updated] (ARROW-14342) Add support for the SSO credential provider

2022-10-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-14342: --- Fix Version/s: 11.0.0 > Add support for the SSO credential provider >

[jira] [Updated] (ARROW-14342) [Python] Add support for the SSO credential provider

2022-10-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-14342: --- Summary: [Python] Add support for the SSO credential provider (was: Add support for the SSO

[jira] [Commented] (ARROW-14342) Add support for the SSO credential provider

2022-10-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613235#comment-17613235 ] Will Jones commented on ARROW-14342: SSO support was added in aws-sdk-cpp 1.9. Once we upgrade that

[jira] [Created] (ARROW-17944) [Python] Accept bytes object in pyarrow.substrait.run_query

2022-10-05 Thread Will Jones (Jira)
Will Jones created ARROW-17944: -- Summary: [Python] Accept bytes object in pyarrow.substrait.run_query Key: ARROW-17944 URL: https://issues.apache.org/jira/browse/ARROW-17944 Project: Apache Arrow

[jira] [Commented] (ARROW-17349) [C++] Add casting support for map type

2022-10-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17613030#comment-17613030 ] Will Jones commented on ARROW-17349: Yes, I've updated the title. Casting lists only was broken if

[jira] [Updated] (ARROW-17349) [C++] Add casting support for map type

2022-10-05 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17349: --- Summary: [C++] Add casting support for map type (was: [C++] Support casting field names of list

[jira] [Created] (ARROW-17923) [C++] Consider dictionary arrays for special fragment fields

2022-10-03 Thread Will Jones (Jira)
Will Jones created ARROW-17923: -- Summary: [C++] Consider dictionary arrays for special fragment fields Key: ARROW-17923 URL: https://issues.apache.org/jira/browse/ARROW-17923 Project: Apache Arrow

[jira] [Created] (ARROW-17897) [Packaging][Conan] Add back ARROW_GCS to conanfile.py

2022-09-29 Thread Will Jones (Jira)
Will Jones created ARROW-17897: -- Summary: [Packaging][Conan] Add back ARROW_GCS to conanfile.py Key: ARROW-17897 URL: https://issues.apache.org/jira/browse/ARROW-17897 Project: Apache Arrow

[jira] [Assigned] (ARROW-14159) [R] Re-allow some multithreading on Windows

2022-09-26 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-14159: -- Assignee: (was: Will Jones) > [R] Re-allow some multithreading on Windows >

[jira] [Assigned] (ARROW-16880) [R] Test GCS auth with gargle/googleAuthR

2022-09-26 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-16880: -- Assignee: (was: Will Jones) > [R] Test GCS auth with gargle/googleAuthR >

[jira] [Updated] (ARROW-16880) [R] Test GCS auth with gargle/googleAuthR

2022-09-26 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-16880: --- Fix Version/s: (was: 10.0.0) > [R] Test GCS auth with gargle/googleAuthR >

[jira] [Assigned] (ARROW-17069) [Python][R] GCSFIleSystem reports cannot resolve host on public buckets

2022-09-26 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-17069: -- Assignee: (was: Will Jones) > [Python][R] GCSFIleSystem reports cannot resolve host on

[jira] [Commented] (ARROW-16089) [Packaging] Add support for Coan C/C++ package manager

2022-09-26 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17609576#comment-17609576 ] Will Jones commented on ARROW-16089: [~kou] I heard we are waiting on the 10.0.0 release to upstream

[jira] [Created] (ARROW-17845) [CI][Conan] Re-enable Flight in Conan CI check

2022-09-26 Thread Will Jones (Jira)
Will Jones created ARROW-17845: -- Summary: [CI][Conan] Re-enable Flight in Conan CI check Key: ARROW-17845 URL: https://issues.apache.org/jira/browse/ARROW-17845 Project: Apache Arrow Issue

[jira] [Assigned] (ARROW-15838) [C++] Key column behavior in joins

2022-09-22 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-15838: -- Assignee: Will Jones > [C++] Key column behavior in joins >

[jira] [Created] (ARROW-17812) [C++][Documentation] Add Gandiva User Guide

2022-09-21 Thread Will Jones (Jira)
Will Jones created ARROW-17812: -- Summary: [C++][Documentation] Add Gandiva User Guide Key: ARROW-17812 URL: https://issues.apache.org/jira/browse/ARROW-17812 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-17349) [C++] Support casting field names of list and map when nested

2022-09-20 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17607422#comment-17607422 ] Will Jones commented on ARROW-17349: What's actually going on is we don't have any cast kernel for

[jira] [Assigned] (ARROW-17349) [C++] Support casting field names of list and map when nested

2022-09-20 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-17349: -- Assignee: Will Jones > [C++] Support casting field names of list and map when nested >

[jira] [Created] (ARROW-17788) [R][Doc] Add example of using Scanner

2022-09-20 Thread Will Jones (Jira)
Will Jones created ARROW-17788: -- Summary: [R][Doc] Add example of using Scanner Key: ARROW-17788 URL: https://issues.apache.org/jira/browse/ARROW-17788 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-17776) [C++] Stabilize Parquet ArrowReaderProperties

2022-09-19 Thread Will Jones (Jira)
Will Jones created ARROW-17776: -- Summary: [C++] Stabilize Parquet ArrowReaderProperties Key: ARROW-17776 URL: https://issues.apache.org/jira/browse/ARROW-17776 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-17400) [C++] Move Parquet APIs to use Result instead of Status

2022-09-14 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17604933#comment-17604933 ] Will Jones commented on ARROW-17400: [~devavret] Are you still working on this? I did a little bit

[jira] [Commented] (ARROW-17593) [C++] Try and maintain input shape in Acero

2022-09-01 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17599079#comment-17599079 ] Will Jones commented on ARROW-17593: I've been reading through the Parquet implementation, and was

[jira] [Commented] (ARROW-17590) Lower memory usage with filters

2022-09-01 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17599019#comment-17599019 ] Will Jones commented on ARROW-17590: First, I don't believe the row-level filters avoid reading any

[jira] [Assigned] (ARROW-14161) [C++][Parquet][Docs] Reading/Writing Parquet Files

2022-08-31 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-14161: -- Assignee: Will Jones > [C++][Parquet][Docs] Reading/Writing Parquet Files >

[jira] [Assigned] (ARROW-13454) [C++][Docs] Tables vs Record Batches

2022-08-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-13454: -- Assignee: Will Jones > [C++][Docs] Tables vs Record Batches >

[jira] [Commented] (ARROW-15006) [Python][Doc] Iteratively enable more numpydoc checks

2022-08-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17597937#comment-17597937 ] Will Jones commented on ARROW-15006: Great spreadsheet! Best place for developer discussions is the

[jira] [Commented] (ARROW-17459) [C++] Support nested data conversions for chunked array

2022-08-30 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17597915#comment-17597915 ] Will Jones commented on ARROW-17459: We have a section of our docs devoted to [developer setup and

[jira] [Commented] (ARROW-17399) pyarrow may use a lot of memory to load a dataframe from parquet

2022-08-29 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17597400#comment-17597400 ] Will Jones commented on ARROW-17399: Sorry, you are right you had a single column there already. I

[jira] [Commented] (ARROW-17459) [C++] Support nested data conversions for chunked array

2022-08-29 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17597392#comment-17597392 ] Will Jones commented on ARROW-17459: Hi Arthur, Here's a simple repro I created in Python:

[jira] [Commented] (ARROW-17459) [C++] Support nested data conversions for chunked array

2022-08-18 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581502#comment-17581502 ] Will Jones commented on ARROW-17459: I haven't tried this, but perhaps {{GetRecordBatchReader}}

[jira] [Commented] (ARROW-15006) [Python][Doc] Iteratively enable more numpydoc checks

2022-08-18 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581404#comment-17581404 ] Will Jones commented on ARROW-15006: Perhaps we should start with the style-only ones first (PR06:

[jira] [Comment Edited] (ARROW-17441) [Python] Memory kept after del and pool.released_unused()

2022-08-16 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580475#comment-17580475 ] Will Jones edited comment on ARROW-17441 at 8/16/22 9:10 PM: - Going back to

[jira] [Commented] (ARROW-17441) [Python] Memory kept after del and pool.released_unused()

2022-08-16 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580475#comment-17580475 ] Will Jones commented on ARROW-17441: Going back to my original test with Parquet, it does seem like

[jira] [Commented] (ARROW-17441) [Python] Memory kept after del and pool.released_unused()

2022-08-16 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580471#comment-17580471 ] Will Jones commented on ARROW-17441: {quote}I must admit I don't understand the references to

[jira] [Commented] (ARROW-17441) [Python] Memory kept after del and pool.released_unused()

2022-08-16 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17580472#comment-17580472 ] Will Jones commented on ARROW-17441: I reran this in PyArrow 7.0.0 and got results where mimalloc is

[jira] [Updated] (ARROW-17441) [Python] Memory kept after del and pool.released_unused()

2022-08-16 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17441: --- Description: I was trying reproduce another issue involving memory pools not releasing memory, but

[jira] [Created] (ARROW-17441) [Python] Memory kept after del and pool.released_unused()

2022-08-16 Thread Will Jones (Jira)
Will Jones created ARROW-17441: -- Summary: [Python] Memory kept after del and pool.released_unused() Key: ARROW-17441 URL: https://issues.apache.org/jira/browse/ARROW-17441 Project: Apache Arrow

[jira] [Updated] (ARROW-15368) [C++] [Docs] Improve our SIMD documentation

2022-08-15 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-15368: --- Fix Version/s: 10.0.0 > [C++] [Docs] Improve our SIMD documentation >

[jira] [Resolved] (ARROW-17397) [R] Does R API for Apache Arrow has a tableFromIPC function ?

2022-08-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones resolved ARROW-17397. Assignee: Will Jones Resolution: Information Provided > [R] Does R API for Apache Arrow has

[jira] [Commented] (ARROW-17399) pyarrow may use a lot of memory to load a dataframe from parquet

2022-08-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579120#comment-17579120 ] Will Jones commented on ARROW-17399: That's helps narrow it down. Are you able to narrow down and

[jira] [Updated] (ARROW-17400) [C++] Move Parquet APIs to use Result instead of Status

2022-08-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17400: --- Labels: good-first-issue (was: ) > [C++] Move Parquet APIs to use Result instead of Status >

[jira] [Commented] (ARROW-17399) pyarrow may use a lot of memory to load a dataframe from parquet

2022-08-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579071#comment-17579071 ] Will Jones commented on ARROW-17399: Hi Gianluca, There are two conversions happening when reading:

[jira] [Commented] (ARROW-17397) [R] Does R API for Apache Arrow has a tableFromIPC function ?

2022-08-12 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17397?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17579067#comment-17579067 ] Will Jones commented on ARROW-17397: Hi Roy, I think what you are looking for is a

[jira] [Created] (ARROW-17401) [C++] Add ReadTable method to RecordBatchFileReader

2022-08-12 Thread Will Jones (Jira)
Will Jones created ARROW-17401: -- Summary: [C++] Add ReadTable method to RecordBatchFileReader Key: ARROW-17401 URL: https://issues.apache.org/jira/browse/ARROW-17401 Project: Apache Arrow Issue

[jira] [Created] (ARROW-17400) [C++] Move Parquet APIs to use Result instead of Status

2022-08-12 Thread Will Jones (Jira)
Will Jones created ARROW-17400: -- Summary: [C++] Move Parquet APIs to use Result instead of Status Key: ARROW-17400 URL: https://issues.apache.org/jira/browse/ARROW-17400 Project: Apache Arrow

[jira] [Commented] (ARROW-14999) [C++] List types with different field names are not equal

2022-08-11 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578528#comment-17578528 ] Will Jones commented on ARROW-14999: Do you expect to be able to roundtrip that from Parquet? It

[jira] [Assigned] (ARROW-14999) [C++] List types with different field names are not equal

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-14999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones reassigned ARROW-14999: -- Assignee: Will Jones > [C++] List types with different field names are not equal >

[jira] [Updated] (ARROW-12958) [CI][Developer] Build + host the docs for PR branches

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-12958: --- Component/s: Documentation > [CI][Developer] Build + host the docs for PR branches >

[jira] [Updated] (ARROW-12958) [CI][Developer] Build + host the docs for PR branches

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-12958: --- Fix Version/s: 10.0.0 > [CI][Developer] Build + host the docs for PR branches >

[jira] [Commented] (ARROW-12958) [CI][Developer] Build + host the docs for PR branches

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578148#comment-17578148 ] Will Jones commented on ARROW-12958: Alternatively, we could possibly host on Github pages, where a

[jira] [Commented] (ARROW-12958) [CI][Developer] Build + host the docs for PR branches

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578147#comment-17578147 ] Will Jones commented on ARROW-12958: Yeah I think this could likely be solved by: # Create someĀ 

[jira] [Updated] (ARROW-17076) [Python][Docs] Enable building documentation with pyarrow nightly builds

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-17076: --- Fix Version/s: 10.0.0 > [Python][Docs] Enable building documentation with pyarrow nightly builds >

[jira] [Updated] (ARROW-13457) [C++][Docs] Scalars User Guide

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-13457: --- Fix Version/s: 10.0.0 > [C++][Docs] Scalars User Guide > -- > >

[jira] [Updated] (ARROW-13454) [C++][Docs] Tables vs Record Batches

2022-08-10 Thread Will Jones (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Will Jones updated ARROW-13454: --- Fix Version/s: 10.0.0 > [C++][Docs] Tables vs Record Batches >

  1   2   3   4   5   >