[jira] [Assigned] (ARROW-18008) [Python][C++] Add use_threads to run_substrait_query

2022-10-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-18008: --- Assignee: Weston Pace > [Python][C++] Add use_threads to run_substrait_query >

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-10-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17622416#comment-17622416 ] Weston Pace commented on ARROW-17783: - Let's leave this open then. I'm going to una

[jira] [Assigned] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-10-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17783: --- Assignee: (was: Weston Pace) > [C++] Aggregate kernel should not mandate alignment > --

[jira] [Created] (ARROW-18134) [C++][CI] Add Substrait integration testing to CI

2022-10-21 Thread Weston Pace (Jira)
Weston Pace created ARROW-18134: --- Summary: [C++][CI] Add Substrait integration testing to CI Key: ARROW-18134 URL: https://issues.apache.org/jira/browse/ARROW-18134 Project: Apache Arrow Issue

[jira] [Created] (ARROW-18133) [C++] Update "options" handling for Substrait functions

2022-10-21 Thread Weston Pace (Jira)
Weston Pace created ARROW-18133: --- Summary: [C++] Update "options" handling for Substrait functions Key: ARROW-18133 URL: https://issues.apache.org/jira/browse/ARROW-18133 Project: Apache Arrow

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-10-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17622388#comment-17622388 ] Weston Pace commented on ARROW-17783: - Do the compute kernels all have tests to veri

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-10-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17622376#comment-17622376 ] Weston Pace commented on ARROW-17783: - FWIW, the particular check failing here isn't

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-10-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17622362#comment-17622362 ] Weston Pace commented on ARROW-17783: - We would only be requiring 64-byte alignment

[jira] [Created] (ARROW-18119) [C++] Utility method to ensure an array object meetings an alignment requirement

2022-10-20 Thread Weston Pace (Jira)
Weston Pace created ARROW-18119: --- Summary: [C++] Utility method to ensure an array object meetings an alignment requirement Key: ARROW-18119 URL: https://issues.apache.org/jira/browse/ARROW-18119 Projec

[jira] [Comment Edited] (ARROW-18113) Implement a read range process without caching

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621427#comment-17621427 ] Weston Pace edited comment on ARROW-18113 at 10/21/22 12:43 AM: --

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621427#comment-17621427 ] Weston Pace commented on ARROW-18113: - > Just to be clear: to the filesystem, or on

[jira] [Commented] (ARROW-18102) [R] dplyr::count and dplyr::tally implementation return NA instead of 0

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621425#comment-17621425 ] Weston Pace commented on ARROW-18102: - Supposedly both behaviors are useful (returni

[jira] [Assigned] (ARROW-17207) [C++][CI] Occasional timeout failures on arrow-compute-scalar-test

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17207: --- Assignee: Weston Pace > [C++][CI] Occasional timeout failures on arrow-compute-scalar-test

[jira] [Commented] (ARROW-17207) [C++][CI] Occasional timeout failures on arrow-compute-scalar-test

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621404#comment-17621404 ] Weston Pace commented on ARROW-17207: - In this case I think I would prefer splitting

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621402#comment-17621402 ] Weston Pace commented on ARROW-17783: - My concern is less performance and more compl

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621398#comment-17621398 ] Weston Pace commented on ARROW-18113: - On reflection, I don't really prefer my autom

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17621395#comment-17621395 ] Weston Pace commented on ARROW-18115: - CC [~apitrou][~sakras][~michalno][~marsupialt

[jira] [Created] (ARROW-18115) [C++] Acero buffer alignment

2022-10-20 Thread Weston Pace (Jira)
Weston Pace created ARROW-18115: --- Summary: [C++] Acero buffer alignment Key: ARROW-18115 URL: https://issues.apache.org/jira/browse/ARROW-18115 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-18100) [C++] Intermittent failure in TestNewScanner.Backpressure

2022-10-19 Thread Weston Pace (Jira)
Weston Pace created ARROW-18100: --- Summary: [C++] Intermittent failure in TestNewScanner.Backpressure Key: ARROW-18100 URL: https://issues.apache.org/jira/browse/ARROW-18100 Project: Apache Arrow

[jira] [Created] (ARROW-18070) [C++] Valgrind reports leaks from protobuf allocated memory in substrait tests

2022-10-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-18070: --- Summary: [C++] Valgrind reports leaks from protobuf allocated memory in substrait tests Key: ARROW-18070 URL: https://issues.apache.org/jira/browse/ARROW-18070 Project:

[jira] [Updated] (ARROW-18055) [C++] arrow-dataset-dataset-writer-test still times out occassionally

2022-10-15 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18055: Summary: [C++] arrow-dataset-dataset-writer-test still times out occassionally (was: arrow-datase

[jira] [Commented] (ARROW-17559) [R][C++] Regression: big performance hit after removing schema binding

2022-10-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17559?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17617977#comment-17617977 ] Weston Pace commented on ARROW-17559: - [~npr] did we have any benchmarks that were r

[jira] [Resolved] (ARROW-17556) [C++] Unbound scan projection expression leads to all fields being loaded

2022-10-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17556. - Resolution: Fixed Issue resolved by pull request 14264 [https://github.com/apache/arrow/pull/142

[jira] [Commented] (ARROW-18063) [C++][Python] Custom streaming data providers in {{run_query}}

2022-10-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17617900#comment-17617900 ] Weston Pace commented on ARROW-18063: - Another alternative, which might be a more lo

[jira] [Commented] (ARROW-18063) [C++][Python] Custom streaming data providers in {{run_query}}

2022-10-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17617896#comment-17617896 ] Weston Pace commented on ARROW-18063: - {quote} Refactor NamedTableProvider from a la

[jira] [Assigned] (ARROW-18055) arrow-dataset-dataset-writer-test still times out occassionally

2022-10-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-18055: --- Assignee: Weston Pace > arrow-dataset-dataset-writer-test still times out occassionally > -

[jira] [Created] (ARROW-18055) arrow-dataset-dataset-writer-test still times out occassionally

2022-10-14 Thread Weston Pace (Jira)
Weston Pace created ARROW-18055: --- Summary: arrow-dataset-dataset-writer-test still times out occassionally Key: ARROW-18055 URL: https://issues.apache.org/jira/browse/ARROW-18055 Project: Apache Arrow

[jira] [Created] (ARROW-18051) [C++] Enable tests skipped by ARROW-16392

2022-10-14 Thread Weston Pace (Jira)
Weston Pace created ARROW-18051: --- Summary: [C++] Enable tests skipped by ARROW-16392 Key: ARROW-18051 URL: https://issues.apache.org/jira/browse/ARROW-18051 Project: Apache Arrow Issue Type: Im

[jira] [Created] (ARROW-18050) [C++] Substrait consumer should reject plans containing options that it doesn't recognize

2022-10-14 Thread Weston Pace (Jira)
Weston Pace created ARROW-18050: --- Summary: [C++] Substrait consumer should reject plans containing options that it doesn't recognize Key: ARROW-18050 URL: https://issues.apache.org/jira/browse/ARROW-18050

[jira] [Created] (ARROW-18025) [C++] SubstraitSinkConsumer should handle backpressure

2022-10-12 Thread Weston Pace (Jira)
Weston Pace created ARROW-18025: --- Summary: [C++] SubstraitSinkConsumer should handle backpressure Key: ARROW-18025 URL: https://issues.apache.org/jira/browse/ARROW-18025 Project: Apache Arrow I

[jira] [Commented] (ARROW-17292) [C++] Segmentation fault on arrow-compute-hash-join-node-test on macos nightlies

2022-10-12 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616715#comment-17616715 ] Weston Pace commented on ARROW-17292: - The asof join test failure is very useful. I

[jira] [Created] (ARROW-18018) [C++] Potential segmentation fault in unit tests due to usage of AllComplete instead of AllFinished

2022-10-12 Thread Weston Pace (Jira)
Weston Pace created ARROW-18018: --- Summary: [C++] Potential segmentation fault in unit tests due to usage of AllComplete instead of AllFinished Key: ARROW-18018 URL: https://issues.apache.org/jira/browse/ARROW-18018

[jira] [Closed] (ARROW-17931) [C++][CI] Thread Sanitizer failure around the dataset "new scanner" on CI

2022-10-12 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-17931. --- Assignee: Weston Pace Resolution: Fixed This was resolved as part of ARROW-17687. I'm not clo

[jira] [Resolved] (ARROW-17853) [Python][CI] Timeout in test_dataset.py::test_write_dataset_s3_put_only

2022-10-12 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17853. - Resolution: Fixed Issue resolved by pull request 14257 [https://github.com/apache/arrow/pull/142

[jira] [Created] (ARROW-18009) [Python][C++] Add ability for python to specify sink node when running Substrait

2022-10-12 Thread Weston Pace (Jira)
Weston Pace created ARROW-18009: --- Summary: [Python][C++] Add ability for python to specify sink node when running Substrait Key: ARROW-18009 URL: https://issues.apache.org/jira/browse/ARROW-18009 Projec

[jira] [Created] (ARROW-18008) [Python][C++] Add use_threads to run_substrait_query

2022-10-12 Thread Weston Pace (Jira)
Weston Pace created ARROW-18008: --- Summary: [Python][C++] Add use_threads to run_substrait_query Key: ARROW-18008 URL: https://issues.apache.org/jira/browse/ARROW-18008 Project: Apache Arrow Iss

[jira] [Commented] (ARROW-17820) Implement arithmetic kernels on List(number)

2022-10-11 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616184#comment-17616184 ] Weston Pace commented on ARROW-17820: - Could we do this by running the kernel on the

[jira] [Commented] (ARROW-17996) [C++] Potential race condition in readahead generator

2022-10-11 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616112#comment-17616112 ] Weston Pace commented on ARROW-17996: - This commit demonstrates the problem fairly e

[jira] [Created] (ARROW-17996) [C++] Potential race condition in readahead generator

2022-10-11 Thread Weston Pace (Jira)
Weston Pace created ARROW-17996: --- Summary: [C++] Potential race condition in readahead generator Key: ARROW-17996 URL: https://issues.apache.org/jira/browse/ARROW-17996 Project: Apache Arrow Is

[jira] [Commented] (ARROW-17994) [C++] Add overflow argument is required when it shouldn't be

2022-10-11 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616097#comment-17616097 ] Weston Pace commented on ARROW-17994: - Duplicate of ARROW-17966? > [C++] Add overfl

[jira] [Commented] (ARROW-17974) [C++] random function can't actually be used

2022-10-11 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17615975#comment-17615975 ] Weston Pace commented on ARROW-17974: - Probably related are ARROW-16286 and ARROW-16

[jira] [Created] (ARROW-17966) [C++] Adjust to new format for Substrait optional arguments

2022-10-07 Thread Weston Pace (Jira)
Weston Pace created ARROW-17966: --- Summary: [C++] Adjust to new format for Substrait optional arguments Key: ARROW-17966 URL: https://issues.apache.org/jira/browse/ARROW-17966 Project: Apache Arrow

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17614216#comment-17614216 ] Weston Pace commented on ARROW-17913: - Yes, I think caching will always need to be a

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17614191#comment-17614191 ] Weston Pace commented on ARROW-17913: - I think we could add a ReadRanges method to t

[jira] [Commented] (ARROW-17961) Add read/write optimization for pyarrow.fs.S3FileSystem

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17614189#comment-17614189 ] Weston Pace commented on ARROW-17961: - I think David's right. If you know you're go

[jira] [Commented] (ARROW-16211) [C++][Python] Unregister compute functions

2022-10-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17614178#comment-17614178 ] Weston Pace commented on ARROW-16211: - I'm +1 on Yaron's points regarding safety. U

[jira] [Commented] (ARROW-17927) [C++] Sporadic crashes in arrow-dataset-scanner-test

2022-10-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613635#comment-17613635 ] Weston Pace commented on ARROW-17927: - {quote} I'll note that more than 400 threads

[jira] [Commented] (ARROW-17937) [C++] Building of Arrow C++ (dataset) errors on Windows

2022-10-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613229#comment-17613229 ] Weston Pace commented on ARROW-17937: - This isn't new exactly. {{WriteNodeOptions}}

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613228#comment-17613228 ] Weston Pace commented on ARROW-17913: - As a workaround in the meantime there is alwa

[jira] [Commented] (ARROW-17913) feather.read_table 150x slower when reading columns in newer versions

2022-10-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613226#comment-17613226 ] Weston Pace commented on ARROW-17913: - I'm not sure {{ReadRangeCache}} is the answer

[jira] [Closed] (ARROW-13952) [Python] Add initial type testing for compute kernels

2022-10-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-13952. --- Resolution: Not A Problem I haven't heard much worry recently about type support for kernels and our

[jira] [Commented] (ARROW-17740) [c++][compute]Is there any other way to use Join besides Acero?

2022-10-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613157#comment-17613157 ] Weston Pace commented on ARROW-17740: - {quote} Looks like there's something even mor

[jira] [Commented] (ARROW-16211) [C++][Python] Unregister compute functions

2022-10-05 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17613130#comment-17613130 ] Weston Pace commented on ARROW-16211: - I think both use cases are probably useful an

[jira] [Resolved] (ARROW-17687) [C++] ScanningStress test is flaky in CI

2022-10-04 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17687. - Resolution: Fixed Issue resolved by pull request 14314 [https://github.com/apache/arrow/pull/143

[jira] [Closed] (ARROW-17682) [CI][C++] Nightly test-ubuntu-20.04-cpp-thread-sanitizer fails arrow-utility-test around the AsyncTaskScheduler

2022-10-04 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-17682. --- Resolution: Duplicate Technically this one came first but closing as a duplicate of ARROW-17687 as i

[jira] [Assigned] (ARROW-17292) [C++] Segmentation fault on arrow-compute-hash-join-node-test on macos nightlies

2022-10-04 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17292: --- Assignee: Vibhatha Lakmal Abeykoon > [C++] Segmentation fault on arrow-compute-hash-join-no

[jira] [Resolved] (ARROW-17287) [C++] Create scan node that doesn't rely on the merged generator

2022-10-03 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17287. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13782 [https://git

[jira] [Commented] (ARROW-17836) [C++] Allow specifying of alignment in MemoryPool's allocations

2022-09-27 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17610137#comment-17610137 ] Weston Pace commented on ARROW-17836: - Yes, the direct I/O PR offers a generic files

[jira] [Updated] (ARROW-17852) [python] `dtype` of `Categorical` category columns are not preserved

2022-09-27 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-17852: Summary: [python] `dtype` of `Categorical` category columns are not preserved (was: `dtype` of `C

[jira] [Commented] (ARROW-17836) [C++] Allow specifying of alignment in MemoryPool's allocations

2022-09-27 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17610098#comment-17610098 ] Weston Pace commented on ARROW-17836: - I suspect this was a typo and meant to be 512

[jira] [Resolved] (ARROW-17736) [C++] Add fallback for shorthand Substrait URIs without scheme

2022-09-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17736. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 14143 [https://git

[jira] [Assigned] (ARROW-17614) [CI][Python] test test_write_dataset_max_rows_per_file is producing several nightly build failures

2022-09-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17614: --- Assignee: Weston Pace > [CI][Python] test test_write_dataset_max_rows_per_file is producing

[jira] [Resolved] (ARROW-17614) [CI][Python] test test_write_dataset_max_rows_per_file is producing several nightly build failures

2022-09-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17614. - Resolution: Fixed Issue resolved by pull request 14199 [https://github.com/apache/arrow/pull/141

[jira] [Commented] (ARROW-16958) [C++][FlightRPC] Flight generates misaligned buffers

2022-09-22 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17608449#comment-17608449 ] Weston Pace commented on ARROW-16958: - I can look into {{CheckAlignment}} but I stil

[jira] [Assigned] (ARROW-17556) [C++] Unbound scan projection expression leads to all fields being loaded

2022-09-22 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17556: --- Assignee: Vibhatha Lakmal Abeykoon (was: Weston Pace) > [C++] Unbound scan projection expr

[jira] [Commented] (ARROW-17802) [R] Merging multi file datasets on particular columns that are present in all the datasets.

2022-09-22 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17608360#comment-17608360 ] Weston Pace commented on ARROW-17802: - Example of the join {noformat} library(arrow

[jira] [Commented] (ARROW-17740) [c++][compute]Is there any other way to use Join besides Acero?

2022-09-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17608025#comment-17608025 ] Weston Pace commented on ARROW-17740: - {quote} I would like to ask if we can set pro

[jira] [Updated] (ARROW-17740) [c++][compute]Is there any other way to use Join besides Acero?

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-17740: Attachment: test_join.cpp > [c++][compute]Is there any other way to use Join besides Acero? >

[jira] [Commented] (ARROW-17740) [c++][compute]Is there any other way to use Join besides Acero?

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607435#comment-17607435 ] Weston Pace commented on ARROW-17740: - Thanks for uploading the new example. I was

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607418#comment-17607418 ] Weston Pace commented on ARROW-17783: - Hm, the [format page|https://arrow.apache.or

[jira] [Commented] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607390#comment-17607390 ] Weston Pace commented on ARROW-17783: - Yes, I'll try and find some time to look at t

[jira] [Assigned] (ARROW-17783) [C++] Aggregate kernel should not mandate alignment

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17783: --- Assignee: Weston Pace > [C++] Aggregate kernel should not mandate alignment > -

[jira] [Commented] (ARROW-17599) [C++] ReadRangeCache should not retain data after read

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607293#comment-17607293 ] Weston Pace commented on ARROW-17599: - Although the more I think about it the less I

[jira] [Commented] (ARROW-17599) [C++] ReadRangeCache should not retain data after read

2022-09-20 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17607287#comment-17607287 ] Weston Pace commented on ARROW-17599: - Ack. I was not really aware that was how the

[jira] [Resolved] (ARROW-17647) [C++] Using better namespace style when using protobuf with Substrait

2022-09-19 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17647. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 14121 https://gith

[jira] [Comment Edited] (ARROW-17484) [C++] Substrait to Arrow Aggregate doesn't take the provided Output Type for aggregates

2022-09-19 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17606681#comment-17606681 ] Weston Pace edited comment on ARROW-17484 at 9/19/22 5:19 PM:

[jira] [Commented] (ARROW-17484) [C++] Substrait to Arrow Aggregate doesn't take the provided Output Type for aggregates

2022-09-19 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17606681#comment-17606681 ] Weston Pace commented on ARROW-17484: - Aggregate functions typically have very small

[jira] [Created] (ARROW-17762) [C++] Add ordering information to exec batches

2022-09-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-17762: --- Summary: [C++] Add ordering information to exec batches Key: ARROW-17762 URL: https://issues.apache.org/jira/browse/ARROW-17762 Project: Apache Arrow Issue Typ

[jira] [Created] (ARROW-17758) [C++] Add OT spans / events to new scan node

2022-09-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-17758: --- Summary: [C++] Add OT spans / events to new scan node Key: ARROW-17758 URL: https://issues.apache.org/jira/browse/ARROW-17758 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-17757) [C++] Support type promotion in basic evolution strategy

2022-09-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-17757: --- Summary: [C++] Support type promotion in basic evolution strategy Key: ARROW-17757 URL: https://issues.apache.org/jira/browse/ARROW-17757 Project: Apache Arrow

[jira] [Created] (ARROW-17756) [C++] Switch bindings-scanner to the new scan node, remove the old scan node

2022-09-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-17756: --- Summary: [C++] Switch bindings-scanner to the new scan node, remove the old scan node Key: ARROW-17756 URL: https://issues.apache.org/jira/browse/ARROW-17756 Project: A

[jira] [Created] (ARROW-17755) [C++] Add pause capability to async task scheduler and support pause producing in new scan node

2022-09-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-17755: --- Summary: [C++] Add pause capability to async task scheduler and support pause producing in new scan node Key: ARROW-17755 URL: https://issues.apache.org/jira/browse/ARROW-17755

[jira] [Commented] (ARROW-17754) [C++] Add Substrait Function Tests for Filter and Join

2022-09-16 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605855#comment-17605855 ] Weston Pace commented on ARROW-17754: - I'm not sure I understand what this means. T

[jira] [Commented] (ARROW-17719) [Python] Improve error message when all values in a column are null in a parquet partition

2022-09-16 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605793#comment-17605793 ] Weston Pace commented on ARROW-17719: - Although, I suppose I am thinking of the alte

[jira] [Commented] (ARROW-17719) [Python] Improve error message when all values in a column are null in a parquet partition

2022-09-16 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605791#comment-17605791 ] Weston Pace commented on ARROW-17719: - The performance impact is pretty small when y

[jira] [Created] (ARROW-17736) [C++] Add fallback for shorthand Substrait URIs without scheme

2022-09-14 Thread Weston Pace (Jira)
Weston Pace created ARROW-17736: --- Summary: [C++] Add fallback for shorthand Substrait URIs without scheme Key: ARROW-17736 URL: https://issues.apache.org/jira/browse/ARROW-17736 Project: Apache Arrow

[jira] [Updated] (ARROW-17686) [C++] AsofJoinBasicParams has no gtest printer defined, leading to valgrind errors

2022-09-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-17686: Attachment: valgrind.txt > [C++] AsofJoinBasicParams has no gtest printer defined, leading to valg

[jira] [Commented] (ARROW-17686) [C++] AsofJoinBasicParams has no gtest printer defined, leading to valgrind errors

2022-09-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605107#comment-17605107 ] Weston Pace commented on ARROW-17686: - Can you try running the valgrind nightly test

[jira] [Assigned] (ARROW-17517) [C++] "arrow/engine/api.h" exposes internal headers

2022-09-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17517?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17517: --- Assignee: Weston Pace (was: Antoine Pitrou) > [C++] "arrow/engine/api.h" exposes internal

[jira] [Commented] (ARROW-17686) [C++] AsofJoinBasicParams has no gtest printer defined, leading to valgrind errors

2022-09-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17604968#comment-17604968 ] Weston Pace commented on ARROW-17686: - Also, my machine is Ubuntu, x86_64 and I get

[jira] [Commented] (ARROW-17686) [C++] AsofJoinBasicParams has no gtest printer defined, leading to valgrind errors

2022-09-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17604967#comment-17604967 ] Weston Pace commented on ARROW-17686: - On Linux I make sure to build (with tests on)

[jira] [Resolved] (ARROW-17521) [Python] Add python bindings for NamedTableProvider for Substrait consumer

2022-09-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17521. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 14024 https://gith

[jira] [Resolved] (ARROW-15584) [C++] Add support for Substrait's RelCommon::Emit

2022-09-13 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-15584. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13914 https://gith

[jira] [Commented] (ARROW-17599) [C++] ReadRangeCache should not retain data after read

2022-09-12 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17603200#comment-17603200 ] Weston Pace commented on ARROW-17599: - > Should ReadRangeCache::read remove the cach

[jira] [Created] (ARROW-17687) [C++] ScanningStress test is flaky in CI

2022-09-12 Thread Weston Pace (Jira)
Weston Pace created ARROW-17687: --- Summary: [C++] ScanningStress test is flaky in CI Key: ARROW-17687 URL: https://issues.apache.org/jira/browse/ARROW-17687 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-17686) [C++] AsofJoinBasicParams has no gtest printer defined, leading to valgrind errors

2022-09-12 Thread Weston Pace (Jira)
Weston Pace created ARROW-17686: --- Summary: [C++] AsofJoinBasicParams has no gtest printer defined, leading to valgrind errors Key: ARROW-17686 URL: https://issues.apache.org/jira/browse/ARROW-17686 Proj

[jira] [Resolved] (ARROW-17412) [C++] AsofJoin multiple keys and types

2022-09-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17412. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13880 https://gith

[jira] [Updated] (ARROW-16855) [C++] Adding Read Relation ToProto

2022-09-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-16855: Component/s: C++ > [C++] Adding Read Relation ToProto > -- > >

[jira] [Resolved] (ARROW-16855) [C++] Adding Read Relation ToProto

2022-09-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-16855. - Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13401 https://gith

[jira] [Updated] (ARROW-17648) [C++] Add ScanOptions to support projection and filter in ToProto Read

2022-09-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-17648: Description: When converting from an Acero plan to a Substrait plan not all scan options are full

<    1   2   3   4   5   6   7   8   9   10   >