[jira] [Resolved] (ARROW-18427) [C++] Support negative tolerance in `AsofJoinNode`

2023-01-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18427. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14934

[jira] [Assigned] (ARROW-16795) [C#][Flight] Nightly verify-rc-source-csharp-macos-arm64 fails

2023-01-06 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-16795: --- Assignee: Weston Pace > [C#][Flight] Nightly verify-rc-source-csharp-macos-arm64 fails >

[jira] [Commented] (ARROW-16212) [C++][Python] Register Multiple Kernels for a UDF

2023-01-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17653650#comment-17653650 ] Weston Pace commented on ARROW-16212: - Sorry for the delay in reviewing. I put in a review now.

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2023-01-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17653642#comment-17653642 ] Weston Pace commented on ARROW-18400: - {quote} The reason this happens for parquet and not for

[jira] [Resolved] (ARROW-17980) [C++] As-of-Join Substrait extension

2022-12-31 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17980. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14485

[jira] [Resolved] (ARROW-15732) [C++] Do not use any CPU threads in execution plan when use_threads is false

2022-12-30 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-15732. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 15104

[jira] [Resolved] (ARROW-17837) [C++] Create ExecPlan-owned QueryContext that will store a plan's shared data structures

2022-12-22 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17837. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14227

[jira] [Resolved] (ARROW-15592) [C++] Add support for custom output field names in a substrait::PlanRel

2022-12-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-15592. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14292

[jira] [Resolved] (ARROW-17520) [C++] Implement SubStrait SetRel (UnionAll)

2022-12-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17520. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14186

[jira] [Assigned] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-13 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-12264: --- Assignee: Sanjiban Sengupta (was: Weston Pace) > [C++][Dataset] Handle NaNs correctly in

[jira] [Assigned] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-12 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-12264: --- Assignee: Weston Pace > [C++][Dataset] Handle NaNs correctly in Parquet predicate

[jira] [Commented] (ARROW-18431) Acero's Execution Plan never finishes.

2022-12-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645419#comment-17645419 ] Weston Pace commented on ARROW-18431: - Are you able to provide some more information on the

[jira] [Assigned] (ARROW-18431) Acero's Execution Plan never finishes.

2022-12-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-18431: --- Assignee: Weston Pace > Acero's Execution Plan never finishes. >

[jira] [Commented] (ARROW-12264) [C++][Dataset] Handle NaNs correctly in Parquet predicate push-down

2022-12-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645416#comment-17645416 ] Weston Pace commented on ARROW-12264: - Ok, so if I understand correctly, {{min}} may improperly be

[jira] [Commented] (ARROW-4283) [Python] Should RecordBatchStreamReader/Writer be AsyncIterable?

2022-12-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645401#comment-17645401 ] Weston Pace commented on ARROW-4283: Also, a note on {{RecordBatchStreamWriter}}. In most cases you

[jira] [Comment Edited] (ARROW-4283) [Python] Should RecordBatchStreamReader/Writer be AsyncIterable?

2022-12-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645386#comment-17645386 ] Weston Pace edited comment on ARROW-4283 at 12/9/22 4:44 PM: - Things have

[jira] [Commented] (ARROW-4283) [Python] Should RecordBatchStreamReader/Writer be AsyncIterable?

2022-12-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17645386#comment-17645386 ] Weston Pace commented on ARROW-4283: Things have changed a bit since 2019. The

[jira] [Closed] (ARROW-18367) [C++] Enable the creation of named table relations

2022-12-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-18367. --- Resolution: Fixed Fixed by PR https://github.com/apache/arrow/pull/14681 > [C++] Enable the

[jira] [Closed] (ARROW-18402) [C++] Expose `DeclarationInfo`

2022-12-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-18402. --- Resolution: Fixed Fixed by PR https://github.com/apache/arrow/pull/14765 > [C++] Expose

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-29 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17641015#comment-17641015 ] Weston Pace commented on ARROW-18265: - Ok. I think I understand now. The problem isn't how the

[jira] [Resolved] (ARROW-18406) [C++] Can't build Arrow with Substrait on Ubuntu 20.04

2022-11-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18406. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14735

[jira] [Assigned] (ARROW-18406) [C++] Can't build Arrow with Substrait on Ubuntu 20.04

2022-11-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-18406: --- Assignee: Weston Pace > [C++] Can't build Arrow with Substrait on Ubuntu 20.04 >

[jira] [Commented] (ARROW-18402) [C++] Expose `DeclarationInfo`

2022-11-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638787#comment-17638787 ] Weston Pace commented on ARROW-18402: - This is a good idea. I'd like to eventually support a

[jira] [Commented] (ARROW-18240) [R] head() is crashing on some nightly builds

2022-11-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638785#comment-17638785 ] Weston Pace commented on ARROW-18240: - Threads #3 and #6 are suspicious. Generally a CPU thread

[jira] [Commented] (ARROW-18408) [C++] Add nightly test that uses an older version of protoc

2022-11-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18408?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638754#comment-17638754 ] Weston Pace commented on ARROW-18408: - [~vibhatha] I think you've been working on adding new nightly

[jira] [Created] (ARROW-18408) [C++] Add nightly test that uses an older version of protoc

2022-11-25 Thread Weston Pace (Jira)
Weston Pace created ARROW-18408: --- Summary: [C++] Add nightly test that uses an older version of protoc Key: ARROW-18408 URL: https://issues.apache.org/jira/browse/ARROW-18408 Project: Apache Arrow

[jira] [Commented] (ARROW-18400) [Python] Quadratic memory usage of Table.to_pandas with nested data

2022-11-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17638750#comment-17638750 ] Weston Pace commented on ARROW-18400: - It might be a good idea to test if the memory usage still

[jira] [Resolved] (ARROW-17966) [C++] Adjust to new format for Substrait optional arguments

2022-11-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17966. - Resolution: Fixed Issue resolved by pull request 14415

[jira] [Created] (ARROW-18388) [C++] Decide on duplicate column handling in scanner, add more tests

2022-11-22 Thread Weston Pace (Jira)
Weston Pace created ARROW-18388: --- Summary: [C++] Decide on duplicate column handling in scanner, add more tests Key: ARROW-18388 URL: https://issues.apache.org/jira/browse/ARROW-18388 Project: Apache

[jira] [Created] (ARROW-18387) [C++] Create many-column scanner microbenchmarks

2022-11-22 Thread Weston Pace (Jira)
Weston Pace created ARROW-18387: --- Summary: [C++] Create many-column scanner microbenchmarks Key: ARROW-18387 URL: https://issues.apache.org/jira/browse/ARROW-18387 Project: Apache Arrow Issue

[jira] [Created] (ARROW-18386) [C++] Add support for filename, file index, and batch index columns to exec plan based scanner

2022-11-22 Thread Weston Pace (Jira)
Weston Pace created ARROW-18386: --- Summary: [C++] Add support for filename, file index, and batch index columns to exec plan based scanner Key: ARROW-18386 URL: https://issues.apache.org/jira/browse/ARROW-18386

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-22 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637392#comment-17637392 ] Weston Pace commented on ARROW-18265: - I agree {{pc.field(0)}} should not mean to select the first

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-22 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17637308#comment-17637308 ] Weston Pace commented on ARROW-18265: - Hmm, I am not sure I understand. I think

[jira] [Resolved] (ARROW-17610) [C++] Support additional source types in SourceNode

2022-11-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17610. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14207

[jira] [Commented] (ARROW-18371) [C++] Expose *FromJSON helpers

2022-11-21 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17636820#comment-17636820 ] Weston Pace commented on ARROW-18371: - {{MakeBasicBatches}} I agree is a definite no. The new

[jira] [Resolved] (ARROW-18342) [C++] AsofJoinNode support for Boolean data field

2022-11-17 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18342. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14658

[jira] [Commented] (ARROW-18347) [C++] Hook up cancellation to exec plan

2022-11-17 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635351#comment-17635351 ] Weston Pace commented on ARROW-18347: - No. The main tasks still blocking ARROW-15732 are

[jira] [Created] (ARROW-18347) [C++] Hook up cancellation to exec plan

2022-11-16 Thread Weston Pace (Jira)
Weston Pace created ARROW-18347: --- Summary: [C++] Hook up cancellation to exec plan Key: ARROW-18347 URL: https://issues.apache.org/jira/browse/ARROW-18347 Project: Apache Arrow Issue Type:

[jira] [Closed] (ARROW-15139) [Python] write_dataset's file_write_options are too confusing and/or undocumented

2022-11-16 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace closed ARROW-15139. --- Resolution: Duplicate Closing this in favor of ARROW-18346 as David is more eloquent there :) >

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-11-16 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634916#comment-17634916 ] Weston Pace commented on ARROW-18113: - {quote} or we just can include caching.h in every filesystem

[jira] [Commented] (ARROW-18334) add function for timestamp/duration is not commutative

2022-11-15 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634553#comment-17634553 ] Weston Pace commented on ARROW-18334: - Example output type resolver (from [~bkietz]): {noformat}

[jira] [Updated] (ARROW-18334) add function for timestamp/duration is not commutative

2022-11-15 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18334: Description: The expression simplification currently has a small set of functions which it knows

[jira] [Updated] (ARROW-18334) add function for timestamp/duration is not commutative

2022-11-15 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18334: Summary: add function for timestamp/duration is not commutative (was: Expression::Canonicalize

[jira] [Commented] (ARROW-18334) Expression::Canonicalize does not unbind the expression from a kernel

2022-11-15 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634552#comment-17634552 ] Weston Pace commented on ARROW-18334: - I had an offline discussion with [~bkietz] on this issue: *

[jira] [Created] (ARROW-18334) Expression::Canonicalize does not unbind the expression from a kernel

2022-11-15 Thread Weston Pace (Jira)
Weston Pace created ARROW-18334: --- Summary: Expression::Canonicalize does not unbind the expression from a kernel Key: ARROW-18334 URL: https://issues.apache.org/jira/browse/ARROW-18334 Project: Apache

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17634111#comment-17634111 ] Weston Pace commented on ARROW-18113: - I would think that coalescing would be an implementation

[jira] [Created] (ARROW-18328) [C++] Remove legacy scanner code where possible

2022-11-14 Thread Weston Pace (Jira)
Weston Pace created ARROW-18328: --- Summary: [C++] Remove legacy scanner code where possible Key: ARROW-18328 URL: https://issues.apache.org/jira/browse/ARROW-18328 Project: Apache Arrow Issue

[jira] [Assigned] (ARROW-17288) [C++] Create fragment scanners for csv/parquet/orc/ipc

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17288: --- Assignee: Weston Pace > [C++] Create fragment scanners for csv/parquet/orc/ipc >

[jira] [Commented] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633988#comment-17633988 ] Weston Pace commented on ARROW-15474: - I believe https://issues.apache.org/jira/browse/ARROW-15735

[jira] [Commented] (ARROW-18275) [Python] Allow custom reader/writer implementation for arrow dataset read/write path

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633977#comment-17633977 ] Weston Pace commented on ARROW-18275: - {quote} 1. Custom formats can tell pa.dataset.dataset to use

[jira] [Commented] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633961#comment-17633961 ] Weston Pace commented on ARROW-15716: - Ah, yes. So if you wanted to simplify I agree it would be

[jira] [Comment Edited] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633943#comment-17633943 ] Weston Pace edited comment on ARROW-15474 at 11/14/22 5:31 PM: --- {quote}

[jira] [Commented] (ARROW-15474) [Python] Possibility of a table.drop_duplicates() function?

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15474?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633943#comment-17633943 ] Weston Pace commented on ARROW-15474: - {quote} Maybe even ordering function can be specified so

[jira] [Resolved] (ARROW-18310) [C++] Use atomic backpressure counter

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18310. - Resolution: Fixed Issue resolved by pull request 14622

[jira] [Commented] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633927#comment-17633927 ] Weston Pace commented on ARROW-15716: - I'm not certain I understand. A partition key will always be

[jira] [Commented] (ARROW-18269) [C++] Slash character in partition value handling

2022-11-14 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17633925#comment-17633925 ] Weston Pace commented on ARROW-18269: - On writing, we encode each component. If there is a `/` in

[jira] [Commented] (ARROW-18269) [C++] Slash character in partition value handling

2022-11-10 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17632074#comment-17632074 ] Weston Pace commented on ARROW-18269: - I think we should encode and decode the URIs for the user.

[jira] [Commented] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-11-10 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17631792#comment-17631792 ] Weston Pace commented on ARROW-15716: - I am pretty sure the operator is always OR based on: {quote}

[jira] [Resolved] (ARROW-17509) [C++] Simplify async scheduler by removing the need to call End

2022-11-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17509. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14524

[jira] [Assigned] (ARROW-17509) [C++] Simplify async scheduler by removing the need to call End

2022-11-09 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-17509: --- Assignee: Weston Pace > [C++] Simplify async scheduler by removing the need to call End >

[jira] [Commented] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-11-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630776#comment-17630776 ] Weston Pace commented on ARROW-15716: - [~vibhatha] I'm not sure {{new_table =

[jira] [Commented] (ARROW-18269) [C++] Slash character in partition value handling

2022-11-08 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630536#comment-17630536 ] Weston Pace commented on ARROW-18269: - I'm not personally working on this at the moment but it seems

[jira] [Commented] (ARROW-15716) [Dataset][Python] Parse a list of fragment paths to gather filters

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17630089#comment-17630089 ] Weston Pace commented on ARROW-15716: - If I understand correctly, your goal is to get a list of

[jira] [Updated] (ARROW-18269) [C++] Slash character in partition value handling

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18269: Summary: [C++] Slash character in partition value handling (was: Slash character in partition

[jira] [Updated] (ARROW-18269) Slash character in partition value handling

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18269: Labels: good-first-issue (was: ) > Slash character in partition value handling >

[jira] [Commented] (ARROW-18269) Slash character in partition value handling

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1763#comment-1763 ] Weston Pace commented on ARROW-18269: - Today we URI unescape fields as we read them. For this to

[jira] [Updated] (ARROW-18269) [C++] Slash character in partition value handling

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18269: Component/s: C++ > [C++] Slash character in partition value handling >

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629969#comment-17629969 ] Weston Pace commented on ARROW-18265: - Is this just for fixed size lists? Or also for variable

[jira] [Commented] (ARROW-18265) [C++] Allow FieldPath to work with ListElement

2022-11-07 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629966#comment-17629966 ] Weston Pace commented on ARROW-18265: - Note that the {{list_element}} compute function exists which

[jira] [Comment Edited] (ARROW-17820) [C++] Implement arithmetic kernels on List(number)

2022-11-04 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629225#comment-17629225 ] Weston Pace edited comment on ARROW-17820 at 11/4/22 11:24 PM: --- {quote}

[jira] [Commented] (ARROW-17820) [C++] Implement arithmetic kernels on List(number)

2022-11-04 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17629225#comment-17629225 ] Weston Pace commented on ARROW-17820: - {quote} Such an approach doesn't really fit our kernels /

[jira] [Resolved] (ARROW-18183) [C++] cpp-micro benchmarks are failing on mac arm machine

2022-11-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18183. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14562

[jira] [Resolved] (ARROW-18051) [C++] Enable tests skipped by ARROW-16392

2022-11-02 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18051. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14425

[jira] [Assigned] (ARROW-18183) [C++] cpp-micro benchmarks are failing on mac arm machine

2022-11-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace reassigned ARROW-18183: --- Assignee: Weston Pace > [C++] cpp-micro benchmarks are failing on mac arm machine >

[jira] [Updated] (ARROW-18183) [C++] cpp-micro benchmarks are failing on mac arm machine

2022-11-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18183: Summary: [C++] cpp-micro benchmarks are failing on mac arm machine (was: [C++]cpp-micro

[jira] [Updated] (ARROW-18183) [C++]cpp-micro benchmarks are failing on mac arm machine

2022-11-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace updated ARROW-18183: Summary: [C++]cpp-micro benchmarks are failing on mac arm machine (was: cpp-micro benchmarks are

[jira] [Resolved] (ARROW-17640) [C++] Add File Handling Test cases for GlobFile handling in Substrait Read

2022-11-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17640. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14132

[jira] [Resolved] (ARROW-18205) [C++] Substrait consumer is not converting right side references correctly on joins

2022-11-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18205. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14558

[jira] [Commented] (ARROW-18183) cpp-micro benchmarks are failing on mac arm machine

2022-11-01 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17627172#comment-17627172 ] Weston Pace commented on ARROW-18183: - Thank you. I will look at this today. > cpp-micro

[jira] [Created] (ARROW-18205) [C++] Substrait consumer is not converting right side references correctly on joins

2022-10-31 Thread Weston Pace (Jira)
Weston Pace created ARROW-18205: --- Summary: [C++] Substrait consumer is not converting right side references correctly on joins Key: ARROW-18205 URL: https://issues.apache.org/jira/browse/ARROW-18205

[jira] [Created] (ARROW-18193) [C++] Acero should reject Substrait plans that require an implicit cast from decimal to float

2022-10-28 Thread Weston Pace (Jira)
Weston Pace created ARROW-18193: --- Summary: [C++] Acero should reject Substrait plans that require an implicit cast from decimal to float Key: ARROW-18193 URL: https://issues.apache.org/jira/browse/ARROW-18193

[jira] [Commented] (ARROW-17984) pq.read_table doesn't seem to be thread safe

2022-10-28 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17625884#comment-17625884 ] Weston Pace commented on ARROW-17984: - Can you share the full output of {{thread apply all bt}}?

[jira] [Commented] (ARROW-17774) [Python] write csv decimal cast error

2022-10-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17624677#comment-17624677 ] Weston Pace commented on ARROW-17774: - The underlying issues appears to have been solved. I just

[jira] [Resolved] (ARROW-17458) [C++] CSV Writer: Unsupported cast from decimal to utf8

2022-10-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-17458. - Resolution: Fixed Issue resolved by pull request 14232

[jira] [Commented] (ARROW-18156) [Python/C++] High memory usage/potential leak when reading parquet using Dataset API

2022-10-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17624620#comment-17624620 ] Weston Pace commented on ARROW-18156: - FWIW, I get similar results with Arrow 4 from pip. >

[jira] [Commented] (ARROW-18156) [Python/C++] High memory usage/potential leak when reading parquet using Dataset API

2022-10-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17624612#comment-17624612 ] Weston Pace commented on ARROW-18156: - Another experiment might be adding a five second sleep

[jira] [Comment Edited] (ARROW-18156) [Python/C++] High memory usage/potential leak when reading parquet using Dataset API

2022-10-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17624593#comment-17624593 ] Weston Pace edited comment on ARROW-18156 at 10/26/22 4:38 PM: --- {quote}

[jira] [Commented] (ARROW-18156) [Python/C++] High memory usage/potential leak when reading parquet using Dataset API

2022-10-26 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17624593#comment-17624593 ] Weston Pace commented on ARROW-18156: - {quote} What else could affect behavior here? Python version?

[jira] [Commented] (ARROW-18156) [Python/C++] High memory usage/potential leak when reading parquet using Dataset API

2022-10-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17624010#comment-17624010 ] Weston Pace commented on ARROW-18156: - I tested on both pyarrow 9.0.0 and 4.0.0 with the test data

[jira] [Commented] (ARROW-18160) [C++] Scanner slicing large row groups leads to inefficient RAM usage

2022-10-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623958#comment-17623958 ] Weston Pace commented on ARROW-18160: - Oh. Yes, that is copying. Sorry, didn't read your message

[jira] [Commented] (ARROW-18160) [C++] Scanner slicing large row groups leads to inefficient RAM usage

2022-10-25 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623957#comment-17623957 ] Weston Pace commented on ARROW-18160: - Well, there is a somewhat drastic approach, which would be to

[jira] [Created] (ARROW-18160) [C++] Scanner slicing large row groups leads to inefficient RAM usage

2022-10-25 Thread Weston Pace (Jira)
Weston Pace created ARROW-18160: --- Summary: [C++] Scanner slicing large row groups leads to inefficient RAM usage Key: ARROW-18160 URL: https://issues.apache.org/jira/browse/ARROW-18160 Project: Apache

[jira] [Resolved] (ARROW-18137) [Python][Docs] Allow passing no aggregations to TableGroupBy.aggregate

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weston Pace resolved ARROW-18137. - Fix Version/s: 11.0.0 Resolution: Fixed Issue resolved by pull request 14482

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623417#comment-17623417 ] Weston Pace commented on ARROW-18115: - {quote} I'm not sure which PR it is about, but I think

[jira] [Commented] (ARROW-18113) Implement a read range process without caching

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623386#comment-17623386 ] Weston Pace commented on ARROW-18113: - {quote} Linking io_uring to the Future API could be a bit

[jira] [Commented] (ARROW-18115) [C++] Acero buffer alignment

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623337#comment-17623337 ] Weston Pace commented on ARROW-18115: - [~sakras] FYI, there was some further discussion on this

[jira] [Commented] (ARROW-18114) [R] unify_schemas=FALSE does not improve open_dataset() read times

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623334#comment-17623334 ] Weston Pace commented on ARROW-18114: - Yes, I would expect there to be a difference. I'll try and

[jira] [Commented] (ARROW-16029) [Python] Runaway process with generator in "write_dataset()"

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623330#comment-17623330 ] Weston Pace commented on ARROW-16029: - {quote} Weston Pace that might be related to the fact that

[jira] [Commented] (ARROW-18140) The metadata info will lost in parquet file schema after writing the parquet file by calling the FileSystemDataset::Write() method.

2022-10-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-18140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17623326#comment-17623326 ] Weston Pace commented on ARROW-18140: - This could definitely be improved. The write node, in Acero,

[jira] [Created] (ARROW-18145) [C++] Populate Substrait producer version from cmake config variables

2022-10-24 Thread Weston Pace (Jira)
Weston Pace created ARROW-18145: --- Summary: [C++] Populate Substrait producer version from cmake config variables Key: ARROW-18145 URL: https://issues.apache.org/jira/browse/ARROW-18145 Project: Apache

  1   2   3   4   5   6   7   8   9   10   >