[I] Allow schema fields to merge with `Null` datatype [arrow-rs]

2023-10-08 Thread via GitHub
kskalski opened a new issue, #4901: URL: https://github.com/apache/arrow-rs/issues/4901 **Is your feature request related to a problem or challenge? Please describe what you are trying to do.** Right now `DataType::Null` is considered incompatible with any data type other than `Null`, ho

Re: [PR] Minor: improve documentation to `stagger_batch` [arrow-datafusion]

2023-10-08 Thread via GitHub
Dandandan commented on PR #7754: URL: https://github.com/apache/arrow-datafusion/pull/7754#issuecomment-1752426208 Thanks @alamb -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific com

Re: [PR] Minor: improve documentation to `stagger_batch` [arrow-datafusion]

2023-10-08 Thread via GitHub
Dandandan merged PR #7754: URL: https://github.com/apache/arrow-datafusion/pull/7754

[PR] GH-38145: [Docs][Python] Add tzdata on Windows subsection in Python install docs [arrow]

2023-10-08 Thread via GitHub
AlenkaF opened a new pull request, #38146: URL: https://github.com/apache/arrow/pull/38146 ### What changes are included in this PR? The option to set a custom path to the timezone database through the Python API has been implemented in https://github.com/apache/arrow/issues/35600 and is docum

Re: [PR] GH-37002: [C++][Parquet] Add api to get RecordReader from RowGroupReader [arrow]

2023-10-08 Thread via GitHub
wgtmac commented on code in PR #37003: URL: https://github.com/apache/arrow/pull/37003#discussion_r1349858700 ## cpp/src/parquet/file_reader.h: ## @@ -58,6 +62,10 @@ class PARQUET_EXPORT RowGroupReader { // column. Ownership is shared with the RowGroupReader. std::shared_p

Re: [PR] GH-35623: [C++][Python] FixedShapeTensorType.ToString() should print the type's parameters [arrow]

2023-10-08 Thread via GitHub
AlenkaF commented on PR #36496: URL: https://github.com/apache/arrow/pull/36496#issuecomment-1752310300 Thank you all for the reviews and suggestions!

Re: [PR] GH-35623: [C++][Python] FixedShapeTensorType.ToString() should print the type's parameters [arrow]

2023-10-08 Thread via GitHub
AlenkaF merged PR #36496: URL: https://github.com/apache/arrow/pull/36496

Re: [PR] GH-38088: [R] Remove outdated references to brew and autobrew [arrow]

2023-10-08 Thread via GitHub
assignUser merged PR #38089: URL: https://github.com/apache/arrow/pull/38089

Re: [PR] GH-38088: [R] Remove outdated references to brew and autobrew [arrow]

2023-10-08 Thread via GitHub
assignUser commented on PR #38089: URL: https://github.com/apache/arrow/pull/38089#issuecomment-1752271688 I'll merge this so we get rid of the failing nightlies to reduce the noise for the release prep.

Re: [PR] GH-38079: [R] Allow override of version used to download libarrow static library [arrow]

2023-10-08 Thread via GitHub
assignUser commented on PR #38080: URL: https://github.com/apache/arrow/pull/38080#issuecomment-1752271130 I like the idea, but this will cause conflicts with #38115. As that is done (imo), I will start on the 'get latest nightly binary' feature now, where I can integrate this. For y

Re: [I] [R] Option for `$schema$code()` to explicitly use package name [arrow]

2023-10-08 Thread via GitHub
orgadish commented on issue #38033: URL: https://github.com/apache/arrow/issues/38033#issuecomment-1752264154 @thisisnic I've created a PR for this, but have not been able to load the package in my local RStudio. I get the following errors: ``` Error in dyn.load(dll_copy_file) :

Re: [PR] GH-38033: [R] Allow `code()` to return package name prefix. [arrow]

2023-10-08 Thread via GitHub
github-actions[bot] commented on PR #38144: URL: https://github.com/apache/arrow/pull/38144#issuecomment-1752263951 :warning: GitHub issue #38033 **has been automatically assigned in GitHub** to PR creator.

[PR] GH-38033: [R] Allow `code()` to return package name prefix. [arrow]

2023-10-08 Thread via GitHub
orgadish opened a new pull request, #38144: URL: https://github.com/apache/arrow/pull/38144 ### Rationale for this change #38033 ### What changes are included in this PR? - Added `add_pkg_name` helper. - Added `code_name` private method to `DataType` class to store the string

Re: [PR] Make parquet an option [arrow-datafusion]

2023-10-08 Thread via GitHub
ongchi commented on PR #7745: URL: https://github.com/apache/arrow-datafusion/pull/7745#issuecomment-1752261205 All checks in CI have passed with the current modification. I am going to spend some time to see if it's possible to extract some common logic on the file format.

Re: [PR] GH-38142: [R] Add NEWS for 14.0.0 [arrow]

2023-10-08 Thread via GitHub
github-actions[bot] commented on PR #38143: URL: https://github.com/apache/arrow/pull/38143#issuecomment-1752257957 :warning: GitHub issue #38142 **has been automatically assigned in GitHub** to PR creator.

[PR] GH-38142: [R] Add NEWS for 14.0.0 [arrow]

2023-10-08 Thread via GitHub
paleolimbot opened a new pull request, #38143: URL: https://github.com/apache/arrow/pull/38143 ### Rationale for this change The NEWS file needs updating for 14.0.0. ### What changes are included in this PR? The NEWS file is updated with commits since 13.0.0. #

Re: [PR] GH-38079: [R] Allow override of version used to download libarrow static library [arrow]

2023-10-08 Thread via GitHub
paleolimbot commented on PR #38080: URL: https://github.com/apache/arrow/pull/38080#issuecomment-1752243689 @assignUser is this fix worth merging? I do need this or something like it to debug the Windows issues we've been seeing. An alternative would be to go straight for `LIBARROW_BINARY_U

Re: [I] nulls in input to pc.case_when cause incorrect results when `len(cond) >~ 70` [arrow]

2023-10-08 Thread via GitHub
dannygoldstein commented on issue #38139: URL: https://github.com/apache/arrow/issues/38139#issuecomment-1752243332 This appears to be fixed in pyarrow 13

Re: [PR] GH-29184: [R] Read CSV with comma as decimal mark [arrow]

2023-10-08 Thread via GitHub
paleolimbot commented on code in PR #38002: URL: https://github.com/apache/arrow/pull/38002#discussion_r1349814104 ## r/R/csv.R: ## @@ -279,6 +280,31 @@ read_csv_arrow <- function(file, eval.parent(mc) } +#' @rdname read_delim_arrow +#' @export +read_csv2_arrow <- function

Re: [PR] GH-38138: [R] Add curl to suggests for use of `skip_if_offline()` [arrow]

2023-10-08 Thread via GitHub
github-actions[bot] commented on PR #38140: URL: https://github.com/apache/arrow/pull/38140#issuecomment-1752233825 :warning: GitHub issue #38138 **has been automatically assigned in GitHub** to PR creator.

[PR] GH-38138: [R] Add curl to suggests for use of `skip_if_offline()` [arrow]

2023-10-08 Thread via GitHub
paleolimbot opened a new pull request, #38140: URL: https://github.com/apache/arrow/pull/38140 ### Rationale for this change testthat now requires the curl package for `skip_if_offline()`. Previously tests were skipped; now the tests fail (although the message is somewhat bizarre).

Re: [PR] GH-38088: [R] Remove outdated references to brew and autobrew [arrow]

2023-10-08 Thread via GitHub
paleolimbot commented on PR #38089: URL: https://github.com/apache/arrow/pull/38089#issuecomment-1752225986 @assignUser could you give this a review? I don't believe any of these failures are new as a result of these changes.

Re: [PR] GH-37952: [C++] Make unique->shared explicit to fix build failure on at least one compiler [arrow]

2023-10-08 Thread via GitHub
github-actions[bot] commented on PR #38136: URL: https://github.com/apache/arrow/pull/38136#issuecomment-1752225504 Revision: 465b5af4fbc51c24e5d70334fafc41debbed8b6d Submitted crossbow builds: [ursacomputing/crossbow @ actions-ecf7a4c44e](https://github.com/ursacomputing/crossbow/bra

Re: [PR] GH-38071: [C++][CI] Fix Overlap column chunk ranges for pre-buffer [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #38073: URL: https://github.com/apache/arrow/pull/38073#issuecomment-1752224485 After merging your PR, Conbench analyzed the 4 benchmarking runs that have been run so far on merge-commit 0b9f81771b30c5ebdd215b28eb41889efa2d76ed. There were no

Re: [PR] GH-37952: [C++] Make unique->shared explicit to fix build failure on at least one compiler [arrow]

2023-10-08 Thread via GitHub
paleolimbot commented on PR #38136: URL: https://github.com/apache/arrow/pull/38136#issuecomment-1752224422 @github-actions crossbow submit test-r-rstudio-r-base-4.1-opensuse153

Re: [PR] GH-38090: [C++][Emscripten][Parquet] Suppress shorten-64-to-32 warnings [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #38129: URL: https://github.com/apache/arrow/pull/38129#issuecomment-1752200979 After merging your PR, Conbench analyzed the 6 benchmarking runs that have been run so far on merge-commit 60916fe658672bfc8330eaa488d960ae4bca1451. There were no

Re: [I] [R] Enable `col_select` or similar in `open_csv_dataset` to read files with a shared subset of columns [arrow]

2023-10-08 Thread via GitHub
orgadish commented on issue #38031: URL: https://github.com/apache/arrow/issues/38031#issuecomment-1752197296 @thisisnic I don't know if this was updated in a recent Arrow version, but it looks like what I want works now! Below is a reprex for it. `read_csv(col_select = ...)` actually

[PR] Minor: Remove unnecessary `#[cfg(feature = "avro")]` [arrow-datafusion]

2023-10-08 Thread via GitHub
sarutak opened a new pull request, #7773: URL: https://github.com/apache/arrow-datafusion/pull/7773 ## Rationale for this change In `datafusion/core/tests/sql/mod.rs`, `#[cfg(feature = "avro")]` is applied to `pub mod create_drop`, but it is not related to Avro. I guess this is a lefto

Re: [PR] GH-38071: [C++][CI] Fix Overlap column chunk ranges for pre-buffer [arrow]

2023-10-08 Thread via GitHub
kou merged PR #38073: URL: https://github.com/apache/arrow/pull/38073

Re: [PR] GH-38071: [C++][CI] Fix Overlap column chunk ranges for pre-buffer [arrow]

2023-10-08 Thread via GitHub
kou commented on PR #38073: URL: https://github.com/apache/arrow/pull/38073#issuecomment-1752168206 Passed. :-) I'll merge this.

Re: [I] [Go] Apache Arrow 12.0.1: use of deleted function 'std::atomic::atomic(const std::atomic&)' [arrow]

2023-10-08 Thread via GitHub
EXPEbdodla commented on issue #37442: URL: https://github.com/apache/arrow/issues/37442#issuecomment-1752162949 Thanks @zeroshade . I'm able to run locally on Mac with mallocator which was failing to build with CGO Allocator. Are there any performance benchmarks between CGOAllocator and Mal

Re: [PR] GH-38090: [C++][Emscripten][Parquet] Suppress shorten-64-to-32 warnings [arrow]

2023-10-08 Thread via GitHub
kou merged PR #38129: URL: https://github.com/apache/arrow/pull/38129

Re: [PR] GH-38077: [C++] Output bundled GoogleTest to ${BUILD_DIR}/${CONFIG} [arrow]

2023-10-08 Thread via GitHub
github-actions[bot] commented on PR #38132: URL: https://github.com/apache/arrow/pull/38132#issuecomment-1752160303 Revision: 859ecc9639336ce4a62797258e02ec81c668067a Submitted crossbow builds: [ursacomputing/crossbow @ actions-97d56254dd](https://github.com/ursacomputing/crossbow/bra

Re: [PR] GH-38077: [Java][CI][Windows] Set PATH to find GoogleTest [arrow]

2023-10-08 Thread via GitHub
kou commented on PR #38132: URL: https://github.com/apache/arrow/pull/38132#issuecomment-1752159855 @github-actions crossbow submit java-jars

Re: [PR] GH-37952: [C++] Make unique->shared explicit to fix build failure on at least one compiler [arrow]

2023-10-08 Thread via GitHub
kou commented on code in PR #38136: URL: https://github.com/apache/arrow/pull/38136#discussion_r1349773787 ## cpp/src/arrow/dataset/file_parquet.cc: ## @@ -462,7 +462,9 @@ Result> ParquetFileFormat::GetReader std::unique_ptr arrow_reader; RETURN_NOT_OK(parquet::arrow::Fil

Re: [PR] GH-37429: [C++] Add arrow::ipc::StreamDecoder::Reset() [arrow]

2023-10-08 Thread via GitHub
MrLolthe1st commented on PR #37970: URL: https://github.com/apache/arrow/pull/37970#issuecomment-1752149766 > Why do we need to care about `MessageReader`? This implementation re-creates `StreamDecoderImpl` instead of reusing existing `StreamDecoderImpl` (including `MessageReader`).

Re: [PR] feat(glib): add Vala VAPI for GADBC [arrow-adbc]

2023-10-08 Thread via GitHub
esodan commented on PR #1152: URL: https://github.com/apache/arrow-adbc/pull/1152#issuecomment-1752144017 I would like to add an example of how to access data from a SELECT statement. Could you guide me on how I can do that? I have a problem: GArrow.Statement.execute() returns an ou

Re: [PR] GH-38090: [C++][Emscripten] orc: Suppress shorten-64-to-32 warnings [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #38097: URL: https://github.com/apache/arrow/pull/38097#issuecomment-1752122117 After merging your PR, Conbench analyzed the 6 benchmarking runs that have been run so far on merge-commit 839137e53078953d4dc8d0d71d9c9a04996e24df. There was 1 b

Re: [PR] GH-38090: [C++][Emscripten] orc: Suppress shorten-64-to-32 warnings [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #38097: URL: https://github.com/apache/arrow/pull/38097#issuecomment-1752120955 After merging your PR, Conbench analyzed the 6 benchmarking runs that have been run so far on merge-commit 839137e53078953d4dc8d0d71d9c9a04996e24df. There was 1 b

Re: [I] Dynamically Determine the Number of Output Files based on Configs [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on issue #7767: URL: https://github.com/apache/arrow-datafusion/issues/7767#issuecomment-1752117633 > I would like to provide users with options such as the following which will determine the number of output files: > > Maximum rows per file > Maximum I agree

Re: [I] [C++] [Acero] Incorrect results in inner join [arrow]

2023-10-08 Thread via GitHub
llama90 commented on issue #38074: URL: https://github.com/apache/arrow/issues/38074#issuecomment-1752111924 It seems that the issue has been fixed. I will clean up the code, write unit tests, and aim to submit a PR as soon as possible. Thanks to your review, I was able to reproduce t

Re: [PR] feat: expose PyWindowFrame [arrow-datafusion-python]

2023-10-08 Thread via GitHub
andygrove commented on PR #509: URL: https://github.com/apache/arrow-datafusion-python/pull/509#issuecomment-1752110818 @dlovell Could you upmerge to pick up the clippy fix?

Re: [PR] add regr_* functions [arrow-datafusion-python]

2023-10-08 Thread via GitHub
andygrove commented on PR #499: URL: https://github.com/apache/arrow-datafusion-python/pull/499#issuecomment-1752110564 @jiangzhx Clippy tests are failing on this and your other PRs due to a new clippy rule in the latest Rust version. A fix has already been merged to main, so upmerging one

Re: [PR] Support InsertInto Sorted ListingTable [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on code in PR #7743: URL: https://github.com/apache/arrow-datafusion/pull/7743#discussion_r1349747256 ## datafusion/core/src/datasource/listing/table.rs: ## @@ -907,17 +907,19 @@ impl TableProvider for ListingTable { "Cannot insert into a sor

Re: [PR] Support InsertInto Sorted ListingTable [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb merged PR #7743: URL: https://github.com/apache/arrow-datafusion/pull/7743

Re: [I] Allow inserts to a sorted ListingTable [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb closed issue #7354: Allow inserts to a sorted ListingTable URL: https://github.com/apache/arrow-datafusion/issues/7354

Re: [PR] Support InsertInto Sorted ListingTable [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on PR #7743: URL: https://github.com/apache/arrow-datafusion/pull/7743#issuecomment-1752106345 > > But now that I look at that plan, perhaps the issue is that there is more than one file in each group, so the sort order can't be maintained 🤔 > > Perhaps if each output

Re: [PR] Updated sort.rs to show `TopK` [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on code in PR #7751: URL: https://github.com/apache/arrow-datafusion/pull/7751#discussion_r1349746527 ## datafusion/physical-plan/src/sorts/sort.rs: ## @@ -769,7 +769,7 @@ impl DisplayAs for SortExec { write!(

Re: [PR] Minor: Improve `PruningPredicate` documentation [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on PR #7738: URL: https://github.com/apache/arrow-datafusion/pull/7738#issuecomment-1752104901 Thank you @jackwener

Re: [I] Add `LogicalPlanBuilder::join_on` [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on issue #7766: URL: https://github.com/apache/arrow-datafusion/issues/7766#issuecomment-1752104812 Thank you @haohuaijin

Re: [I] TopK Fuzz Tests 🐝 [arrow-datafusion]

2023-10-08 Thread via GitHub
Tangruilin commented on issue #7749: URL: https://github.com/apache/arrow-datafusion/issues/7749#issuecomment-1752095464 > > > When I get the result with (string), (float), etc. columns, I found that [batches_to_vec](https://github.com/apache/arrow-datafusion/blob/main/test-utils/src/lib.rs#L31)

Re: [PR] GH-38060: [Python][CI] Upgrade Spark versions [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #38082: URL: https://github.com/apache/arrow/pull/38082#issuecomment-1752052141 After merging your PR, Conbench analyzed the 5 benchmarking runs that have been run so far on merge-commit 3697bcdd514fc5a0472dabd3928f6b4731554cda. There were no

Re: [I] TopK Fuzz Tests 🐝 [arrow-datafusion]

2023-10-08 Thread via GitHub
Tangruilin commented on issue #7749: URL: https://github.com/apache/arrow-datafusion/issues/7749#issuecomment-1752052104 > > When I get the result with (string), (float), etc. columns, I found that [batches_to_vec](https://github.com/apache/arrow-datafusion/blob/main/test-utils/src/lib.rs#L31)

Re: [I] Add `LogicalPlanBuilder::join_on` [arrow-datafusion]

2023-10-08 Thread via GitHub
haohuaijin commented on issue #7766: URL: https://github.com/apache/arrow-datafusion/issues/7766#issuecomment-1752034970 I want to do it.

Re: [I] TopK Fuzz Tests 🐝 [arrow-datafusion]

2023-10-08 Thread via GitHub
alamb commented on issue #7749: URL: https://github.com/apache/arrow-datafusion/issues/7749#issuecomment-1752020153 > When I get the result with (string), (float), etc. columns, I found that [batches_to_vec](https://github.com/apache/arrow-datafusion/blob/main/test-utils/src/lib.rs#L31) can onl

Re: [PR] GH-38084: [R] Do not memory map when explicitly checking for file removal [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #38085: URL: https://github.com/apache/arrow/pull/38085#issuecomment-1752016117 After merging your PR, Conbench analyzed the 6 benchmarking runs that have been run so far on merge-commit f525b99b0fa0c49da656b0f45c4a0689934621d5. There was 1 b

[PR] [test] add fuzz test for topk [arrow-datafusion]

2023-10-08 Thread via GitHub
Tangruilin opened a new pull request, #7772: URL: https://github.com/apache/arrow-datafusion/pull/7772 ## Which issue does this PR close? Closes #7749. ## Rationale for this change Add fuzz test for topk ## What changes are included in this PR? 1. This PR adds i32 type

Re: [PR] Add operator section to user guide [arrow-datafusion]

2023-10-08 Thread via GitHub
ongchi commented on PR #7732: URL: https://github.com/apache/arrow-datafusion/pull/7732#issuecomment-1752012139 > Another thing that comes to mind: if we support both syntaxes, we should probably have some notification when a new operator is introduced. It might be a common trait that will require

Re: [PR] add interval arithmetic for timestamp types [arrow-datafusion]

2023-10-08 Thread via GitHub
berkaysynnada commented on PR #7758: URL: https://github.com/apache/arrow-datafusion/pull/7758#issuecomment-1752011933 I've observed that the problem is in the `new_zero()` function while defining the target node for timestamps. https://github.com/apache/arrow-datafusion/blob/3d1b23a04bdc04c

Re: [I] TopK Fuzz Tests 🐝 [arrow-datafusion]

2023-10-08 Thread via GitHub
Tangruilin commented on issue #7749: URL: https://github.com/apache/arrow-datafusion/issues/7749#issuecomment-1752011519 @alamb maybe I can add an enum to support other types? If that is OK, I will do it.

Re: [I] TopK Fuzz Tests 🐝 [arrow-datafusion]

2023-10-08 Thread via GitHub
Tangruilin commented on issue #7749: URL: https://github.com/apache/arrow-datafusion/issues/7749#issuecomment-1752011211 When I get the result with (string), (float), etc. columns, I found that [batches_to_vec](https://github.com/apache/arrow-datafusion/blob/main/test-utils/src/lib.rs#L31) can

Re: [PR] GH-37730: [C#] throw OverflowException in DecimalUtility if fractionalPart is too large [arrow]

2023-10-08 Thread via GitHub
conbench-apache-arrow[bot] commented on PR #37731: URL: https://github.com/apache/arrow/pull/37731#issuecomment-1751977073 After merging your PR, Conbench analyzed the 5 benchmarking runs that have been run so far on merge-commit 629ecbd8fd62173b62758282ca113a331940be4f. There was 1 b

[PR] Minor: make tests deterministic [arrow-datafusion]

2023-10-08 Thread via GitHub
Weijun-H opened a new pull request, #7771: URL: https://github.com/apache/arrow-datafusion/pull/7771 ## Which issue does this PR close? Closes #7764 ## Rationale for this change ## What changes are included in this PR? ## Are these changes t

Re: [PR] GH-37753: [C++][Gandiva] Add external function registry support [arrow]

2023-10-08 Thread via GitHub
niyue commented on PR #38116: URL: https://github.com/apache/arrow/pull/38116#issuecomment-1751963794 @kou this PR is ready for review; the failing CI checks don't seem to be related to my change. Could you please help? Thanks.

Re: [PR] Minor: Improve `PruningPredicate` documentation [arrow-datafusion]

2023-10-08 Thread via GitHub
jackwener merged PR #7738: URL: https://github.com/apache/arrow-datafusion/pull/7738

Re: [PR] Minor: Change all file links to GitHub in document [arrow-datafusion]

2023-10-08 Thread via GitHub
jackwener merged PR #7768: URL: https://github.com/apache/arrow-datafusion/pull/7768

Re: [PR] Move nested union optimization from plan builder to logical optimizer [arrow-datafusion]

2023-10-08 Thread via GitHub
jackwener commented on code in PR #7695: URL: https://github.com/apache/arrow-datafusion/pull/7695#discussion_r1349639806 ## datafusion/optimizer/src/eliminate_nested_union.rs: ## @@ -0,0 +1,214 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contri

Re: [PR] Move nested union optimization from plan builder to logical optimizer [arrow-datafusion]

2023-10-08 Thread via GitHub
jackwener commented on code in PR #7695: URL: https://github.com/apache/arrow-datafusion/pull/7695#discussion_r1349639782 ## datafusion/optimizer/src/eliminate_nested_union.rs: ## @@ -0,0 +1,214 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contri