Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
MrGranday commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2303040045 ## datafusion/core/src/physical_planner.rs: ## @@ -946,6 +947,18 @@ impl DefaultPhysicalPlanner { )) } Review Comment: Makes

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
MrGranday commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2303038099 ## datafusion/core/src/dataframe/mod.rs: ## @@ -232,19 +242,31 @@ pub struct DataFrame { projection_requires_validation: bool, } -impl DataFrame { -/

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
MrGranday commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2303036694 ## datafusion/core/src/dataframe/mod.rs: ## @@ -232,19 +242,31 @@ pub struct DataFrame { projection_requires_validation: bool, } -impl DataFrame { -/

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
MrGranday commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2303035284 ## datafusion/core/src/dataframe/mod.rs: ## @@ -232,19 +242,31 @@ pub struct DataFrame { projection_requires_validation: bool, } -impl DataFrame { -/

Re: [PR] DRAFT Expose and generalize cast_column to enable struct → struct casting in more contexts [datafusion]

2025-08-26 Thread via GitHub
kosiew commented on code in PR #17281: URL: https://github.com/apache/datafusion/pull/17281#discussion_r2303007475 ## datafusion/common/src/nested_struct.rs: ## @@ -159,14 +200,23 @@ pub fn cast_column(source_col: &ArrayRef, target_field: &Field) -> Result Result { +) -> Result

Re: [PR] DRAFT Expose and generalize cast_column to enable struct → struct casting in more contexts [datafusion]

2025-08-26 Thread via GitHub
kosiew commented on code in PR #17281: URL: https://github.com/apache/datafusion/pull/17281#discussion_r2303006583 ## datafusion/expr-common/src/columnar_value.rs: ## @@ -210,9 +210,22 @@ impl ColumnarValue { ) -> Result { let cast_options = cast_options.cloned().

Re: [PR] Unnest Correlated Subquery [datafusion]

2025-08-26 Thread via GitHub
duongcongtoai commented on PR #17110: URL: https://github.com/apache/datafusion/pull/17110#issuecomment-3226655728 duckdb has a notion of `PropagatesNullValues`, i wonder if we have some thing similar ``` impl ExprSchemable for Expr { fn nullable(&self, input_schema: &dyn ExprSc

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
rishvin commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2302726421 ## native/spark-expr/src/map_funcs/map_filter.rs: ## @@ -0,0 +1,539 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor l

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
rishvin commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2302719310 ## spark/src/main/scala/org/apache/comet/serde/maps.scala: ## @@ -89,3 +89,21 @@ object CometMapFromArrays extends CometExpressionSerde[MapFromArrays] {

Re: [PR] Fix: common_sub_expression_eliminate optimizer rule failed [datafusion]

2025-08-26 Thread via GitHub
alamb commented on PR #16066: URL: https://github.com/apache/datafusion/pull/16066#issuecomment-3226515095 🤖: Benchmark completed Details ``` group fix_common_subexpression_eliminate main -

Re: [PR] feat: rpad support column for second arg instead of just literal [datafusion-comet]

2025-08-26 Thread via GitHub
coderfender commented on code in PR #2099: URL: https://github.com/apache/datafusion-comet/pull/2099#discussion_r2302668961 ## native/spark-expr/src/static_invoke/char_varchar_utils/read_side_padding.rs: ## @@ -71,44 +100,78 @@ fn spark_read_side_padding2( } } +enum RPad

Re: [PR] feat: Reset data buf of NativeBatchDecoderIterator on close [datafusion-comet]

2025-08-26 Thread via GitHub
wForget commented on code in PR #2235: URL: https://github.com/apache/datafusion-comet/pull/2235#discussion_r2302562911 ## spark/src/main/scala/org/apache/spark/sql/comet/execution/shuffle/NativeBatchDecoderIterator.scala: ## @@ -192,4 +193,8 @@ object NativeBatchDecoderIterator

Re: [I] Add support for SortAggregateExec [datafusion-comet]

2025-08-26 Thread via GitHub
rishvin commented on issue #1994: URL: https://github.com/apache/datafusion-comet/issues/1994#issuecomment-3226301493 I would like to explore this. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go t

Re: [PR] chore: Improve documentation for `CometBatchIterator` and fix a potential issue [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove merged PR #2168: URL: https://github.com/apache/datafusion-comet/pull/2168 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

Re: [PR] chore: Update stability suite to use `auto` scan instead of `native_comet` [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove merged PR #2178: URL: https://github.com/apache/datafusion-comet/pull/2178 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

Re: [PR] feat: Improve fallback mechanism for ANSI mode [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2211: URL: https://github.com/apache/datafusion-comet/pull/2211#discussion_r2302398942 ## dev/diffs/3.4.3.diff: ## @@ -894,6 +894,19 @@ index 525d97e4998..8a3e7457618 100644 AccumulatorSuite.verifyPeakExecutionMemorySet(sparkContext, "ext

Re: [PR] feat: Improve fallback mechanism for ANSI mode [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2211: URL: https://github.com/apache/datafusion-comet/pull/2211#discussion_r2302398587 ## spark/src/main/scala/org/apache/comet/serde/QueryPlanSerde.scala: ## @@ -2368,13 +2368,13 @@ object QueryPlanSerde extends Logging with CometExprShim {

Re: [PR] feat(spark): implement Spark conditional function if [datafusion]

2025-08-26 Thread via GitHub
shehabgamin commented on code in PR #16946: URL: https://github.com/apache/datafusion/pull/16946#discussion_r2302373800 ## datafusion/spark/src/function/conditional/if.rs: ## @@ -0,0 +1,101 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

Re: [PR] Fix: ListingTableFactory hive column detection [datafusion]

2025-08-26 Thread via GitHub
BlakeOrth commented on PR #17232: URL: https://github.com/apache/datafusion/pull/17232#issuecomment-3226017592 @alamb @nuno-faria I believe I touched all the asks from the previous round of feedback with the latest commit. I referenced a recent PR by @nuno-faria where the parquet metadata c

Re: [PR] MINOR: add missing examples to example list [datafusion]

2025-08-26 Thread via GitHub
waynexia merged PR #17333: URL: https://github.com/apache/datafusion/pull/17333 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] feat: Improve fallback mechanism for ANSI mode [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2211: URL: https://github.com/apache/datafusion-comet/pull/2211#discussion_r2302355630 ## dev/diffs/3.4.3.diff: ## @@ -894,6 +894,19 @@ index 525d97e4998..8a3e7457618 100644 AccumulatorSuite.verifyPeakExecutionMemorySet(sparkContext, "ext

Re: [I] [native_iceberg_compat] `VariantShreddingSuite` test failures with Spark 4.0.0 [datafusion-comet]

2025-08-26 Thread via GitHub
parthchandra commented on issue #2209: URL: https://github.com/apache/datafusion-comet/issues/2209#issuecomment-3225999521 > For native_iceberg_compat the Comet doesnt use the SparkSchemaAdapter where most of cases already handled Au contraire https://github.com/apache/datafusio

Re: [PR] MINOR: add missing custom_file_casts example to the list [datafusion]

2025-08-26 Thread via GitHub
waynexia commented on PR #17333: URL: https://github.com/apache/datafusion/pull/17333#issuecomment-3226013554 > Thanks! Would you like to add [`main`/datafusion-examples/examples/default_column_values.rs](https://github.com/apache/datafusion/blob/main/datafusion-examples/examples/default_col

Re: [PR] feat: Improve fallback mechanism for ANSI mode [datafusion-comet]

2025-08-26 Thread via GitHub
kazuyukitanimura commented on code in PR #2211: URL: https://github.com/apache/datafusion-comet/pull/2211#discussion_r2302340455 ## dev/diffs/3.4.3.diff: ## @@ -894,6 +894,19 @@ index 525d97e4998..8a3e7457618 100644 AccumulatorSuite.verifyPeakExecutionMemorySet(sparkContex

Re: [I] [native_iceberg_compat] `VariantShreddingSuite` test failures with Spark 4.0.0 [datafusion-comet]

2025-08-26 Thread via GitHub
parthchandra commented on issue #2209: URL: https://github.com/apache/datafusion-comet/issues/2209#issuecomment-3226003226 It might be a while till we get to it. arrow-rs has a bunch of issues outstanding for variant. Not sure where we are with the support in parquet. -- This is an auto

Re: [I] [ANSI] Include original SQL in error messages [datafusion-comet]

2025-08-26 Thread via GitHub
kazuyukitanimura commented on issue #2215: URL: https://github.com/apache/datafusion-comet/issues/2215#issuecomment-3225998425 We should revert the diffs in https://github.com/apache/datafusion-comet/pull/2211 once this gets resovled -- This is an automated message from the Apache Git Se

Re: [PR] MINOR: add missing custom_file_casts example to the list [datafusion]

2025-08-26 Thread via GitHub
adriangb commented on PR #17333: URL: https://github.com/apache/datafusion/pull/17333#issuecomment-3225945382 Thanks! Would you like to add https://github.com/apache/datafusion/blob/main/datafusion-examples/examples/default_column_values.rs while we're at it in this or a followup PR? --

Re: [PR] chore: Improve documentation for `CometBatchIterator` and fix a potential issue [datafusion-comet]

2025-08-26 Thread via GitHub
parthchandra commented on code in PR #2168: URL: https://github.com/apache/datafusion-comet/pull/2168#discussion_r2302291779 ## spark/src/main/java/org/apache/comet/CometBatchIterator.java: ## @@ -26,13 +26,45 @@ import org.apache.comet.vector.NativeUtil; /** - * An iterator

Re: [PR] chore: Improve documentation for `CometBatchIterator` and fix a potential issue [datafusion-comet]

2025-08-26 Thread via GitHub
parthchandra commented on code in PR #2168: URL: https://github.com/apache/datafusion-comet/pull/2168#discussion_r2302290257 ## spark/src/main/java/org/apache/comet/CometBatchIterator.java: ## @@ -26,13 +26,45 @@ import org.apache.comet.vector.NativeUtil; /** - * An iterator

Re: [PR] MINOR: add missing custom_file_casts example to the list [datafusion]

2025-08-26 Thread via GitHub
waynexia commented on PR #17333: URL: https://github.com/apache/datafusion/pull/17333#issuecomment-3225903005 cc @adriangb -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[PR] MINOR: add missing custom_file_casts example to the list [datafusion]

2025-08-26 Thread via GitHub
waynexia opened a new pull request, #17333: URL: https://github.com/apache/datafusion/pull/17333 ## Which issue does this PR close? - Closes #. ## Rationale for this change Come across that example is not listed in readme ## What changes are include

Re: [PR] chore: Add spark compatible `MapSort` function along with limited support for grouping on Map type [datafusion-comet]

2025-08-26 Thread via GitHub
rishvin commented on PR #2221: URL: https://github.com/apache/datafusion-comet/pull/2221#issuecomment-3225869863 [PR Build (macOS) / macos-14/Spark 3.4, JDK 11, Scala 2.12 [exec] (pull_request)](https://github.com/apache/datafusion-comet/actions/runs/17228255188/job/48948193988?pr=2221)

Re: [PR] feat: Reset data buf of NativeBatchDecoderIterator on close [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2235: URL: https://github.com/apache/datafusion-comet/pull/2235#discussion_r2302221827 ## spark/src/main/scala/org/apache/spark/sql/comet/execution/shuffle/NativeBatchDecoderIterator.scala: ## @@ -192,4 +193,8 @@ object NativeBatchDecoderIterat

[PR] minor: make shuffle exec display consistent [datafusion-ballista]

2025-08-26 Thread via GitHub
milenkovicm opened a new pull request, #1299: URL: https://github.com/apache/datafusion-ballista/pull/1299 # Which issue does this PR close? Closes #. # Rationale for this change Make `DisplayAs` consistent across Shuffle*Exec # What changes are included in this P

Re: [I] `array_intersect` test fails when default scan is `auto` [datafusion-comet]

2025-08-26 Thread via GitHub
comphead commented on issue #2174: URL: https://github.com/apache/datafusion-comet/issues/2174#issuecomment-3225711921 [test.zip](https://github.com/user-attachments/files/21995757/test.zip) -- This is an automated message from the Apache Git Service. To respond to the message, please log

Re: [I] `array_intersect` test fails when default scan is `auto` [datafusion-comet]

2025-08-26 Thread via GitHub
comphead commented on issue #2174: URL: https://github.com/apache/datafusion-comet/issues/2174#issuecomment-3225710788 the issue is likely connected to parquet file reader, I can reproduce with specific parquet created by `makeParquetFileAllPrimitiveTypes`. The same data and data types wri

Re: [PR] feat: support `Utf8View` for more args of `regexp_replace` [datafusion]

2025-08-26 Thread via GitHub
mbutrovich commented on PR #17195: URL: https://github.com/apache/datafusion/pull/17195#issuecomment-3225676497 I haven't forgotten about this. I need to generate a new .slt file for the relevant tests. -- This is an automated message from the Apache Git Service. To respond to the message

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2302046403 ## datafusion/datasource-json/src/file_format.rs: ## @@ -251,16 +251,16 @@ impl FileFormat for JsonFormat { async fn create_physical_plan( &s

Re: [PR] feat: consistent hash scheduling implemented as `DistributionPolicy` [datafusion-ballista]

2025-08-26 Thread via GitHub
milenkovicm closed pull request #1272: feat: consistent hash scheduling implemented as `DistributionPolicy` URL: https://github.com/apache/datafusion-ballista/pull/1272 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

Re: [PR] chore: Refactor `Cast` serde [WIP] [datafusion-comet]

2025-08-26 Thread via GitHub
codecov-commenter commented on PR #2242: URL: https://github.com/apache/datafusion-comet/pull/2242#issuecomment-3225622361 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/2242?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2302051640 ## datafusion/datasource-parquet/src/source.rs: ## @@ -580,12 +616,12 @@ impl FileSource for ParquetSource { self } -fn metrics(&self)

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2302043230 ## datafusion-examples/examples/parquet_exec_visitor.rs: ## @@ -98,9 +98,11 @@ impl ExecutionPlanVisitor for ParquetExecVisitor { fn pre_visit(&mut se

Re: [PR] fix: separate type checking for CometExchange and CometColumnarExchange [datafusion-comet]

2025-08-26 Thread via GitHub
codecov-commenter commented on PR #2241: URL: https://github.com/apache/datafusion-comet/pull/2241#issuecomment-3225582066 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/2241?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

[PR] chore: Refactor `Cast` serde [WIP] [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove opened a new pull request, #2242: URL: https://github.com/apache/datafusion-comet/pull/2242 ## Which issue does this PR close? Closes https://github.com/apache/datafusion-comet/issues/2239 Builds on https://github.com/apache/datafusion-comet/pull/2211 ##

[PR] Add PostgreSQL `CREATE USER` and `ALTER USER` support [datafusion-sqlparser-rs]

2025-08-26 Thread via GitHub
ramnes opened a new pull request, #2015: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/2015 Currently, the library only supports `CREATE ROLE` and `ALTER ROLE` for PostgreSQL. `CREATE USER` and `ALTER USER` fail to parse with errors like `"Expected: an object type after CR

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301993969 ## datafusion/datasource/src/file.rs: ## @@ -270,8 +293,29 @@ impl DataSource for T { SchedulingType::Cooperative } -fn data_source_sta

Re: [PR] fix: separate type checking for CometExchange and CometColumnarExchange [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2241: URL: https://github.com/apache/datafusion-comet/pull/2241#discussion_r2301931560 ## spark/src/main/scala/org/apache/comet/rules/CometExecRule.scala: ## @@ -849,6 +891,41 @@ case class CometExecRule(session: SparkSession) extends Rule[Spa

Re: [PR] Snowflake: Minus char in stage name [datafusion-sqlparser-rs]

2025-08-26 Thread via GitHub
iffyio merged PR #2014: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/2014 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr

Re: [PR] fix: separate type checking for CometExchange and CometColumnarExchange [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2241: URL: https://github.com/apache/datafusion-comet/pull/2241#discussion_r2301927516 ## spark/src/main/scala/org/apache/comet/rules/CometExecRule.scala: ## @@ -849,6 +891,41 @@ case class CometExecRule(session: SparkSession) extends Rule[Spa

Re: [PR] Add support for PostgreSQL JSON function 'RETURNING' clauses [datafusion-sqlparser-rs]

2025-08-26 Thread via GitHub
iffyio merged PR #2001: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/2001 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr

Re: [I] Support Postgres' `JSON_OBJECT` function `RETURNING` clause [datafusion-sqlparser-rs]

2025-08-26 Thread via GitHub
iffyio closed issue #2000: Support Postgres' `JSON_OBJECT` function `RETURNING` clause URL: https://github.com/apache/datafusion-sqlparser-rs/issues/2000 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] Add SECURE keyword for views in Snowflake [datafusion-sqlparser-rs]

2025-08-26 Thread via GitHub
iffyio merged PR #2004: URL: https://github.com/apache/datafusion-sqlparser-rs/pull/2004 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr

[PR] fix: separate type checking for CometExchange and CometColumnarExchange [datafusion-comet]

2025-08-26 Thread via GitHub
mbutrovich opened a new pull request, #2241: URL: https://github.com/apache/datafusion-comet/pull/2241 ## Which issue does this PR close? Partially addresses #2199. ## Rationale for this change CometExchange (native) and CometColumnarExchange support diffe

Re: [PR] chore: Improve expression fallback reporting [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove merged PR #2240: URL: https://github.com/apache/datafusion-comet/pull/2240 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@

Re: [PR] chore: Improve expression fallback reporting [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on PR #2240: URL: https://github.com/apache/datafusion-comet/pull/2240#issuecomment-3225401745 Thanks for the review @mbutrovich -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
milenkovicm commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2301349185 ## datafusion/core/src/dataframe/mod.rs: ## @@ -232,19 +242,31 @@ pub struct DataFrame { projection_requires_validation: bool, } -impl DataFrame { -

[PR] feat: make sql an optional feature [datafusion]

2025-08-26 Thread via GitHub
timsaucer opened a new pull request, #17332: URL: https://github.com/apache/datafusion/pull/17332 ## Which issue does this PR close? - Closes https://github.com/apache/datafusion/issues/15907 ## Rationale for this change For some users, adding in sqlparser greatly increas

Re: [I] Improve SHA2 user-error message in serdes code [datafusion-comet]

2025-08-26 Thread via GitHub
comphead closed issue #2193: Improve SHA2 user-error message in serdes code URL: https://github.com/apache/datafusion-comet/issues/2193 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific co

Re: [PR] fix: Make Sha2 fallback message more user-friendly [datafusion-comet]

2025-08-26 Thread via GitHub
comphead merged PR #2213: URL: https://github.com/apache/datafusion-comet/pull/2213 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@d

Re: [PR] chore: Improve expression fallback reporting [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on PR #2240: URL: https://github.com/apache/datafusion-comet/pull/2240#issuecomment-3225303195 > * Should we prefer "Comet" over "we" in comments? Yes, I can update this in the next PR (https://github.com/apache/datafusion-comet/pull/2211) > * Do we

Re: [PR] fix: UnnestExec preserves relevant equivalence properties of input [datafusion]

2025-08-26 Thread via GitHub
vegarsti commented on code in PR #16985: URL: https://github.com/apache/datafusion/pull/16985#discussion_r2301809113 ## datafusion/sqllogictest/test_files/unnest.slt: ## @@ -941,3 +941,33 @@ where min_height * width1 = ( ) 4 7 4 28 + +## Unnest with ordering on unrelated

Re: [PR] chore: move Round serde into object [datafusion-comet]

2025-08-26 Thread via GitHub
mbutrovich merged PR #2237: URL: https://github.com/apache/datafusion-comet/pull/2237 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...

Re: [PR] chore: Improve expression fallback reporting [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2240: URL: https://github.com/apache/datafusion-comet/pull/2240#discussion_r2301765516 ## spark/src/main/scala/org/apache/comet/expressions/CometCast.scala: ## @@ -21,16 +21,7 @@ package org.apache.comet.expressions import org.apache.spark.s

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301560549 ## datafusion/datasource/src/file.rs: ## @@ -50,31 +65,31 @@ pub fn as_file_source(source: T) -> Arc /// * [`ParquetSource`](https://docs.rs/datafusion/

Re: [PR] chore: Improve expression fallback reporting [datafusion-comet]

2025-08-26 Thread via GitHub
codecov-commenter commented on PR #2240: URL: https://github.com/apache/datafusion-comet/pull/2240#issuecomment-3225142264 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/2240?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [PR] Fix ambiguous column names in substrait conversion as a result of literals having the same name during conversion. [datafusion]

2025-08-26 Thread via GitHub
xanderbailey commented on code in PR #17299: URL: https://github.com/apache/datafusion/pull/17299#discussion_r2301668484 ## datafusion/substrait/src/logical_plan/consumer/rel/project_rel.rs: ## @@ -62,7 +62,18 @@ pub async fn from_project_rel( // to transform it

[PR] Support level [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove opened a new pull request, #2240: URL: https://github.com/apache/datafusion-comet/pull/2240 ## Which issue does this PR close? Part of https://github.com/apache/datafusion-comet/issues/2239 ## Rationale for this change We currently have two diffe

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
adriangb commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301621010 ## datafusion/datasource/src/source.rs: ## @@ -179,6 +179,10 @@ pub trait DataSource: Send + Sync + Debug { vec![PushedDown::No; filters.len()],

Re: [PR] chore: Add type parameter to IncompatExpr [datafusion-comet]

2025-08-26 Thread via GitHub
codecov-commenter commented on PR #2238: URL: https://github.com/apache/datafusion-comet/pull/2238#issuecomment-3225031407 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/2238?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
adriangb commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301619788 ## datafusion/datasource/src/file.rs: ## @@ -50,31 +65,31 @@ pub fn as_file_source(source: T) -> Arc /// * [`ParquetSource`](https://docs.rs/datafusion/latest/

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
adriangb commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301621710 ## datafusion/datasource/src/source.rs: ## @@ -465,18 +468,9 @@ impl DataSourceExec { /// /// Returns `None` if /// 1. the datasource is not scanni

[PR] chore: Add type parameter to IncompatExpr [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove opened a new pull request, #2238: URL: https://github.com/apache/datafusion-comet/pull/2238 ## Which issue does this PR close? N/A ## Rationale for this change This is a step towards making `Cast` serde more consistent with other expressions.

Re: [PR] chore: Add type parameter to IncompatExpr [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove closed pull request #2238: chore: Add type parameter to IncompatExpr URL: https://github.com/apache/datafusion-comet/pull/2238 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific c

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301593271 ## datafusion/datasource/src/source.rs: ## @@ -179,6 +179,10 @@ pub trait DataSource: Send + Sync + Debug { vec![PushedDown::No; filters.len()

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301591751 ## datafusion/datasource/src/file.rs: ## @@ -150,3 +165,193 @@ pub trait FileSource: Send + Sync { None } } + +impl DataSource for T { +

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
friendlymatthew commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301595761 ## datafusion/datasource/src/source.rs: ## @@ -465,18 +468,9 @@ impl DataSourceExec { /// /// Returns `None` if /// 1. the datasource is not

Re: [PR] Fix ambiguous column names in substrait conversion as a result of literals having the same name during conversion. [datafusion]

2025-08-26 Thread via GitHub
xanderbailey commented on code in PR #17299: URL: https://github.com/apache/datafusion/pull/17299#discussion_r2301590959 ## datafusion/substrait/src/logical_plan/consumer/rel/project_rel.rs: ## @@ -62,7 +62,18 @@ pub async fn from_project_rel( // to transform it

[I] Refactor `Cast` serde [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove opened a new issue, #2239: URL: https://github.com/apache/datafusion-comet/issues/2239 ### What is the problem the feature request solves? I would ike to refactor the serde logic for `Cast` so that it is more consistent with how we implement serde for other expressions.

Re: [PR] Redesign ownership model between `FileScanConfig` and `FileSource`s [datafusion]

2025-08-26 Thread via GitHub
adriangb commented on code in PR #17242: URL: https://github.com/apache/datafusion/pull/17242#discussion_r2301428037 ## datafusion/core/src/datasource/physical_plan/arrow_file.rs: ## @@ -49,23 +62,37 @@ impl From for Arc { } impl FileSource for ArrowSource { +fn config(&

Re: [PR] Fix ambiguous column names in substrait conversion as a result of literals having the same name during conversion. [datafusion]

2025-08-26 Thread via GitHub
LiaCastaneda commented on code in PR #17299: URL: https://github.com/apache/datafusion/pull/17299#discussion_r2301441295 ## datafusion/substrait/src/logical_plan/consumer/rel/project_rel.rs: ## @@ -62,7 +62,18 @@ pub async fn from_project_rel( // to transform it

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
codetyri0n commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2301332228 ## native/spark-expr/src/map_funcs/map_filter.rs: ## @@ -0,0 +1,539 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributo

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2301444756 ## spark/src/main/scala/org/apache/comet/serde/QueryPlanSerde.scala: ## @@ -1524,6 +1525,11 @@ object QueryPlanSerde extends Logging with CometExprShim {

Re: [PR] Fix ambiguous column names in substrait conversion as a result of literals having the same name during conversion. [datafusion]

2025-08-26 Thread via GitHub
LiaCastaneda commented on code in PR #17299: URL: https://github.com/apache/datafusion/pull/17299#discussion_r2301440518 ## datafusion/substrait/src/logical_plan/consumer/rel/project_rel.rs: ## @@ -62,7 +62,18 @@ pub async fn from_project_rel( // to transform it

Re: [PR] fix: Support aggregate expressions in `QUALIFY` [datafusion]

2025-08-26 Thread via GitHub
rkrishn7 commented on code in PR #17313: URL: https://github.com/apache/datafusion/pull/17313#discussion_r2301400501 ## datafusion/sql/src/select.rs: ## @@ -944,15 +963,41 @@ impl SqlToRel<'_, S> { check_columns_satisfy_exprs( &column_exprs_post_agg

Re: [PR] chore: move Round serde into object [datafusion-comet]

2025-08-26 Thread via GitHub
codecov-commenter commented on PR #2237: URL: https://github.com/apache/datafusion-comet/pull/2237#issuecomment-3224729565 ## [Codecov](https://app.codecov.io/gh/apache/datafusion-comet/pull/2237?dropdown=coverage&src=pr&el=h1&utm_medium=referral&utm_source=github&utm_content=comment&utm_ca

Re: [PR] fix: lazy case else evaluation [datafusion]

2025-08-26 Thread via GitHub
adriangb merged PR #17311: URL: https://github.com/apache/datafusion/pull/17311 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@dataf

Re: [PR] fix: Support aggregate expressions in `QUALIFY` [datafusion]

2025-08-26 Thread via GitHub
rkrishn7 commented on PR #17313: URL: https://github.com/apache/datafusion/pull/17313#issuecomment-3224713649 @Vedin @alamb What do y'all think about sequencing out the second issue mentioned in the original ticket? For the sake of smaller PRs. If it makes sense, happy to file a separ

Re: [I] `DataFrame.cache()` does not work in distributed environments [datafusion]

2025-08-26 Thread via GitHub
milenkovicm commented on issue #17297: URL: https://github.com/apache/datafusion/issues/17297#issuecomment-3224661378 @alamb if you have time it would be great to get your opinion about this issue as solving it may require some coding. -- This is an automated message from the Apache Git

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
milenkovicm commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2301352397 ## datafusion/core/src/dataframe/mod.rs: ## @@ -232,19 +242,31 @@ pub struct DataFrame { projection_requires_validation: bool, } -impl DataFrame { -

[PR] choire: move Round serde into object [datafusion-comet]

2025-08-26 Thread via GitHub
andygrove opened a new pull request, #2237: URL: https://github.com/apache/datafusion-comet/pull/2237 ## Which issue does this PR close? N/A ## Rationale for this change This is a step towards improving ANSI fallback. ## What changes are included in

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
milenkovicm commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2301347560 ## datafusion/core/src/physical_planner.rs: ## @@ -946,6 +947,18 @@ impl DefaultPhysicalPlanner { )) } Review Comment: I do

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
codetyri0n commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2301332228 ## native/spark-expr/src/map_funcs/map_filter.rs: ## @@ -0,0 +1,539 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributo

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
milenkovicm commented on code in PR #17314: URL: https://github.com/apache/datafusion/pull/17314#discussion_r2301340322 ## datafusion/core/src/dataframe/mod.rs: ## @@ -232,19 +242,31 @@ pub struct DataFrame { projection_requires_validation: bool, } -impl DataFrame { -

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
codetyri0n commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2301332228 ## native/spark-expr/src/map_funcs/map_filter.rs: ## @@ -0,0 +1,539 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributo

Re: [PR] feat: add configurable cache mode (local_cache) with LogicalPlan::Cache (#17297) [datafusion]

2025-08-26 Thread via GitHub
milenkovicm commented on PR #17314: URL: https://github.com/apache/datafusion/pull/17314#issuecomment-3224607188 is this PR work in progress @MrGranday ? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [PR] chore: add memory catalog test to handle table removal before schema deregistration [datafusion]

2025-08-26 Thread via GitHub
caicancai commented on code in PR #17307: URL: https://github.com/apache/datafusion/pull/17307#discussion_r2301114549 ## datafusion/core/tests/catalog/memory.rs: ## @@ -47,6 +47,20 @@ fn memory_catalog_dereg_nonempty_schema() { assert!(cat.deregister_schema("foo", true).unw

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
codetyri0n commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2301322614 ## native/spark-expr/src/map_funcs/map_filter.rs: ## @@ -0,0 +1,539 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributo

Re: [PR] chore: add memory catalog test to handle table removal before schema deregistration [datafusion]

2025-08-26 Thread via GitHub
caicancai commented on code in PR #17307: URL: https://github.com/apache/datafusion/pull/17307#discussion_r2301285539 ## datafusion/core/tests/catalog/memory.rs: ## @@ -47,6 +47,20 @@ fn memory_catalog_dereg_nonempty_schema() { assert!(cat.deregister_schema("foo", true).unw

Re: [PR] Feat : Bringing in support for map_filter expression. [datafusion-comet]

2025-08-26 Thread via GitHub
mbutrovich commented on code in PR #2236: URL: https://github.com/apache/datafusion-comet/pull/2236#discussion_r2301183180 ## native/spark-expr/src/map_funcs/map_filter.rs: ## @@ -0,0 +1,539 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributo

Re: [PR] chore: add memory catalog test to handle table removal before schema deregistration [datafusion]

2025-08-26 Thread via GitHub
caicancai commented on code in PR #17307: URL: https://github.com/apache/datafusion/pull/17307#discussion_r2301114549 ## datafusion/core/tests/catalog/memory.rs: ## @@ -47,6 +47,20 @@ fn memory_catalog_dereg_nonempty_schema() { assert!(cat.deregister_schema("foo", true).unw

  1   2   >