Messages by Thread
-
-
Re: [I] [datafusion-spark] parse_url incompatibilities [datafusion]
via GitHub
-
Re: [I] `arrays_except` type mismatch [datafusion-comet]
via GitHub
-
Re: [PR] fix(spark): align parse_url empty FILE path [datafusion]
via GitHub
-
[I] Refactor scalar min/max dispatch from macro arms to function-based logic [datafusion]
via GitHub
-
[PR] Refactor unary filter pushdown logic for Aggregate and Window [datafusion]
via GitHub
-
[PR] feat(parquet): sub-column projection pushdown for struct fields [datafusion]
via GitHub
-
[PR] Add eager group join strategy [datafusion]
via GitHub
-
[PR] fix(spark): ANSI-aware custom nullability for `make_interval` [datafusion]
via GitHub
-
[I] [Optimizer] Consolidate repeated filter-rebuild patterns in PushDownFilter [datafusion]
via GitHub
-
[PR] fix(spark): implement ANSI-aware custom nullability for make_interval [datafusion]
via GitHub
-
[PR] chore(deps): bump ctor from 0.12.0 to 1.0.2 [datafusion-ballista]
via GitHub
-
[PR] chore(deps): bump tonic-build from 0.14.5 to 0.14.6 [datafusion-ballista]
via GitHub
-
[PR] chore(deps): bump tonic from 0.14.5 to 0.14.6 [datafusion-ballista]
via GitHub
-
Re: [I] Make PartitionedFile extensions a type map [datafusion]
via GitHub
-
Re: [PR] fix: dedup commutative AND/OR operands when canonicalize is disabled [datafusion]
via GitHub
-
Re: [PR] Allow benchmark allocator features together [datafusion]
via GitHub
-
Re: [PR] Make PartitionFile extensions a type map [datafusion]
via GitHub
-
Re: [PR] feat: support Spark unbase64 expression [datafusion-comet]
via GitHub
-
Re: [PR] feat: support stateless physical plans [datafusion]
via GitHub
-
Re: [PR] Fix distribution enforcement for chained aggregates to avoid SanityCheckPlan failures [datafusion]
via GitHub
-
Re: [PR] feat: Ensure catalog format in config is valid [datafusion]
via GitHub
-
Re: [PR] Enable parquet filter pushdown by default [datafusion]
via GitHub
-
Re: [PR] Use arrow pool to fix memory over accounting in aggregations [datafusion]
via GitHub
-
Re: [PR] metrics: added more metrics for multi-column group by [datafusion]
via GitHub
-
Re: [PR] perf: add prefetching for aggregate multi group by [WIP] [datafusion]
via GitHub
-
Re: [PR] Co-locate function parameter names and types with `Signature::with_parameters` [datafusion]
via GitHub
-
Re: [PR] feat(spark): add `arrays_overlap` with Spark three-valued null semantics [datafusion]
via GitHub
-
[PR] Support `IS (NOT) DISTINCT FROM` in Unparser [datafusion]
via GitHub
-
[I] Unparser broken with `IS DISTINCT FROM` statements [datafusion]
via GitHub
-
Re: [PR] [Minor]: unify ANY/ALL planning and align ANY NULL semantics with PG [datafusion]
via GitHub
-
[PR] feat: implement make_time and to_time [datafusion-comet]
via GitHub
-
[PR] fix: use datafusion_expr instead of datafusion crate [datafusion]
via GitHub
-
Re: [I] Create `spark-latest` profile [datafusion-comet]
via GitHub
-
[I] NTILE returns wrong result [datafusion-comet]
via GitHub
-
[PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
Re: [PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
Re: [PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
Re: [PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
Re: [PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
Re: [PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
Re: [PR] feat: add GroupsAccumulator for variance, stddev, covariance, correlation [datafusion-comet]
via GitHub
-
[PR] fix: re-enable tests skipped for Spark 4.1 (issue #4098) [datafusion-comet]
via GitHub
-
[PR] chore: remove legacy ENABLE_COMET_SCAN_ONLY and ENABLE_COMET_ANSI_MODE env vars from Spark diffs [datafusion-comet]
via GitHub
-
[PR] feat: accelerate Iceberg RewriteDataFiles reads via Comet native scan [datafusion-comet]
via GitHub
-
[I] Support Iceberg "Rewrite Data Files Procedure" [datafusion-comet]
via GitHub
-
Re: [PR] feat: support Spark base64 expression [datafusion-comet]
via GitHub
-
Re: [PR] feat: reduce shuffle format overhead with one IPC stream per partition [experimental] [datafusion-comet]
via GitHub
-
Re: [PR] feat: replace custom shuffle block format with Arrow IPC streams [datafusion-comet]
via GitHub
-
Re: [PR] [datafusion-spark] Support 2-argument ceil(value, scale) [datafusion]
via GitHub
-
[I] Improve performance of `corr` and `covar` [datafusion-comet]
via GitHub
-
Re: [I] Improve performance of Spark-compatible decimal aggregates [datafusion-comet]
via GitHub
-
Re: [I] Improve performance of standard deviation aggregate [datafusion-comet]
via GitHub
-
Re: [I] Spark 4.1 NullType parquet: parquet-rs rejects BOOLEAN + Unknown logical type [datafusion-comet]
via GitHub
-
Re: [I] Tracking: remaining Spark 4.1 CI failures [datafusion-comet]
via GitHub
-
Re: [I] bug: Median() truncates integers [datafusion]
via GitHub
-
Re: [I] feat: support `cast StructType to String` when duplicated column in the struct [datafusion-comet]
via GitHub
-
Re: [I] [EPIC] Iceberg compatibility fixes [datafusion-comet]
via GitHub
-
Re: [I] [iceberg] Tracking PR for deleted rows support [datafusion-comet]
via GitHub
-
Re: [I] Implement tests for casting complex types [datafusion-comet]
via GitHub
-
[I] Epic: Test coverage gaps for complex-type casts [datafusion-comet]
via GitHub
-
Re: [I] [iceberg] Tracking PR to update useDecimal128 in Iceberg [datafusion-comet]
via GitHub
-
Re: [I] [iceberg] Tracking PR to update Iceberg for Parquet shading issues [datafusion-comet]
via GitHub
-
Re: [I] [iceberg] Tracking PR to update Iceberg to enable Comet native execution with Iceberg [datafusion-comet]
via GitHub
-
Re: [I] Improve performance of native scan [datafusion-comet]
via GitHub
-
Re: [I] Improve integration of hadoop s3a and comet [datafusion-comet]
via GitHub
-
Re: [I] Type widening for unsigned type [datafusion-comet]
via GitHub
-
Re: [I] How can Comet be enabled by default without needing to configure memory? [datafusion-comet]
via GitHub
-
Re: [I] Suggestion: Move to SBT for faster development cycle? [datafusion-comet]
via GitHub
-
Re: [I] Implement support for MacOS in release binaries [datafusion-comet]
via GitHub
-
Re: [I] Support Connection through Arrow Flight RPC / ADBC [datafusion-comet]
via GitHub
-
Re: [I] Native PhysicalPlanner extension points [datafusion-comet]
via GitHub
-
Re: [I] [EPIC] Add Spark expression coverage [datafusion-comet]
via GitHub
-
Re: [I] [Feature] Support Spark expression: current_date [datafusion-comet]
via GitHub
-
Re: [I] [Feature] Support Spark expression: unix_date [datafusion-comet]
via GitHub
-
Re: [I] Use maven-assembly-plugin to set final artifact name [datafusion-comet]
via GitHub
-
Re: [I] [CometFuzz] Add support for scalar arguments, with custom generator for valid values [datafusion-comet]
via GitHub
-
Re: [I] [CometFuzz] Avoid OOM with functions like rpad and array_repeat [datafusion-comet]
via GitHub
-
Re: [I] [CometFuzz] Automate keeping function signatures up-to-date with Spark [datafusion-comet]
via GitHub
-
Re: [I] [CometFuzz] Show summary of test coverage [datafusion-comet]
via GitHub
-
Re: [I] Review experiemental status of array functions [datafusion-comet]
via GitHub
-
Re: [I] Add support for native hash join with LeftAnti + BuildRight [datafusion-comet]
via GitHub
-
Re: [I] Improve documentation for supported data sources [datafusion-comet]
via GitHub
-
Re: [I] Make cast from float/double to decimal compatible with Spark [datafusion-comet]
via GitHub
-
Re: [I] Update Maven plugin versions to match Spark 3.5 [datafusion-comet]
via GitHub
-
Re: [I] Update fuzz testing tool to support testing with Comet + Iceberg integration [datafusion-comet]
via GitHub
-
Re: [I] Standardize on import order in Rust code [datafusion-comet]
via GitHub
-
Re: [I] Implement withField and dropField for struct types [datafusion-comet]
via GitHub
-
Re: [I] Create integration tests that can run against a Spark cluster [datafusion-comet]
via GitHub
-
Re: [I] SparkCastOptions should represent timezone as Option<Tz> instead of String [datafusion-comet]
via GitHub
-
Re: [I] Optimize filters to remove redundant IsNotNull checks [datafusion-comet]
via GitHub
-
Re: [I] Provide pip install for PySpark users [datafusion-comet]
via GitHub
-
Re: [I] Improve performance of broadcast hash join [datafusion-comet]
via GitHub
-
Re: [I] Push projections into hash joins [datafusion-comet]
via GitHub
-
Re: [I] Improve performance of BloomFilterMightContain [datafusion-comet]
via GitHub
-
Re: [I] Unsupported OS/arch [datafusion-comet]
via GitHub
-
Re: [I] [EPIC] Add support for all array expressions [datafusion-comet]
via GitHub
-
Re: [I] Add Scala & Python support to benchmarking [datafusion-comet]
via GitHub