github
Messages by Date
2026/05/14
Re: [PR] feat: fix windows decimal casting frame [datafusion]
via GitHub
2026/05/14
Re: [PR] Add support for logical and physical codecs [datafusion-python]
via GitHub
2026/05/14
Re: [PR] Add support for logical and physical codecs [datafusion-python]
via GitHub
2026/05/14
Re: [PR] feat: Plumb Parquet virtual columns (row_number) through TableSchema and ParquetOpener [datafusion]
via GitHub
2026/05/14
Re: [PR] fix: Nested self-referential CASE chains should not cause exponential hashing work during physical planning. [datafusion]
via GitHub
2026/05/14
[PR] feat: disable Comet by default when CometShuffleManager is not registered [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] ci: use ubuntu-slim for lightweight jobs [datafusion-comet]
via GitHub
2026/05/14
Re: [I] ci: use ubuntu-slim where applicable [datafusion-comet]
via GitHub
2026/05/14
[I] Frequent CI failures for Spark 4.0.2 / JDK 21 [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] ci: use ubuntu-slim for lightweight jobs [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] ci: use ubuntu-slim for lightweight jobs [datafusion-comet]
via GitHub
2026/05/14
Re: [I] Add support for `size` expression [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] ci: use ubuntu-slim for lightweight jobs [datafusion-comet]
via GitHub
2026/05/14
Re: [I] Add support for `size` expression [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] ci: use ubuntu-slim for lightweight jobs [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] feat: Support Spark Expression Decode [datafusion-comet]
via GitHub
2026/05/14
Re: [I] Add support for scalar UDFs that operate on Arrow data [datafusion-comet]
via GitHub
2026/05/14
Re: [I] Add support for scalar UDFs that operate on Arrow data [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] feat: Plumb Parquet virtual columns (row_number) through TableSchema and ParquetOpener [datafusion]
via GitHub
2026/05/14
Re: [PR] ci: use ubuntu-slim for lightweight jobs [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] fix: Nested self-referential CASE chains should not cause exponential hashing work during physical planning. [datafusion]
via GitHub
2026/05/14
Re: [PR] Add support for logical and physical codecs [datafusion-python]
via GitHub
2026/05/14
[I] Upgrade workspace to Rust 1.95 [datafusion]
via GitHub
2026/05/14
Re: [PR] fix: Nested self-referential CASE chains should not cause exponential hashing work during physical planning. [datafusion]
via GitHub
2026/05/14
Re: [PR] fix: Nested self-referential CASE chains should not cause exponential hashing work during physical planning. [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: Support Spark Expression Decode [datafusion-comet]
via GitHub
2026/05/14
Re: [I] [DISCUSSION] Extending Partitioning to Support More Variants [datafusion]
via GitHub
2026/05/14
[PR] Update Rust toolchain to 1.95 [datafusion]
via GitHub
2026/05/14
Re: [PR] fix: Nested self-referential CASE chains should not cause exponential hashing work during physical planning. [datafusion]
via GitHub
2026/05/14
Re: [I] [DISCUSSION] Future of Dynamic Filters Sync [datafusion]
via GitHub
2026/05/14
Re: [I] Physical planning CPU blowup hashing nested CASE expressions [datafusion]
via GitHub
2026/05/14
Re: [PR] Brent/case hash fix [datafusion]
via GitHub
2026/05/14
[PR] fix: propagate inner-field metadata through make_array and array_agg [datafusion]
via GitHub
2026/05/14
Re: [I] Physical planning CPU blowup hashing nested CASE expressions [datafusion]
via GitHub
2026/05/14
Re: [I] Physical planning CPU blowup hashing nested CASE expressions [datafusion]
via GitHub
2026/05/14
[PR] ci: switch 8 workflows to ubuntu-slim [datafusion-comet]
via GitHub
2026/05/14
[PR] Brent/case hash fix [datafusion]
via GitHub
2026/05/14
[PR] refactor: merge comet-common module into comet-spark [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] feat: vendor-pluggable S3 credentials for native scans [datafusion-comet]
via GitHub
2026/05/14
[PR] feat: fix windows decimal casting frame [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/14
[PR] docs: add versioning policy [datafusion-comet]
via GitHub
2026/05/14
[I] Physical planning CPU blowup hashing nested CASE expressions [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: Optimize `translate` to use new bulk-NULL string builders [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: simplify HashJoinExec dynamic filter, drop CASE routing [datafusion]
via GitHub
2026/05/14
[PR] perf: bypass values.value(i) for inline strings in ArrowBytesViewMap [datafusion]
via GitHub
2026/05/14
Re: [I] feat: expose ConfigOptions.set/get as generic SessionContextBuilder.setOption / SessionContext.getOption [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat(builder): expose ConfigOptions.set/get as setOption / setOptions / getOption [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat(json): expose NdJsonReadOptions via registerJson and readJson [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat(json): expose NdJsonReadOptions via registerJson and readJson [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat(json): expose NdJsonReadOptions via registerJson and readJson [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat(json): expose NdJsonReadOptions via registerJson and readJson [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat(json): expose NdJsonReadOptions via registerJson and readJson [datafusion-java]
via GitHub
2026/05/14
Re: [PR] perf: Optimize `translate` to use new bulk-NULL string builders [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: Optimize `translate` to use new bulk-NULL string builders [datafusion]
via GitHub
2026/05/14
[PR] perf: Optimize `translate` to use new bulk-NULL string builders [datafusion]
via GitHub
2026/05/14
[PR] feat: detect Iceberg V2 writes and emit fall-back reasons [datafusion-comet]
via GitHub
2026/05/14
[I] Writes to Apache Iceberg Tables [datafusion-comet]
via GitHub
2026/05/14
[PR] feat(datetime): prototype JVM UDF path for Hour/Minute/Second (engine=java) [datafusion-comet]
via GitHub
2026/05/14
Re: [I] Optimize `translate` using `append_with` [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: Add support for Spark-compatible explode_outer function [datafusion]
via GitHub
2026/05/14
[I] Optimize `translate` using `append_with` [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: Support Spark Expression Decode [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] Add configurable UNION DISTINCT to FILTER rewrite optimization [datafusion]
via GitHub
2026/05/14
Re: [PR] Add configurable UNION DISTINCT to FILTER rewrite optimization [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: Add support for Spark-compatible explode_outer function [datafusion]
via GitHub
2026/05/14
Re: [PR] feat(parquet): row-group and row-range sampling on ParquetSource [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: Spark custom credential providers for native scans [datafusion-comet]
via GitHub
2026/05/14
[PR] feat(udf): account JVM-UDF Arrow allocations to the Spark task [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/14
Re: [PR] feat(dataframe): add executeStream(allocator) for incremental batch iteration [datafusion-java]
via GitHub
2026/05/14
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/14
Re: [PR] feat(builder): expose ConfigOptions.set/get as setOption / setOptions / getOption [datafusion-java]
via GitHub
2026/05/14
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/14
Re: [PR] feat(builder): expose ConfigOptions.set/get as setOption / setOptions / getOption [datafusion-java]
via GitHub
2026/05/14
[PR] Add support for logical and physical codecs [datafusion-python]
via GitHub
2026/05/14
Re: [PR] Replace ANY/ALL CASE planning with array_has/min/max desugaring [datafusion]
via GitHub
2026/05/14
Re: [PR] docs: show child links on Expression Compatibility page [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [I] Implement JVM UDFs for all date/time expressions [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] perf: reuse mask in `truncate_list_nulls` and avoid counting all true bits [datafusion]
via GitHub
2026/05/14
[PR] Make use of Swatinem/rust-cache to make the CI workflows faster [datafusion-ballista]
via GitHub
2026/05/14
[PR] docs: show child links on Expression Compatibility page [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/14
[I] AbstractMethodError: CometBroadcastExchangeExec missing sparkContext() from BroadcastExchangeLike [datafusion-comet]
via GitHub
2026/05/14
[I] Create Comet versioning policy [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] chore(deps): bump runs-on/action from 2.1.0 to 2.1.2 [datafusion]
via GitHub
2026/05/14
Re: [I] Docker build workflow takes a really long time on each PR [datafusion-ballista]
via GitHub
2026/05/14
Re: [PR] fix: REST API does not show running jobs [datafusion-ballista]
via GitHub
2026/05/14
Re: [PR] feat(parquet): row-group and row-range sampling on ParquetSource [datafusion]
via GitHub
2026/05/14
Re: [PR] Add configurable UNION DISTINCT to FILTER rewrite optimization [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub
2026/05/14
Re: [PR] Add rand() alias for random() [datafusion]
via GitHub
2026/05/14
Re: [I] [DISCUSSION] Extending Partitioning to Support More Variants [datafusion]
via GitHub
2026/05/14
Re: [I] [DISCUSSION] Extending Partitioning to Support More Variants [datafusion]
via GitHub
2026/05/14
Re: [I] Introduce `StringViewArrayBuilder::map` to avoid duplication [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: Add `append_with` to string builders, use in `replace` [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: reuse mask in `truncate_list_nulls` and avoid counting all true bits [datafusion]
via GitHub
2026/05/14
Re: [PR] Add lambda substrait support [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: reuse mask in `truncate_list_nulls` and avoid counting all true bits [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: reuse mask in `truncate_list_nulls` and avoid counting all true bits [datafusion]
via GitHub
2026/05/14
Re: [PR] Add lambda substrait support [datafusion]
via GitHub
2026/05/14
Re: [PR] test: add test that validate partial reduce with different number of state fields [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] perf: Add `append_with` to string builders, use in `replace` [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
[PR] Support DISTINCT ON with aggregation and windows [datafusion]
via GitHub
2026/05/14
Re: [PR] Allow pickling PyExpr [datafusion-python]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Add metrics to `FFI_ExecutionPlan` [datafusion]
via GitHub
2026/05/14
Re: [I] Expose `ExecutionPlan::metrics()` across the FFI boundary [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Track spill read-back memory in SMJ [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: rest api supports plan tree rendering [datafusion-ballista]
via GitHub
2026/05/14
[PR] [TUI] Add a config setting for rendering job stage's plan as a tree [datafusion-ballista]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Add lambda substrait support [datafusion]
via GitHub
2026/05/14
Re: [PR] Add lambda substrait support [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] minor: make HigherOrderSignature less error-prone [datafusion]
via GitHub
2026/05/14
Re: [D] DISCUSSION: Apache DataFusion New York Meetup May 2026 [datafusion]
via GitHub
2026/05/14
Re: [D] DISCUSSION: Apache DataFusion New York Meetup May 2026 [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
[D] DataFusion-Federation: Union Flattening Across Executors [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
Re: [PR] Call take arrays once per repartitioned input batch [datafusion]
via GitHub
2026/05/14
[PR] feat(dataframe): add executeStream(allocator) for incremental batch iteration [datafusion-java]
via GitHub
2026/05/14
Re: [PR] feat: eliminate GlobalLimitExec when input statistics prove limit is already satisfied [datafusion]
via GitHub
2026/05/14
[PR] fix: REST API does not show running jobs [datafusion-ballista]
via GitHub
2026/05/14
Re: [I] REST API does not show running jobs [datafusion-ballista]
via GitHub
2026/05/14
Re: [PR] fix: reject readBatch(0) in ArrowConstantColumnReader [datafusion-comet]
via GitHub
2026/05/14
Re: [PR] feat: Enable expressions as default value in lead/lag function [datafusion]
via GitHub
2026/05/14
[I] feat(dataframe): add executeStream(allocator) for incremental batch iteration [datafusion-java]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
[I] CREATE TABLE AS not checking column unicity [datafusion]
via GitHub
2026/05/14
Re: [I] CREATE TABLE AS not checking column unicity [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
[PR] Refactor Spark `format_string` numeric `%c` conversion dispatch [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
[PR] fix: reduce memory allocation overhead during partial aggregation ear… [datafusion]
via GitHub
2026/05/14
[I] Extra memory allocated during partial aggregation early emit during OOM handling [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] Minor: Disallow async function in lambdas [datafusion]
via GitHub
2026/05/14
[I] Refactor: Centralize numeric `%c` formatting dispatch in format_string.rs [datafusion]
via GitHub
2026/05/14
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/05/14
Re: [PR] bench: run array_replace kernels in benchmark [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] feat: eliminate GlobalLimitExec when input statistics prove limit is already satisfied [datafusion]
via GitHub
2026/05/14
Re: [PR] Track spill read-back memory in SMJ [datafusion]
via GitHub
2026/05/14
Re: [PR] Preserve recursive CTE nullability across logical and physical planning [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
[PR] Add blog: Sort Pushdown in DataFusion: Skip Sorts, Skip I/O [datafusion-site]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] [PoC] perf: optimize group-only group-by case for primitive cases (clickbench q4) [datafusion]
via GitHub
2026/05/14
Re: [PR] chore(deps): bump pytest from 9.0.2 to 9.0.3 in /python [datafusion-ballista]
via GitHub
2026/05/14
Re: [PR] bench: run array_replace kernels in benchmark [datafusion]
via GitHub
2026/05/14
Re: [PR] bench: run array_replace kernels in benchmark [datafusion]
via GitHub
2026/05/13
[PR] feat(builder): expose ConfigOptions.set/get as setOption / setOptions / getOption [datafusion-java]
via GitHub
2026/05/13
Re: [PR] bench: run array_replace kernels in benchmark [datafusion]
via GitHub
2026/05/13
Re: [PR] bench: run array_replace kernels in benchmark [datafusion]
via GitHub
2026/05/13
Re: [PR] Preserve recursive CTE nullability across logical and physical planning [datafusion]
via GitHub
2026/05/13
Re: [PR] Preserve recursive CTE nullability across logical and physical planning [datafusion]
via GitHub
2026/05/13
Re: [PR] minor: make HigherOrderSignature less error-prone [datafusion]
via GitHub
2026/05/13
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/05/13
Re: [PR] Replace ANY/ALL CASE planning with array_has/min/max desugaring [datafusion]
via GitHub
2026/05/13
Re: [PR] Replace ANY/ALL CASE planning with array_has/min/max desugaring [datafusion]
via GitHub
2026/05/13
Re: [PR] Preserve recursive CTE nullability across logical and physical planning [datafusion]
via GitHub
2026/05/13
Re: [PR] ClickHouse: Support scalar expressions in WITH clause [datafusion-sqlparser-rs]
via GitHub
2026/05/13
Re: [PR] Preserve recursive CTE nullability across logical and physical planning [datafusion]
via GitHub
2026/05/13
Re: [PR] [Experiment] Adaptive filter pushdown [datafusion]
via GitHub
2026/05/13
Re: [PR] [Experiment] Adaptive filter pushdown [datafusion]
via GitHub
2026/05/13
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/05/13
Re: [PR] perf: reuse mask in `truncate_list_nulls` and avoid counting all true bits [datafusion]
via GitHub
2026/05/13
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/05/13
Re: [PR] Compact more aggressively in TopK based upon memory usage [datafusion]
via GitHub
2026/05/13
Re: [PR] Refactor scalar min/max dispatch into function-based helpers [datafusion]
via GitHub
2026/05/13
Re: [PR] [Experiment] Adaptive filter pushdown [datafusion]
via GitHub
2026/05/13
Re: [PR] [Experiment] Adaptive filter pushdown [datafusion]
via GitHub
2026/05/13
Re: [PR] [Experiment] Adaptive filter pushdown [datafusion]
via GitHub